See more Data Science jobs →

← Back to all jobs

Staff Data Engineer

Posted

General Electric
Headquarters: Cincinnati, OH
http://jobs.gecareers.com/ShowJob/Id/40052/Staff%20Data%20Engineer

About Us:
GE is the world's Digital Industrial Company, transforming industry with software-defined machines and solutions that are connected, responsive and predictive. Through our people, leadership development, services, technology and scale, GE delivers better outcomes for global customers by speaking the language of industry.
GE offers a great work environment, professional development, challenging careers, and competitive compensation. GE is an Equal Opportunity Employer . Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.

Role Summary:
The Data Engineering team helps solve our customers' toughest challenges; making flights safer, power cheaper, and oil & gas production safer for people and the environment by leveraging data and analytics. The Lead Data Engineer will work with the team to create state-of-the-art data and analytics driven solutions, working across GE to drive business analytics to a new level of predictive analytics while leveraging big data tools and technologies.

Essential Responsibilities:
As a Staff Data Engineer, you will be part of a data engineering or cross-disciplinary team on commercially-facing development projects, typically involving large, complex data sets. These teams typically include statisticians, computer scientists, software developers, engineers, product managers, and end users, working in concert with partners in GE business units. Potential application areas include remote monitoring and diagnostics across infrastructure and industrial sectors, financial portfolio risk assessment, and operations optimization.

In this role you will:
Be Proficient in at least one data modality (e.g., text, numeric, image), conversant in more than one. Proficient in multiple ETL /MDM tools.

Implement Data warehouse & Big/Small data designs with automated MDM and data quality capabilities.

Demonstrate proficiency at industry standard data modeling tools (e.g., ERWin, ER Studio, etc.).

Integrate domain data knowledge into development of data requirements.

Look across multiple systems, understands the purpose of each system and defines data requirements by systems.

Identify downstream implications of data loads/migration (e.g., data quality, regulatory, etc).

Qualifications/Requirements:
Basic Requirements:
Bachelor's Degree in Computer Science, Information Technology or equivalent (STEM) with minimum 6 years of experience as data engineer.

A minimum of 3 year of experience using Hadoop ecosystem, Map-Reduce, Spark, NoSQL (HBase, MongoDB etc.), Cassandra is required

A minimum of 3 year of experience using Scripting (Pig, Python, Perl, etc.) is required

A minimum of 3 years of experience with Core Java, Java WebServices development SOAP, REST APIs

A minimum of 2 year of experience working on Database(s), SQL is required

Eligibility Requirements:
Legal authorization to work in the U.S. is required. We will not sponsor individuals for employment visas, now or in the future, for this job

Must be willing to travel up to 10%

Must be willing to work out of an office located in Cincinnati, Ohio

Desired Characteristics:
Technical Expertise:
Identification of data sources/flows and determination of respective target schemas within data lake based on data domains and the data analytics/reporting requirements.

Data analysis and profiling for the ingested data and design the integration and distribution layers’ conceptual data models.

Creation of Logical and Physical data models based on data definitions and the results of data analysis.

Creation of Data Mapping Specifications (DMS) with BUS matrix for standard dimensions, subject area descriptions and business processes associated with each model. Physicalizing DMS using meta data structures defined within the data lake for data lineage.

Deriving Data Quality (DQ) rules and design the mapping rule tables and/or exception tables with appropriate logging mechanisms.

Determination of data load frequencies and design the ELT job references on the job control meta data structures.

Analyzing ELT scripts for data load performance and recommend optimization techniques to improve the performance and scalability of the solutions.

Big data eco system expertise

Domain Knowledge:
Expertise in ERP and finance modules including AP, AR, GL, Cash, Order Management, Accounting etc; Oracle and SAP functional knowledge is preferred

#DTR

Locations: United States; Ohio; Cincinnati

GE offers a great work environment, professional development, challenging careers, and competitive compensation. GE is an Equal Opportunity Employer . Employment decisions are made without regard to race, color, religion, national or ethnic origin, sex, sexual orientation, gender identity or expression, age, disability, protected veteran status or other characteristics protected by law.

GE will only employ those who are legally authorized to work in the United States for this opening. Any offer of employment is conditional upon the successful completion​ of a background investigation and drug screen.