Data Engineer

Houston, TX

Industry: Data Analyst

Job Number: 4704

What is the position:

The Data Engineer will be responsible for optimizing and expanding the data architecture, as well as optimizing data flow and collection.

What will you do:
  • Support developers, data architects, and data analysts on data initiatives
  • Ensure optimal data delivery architecture is consistent throughout ongoing projects
  • Develop, construct, test, and maintain data architectures/data pipelines
  • Develop data set processes for data modeling, mining, and production
  • Ensure data architecture will support the requirements of the business
  • Discover opportunities for data acquisition
  • Leverage data from internal/external sources to answer business demands
  • Utilize a variety of languages/tools to merge systems together
  • Utilize analytics programs, machine learning, and statistical methods to prepare data for use in predictive and prescriptive modeling
  • Explore and examine data to find hidden patterns
  • Automate the common/repeatable data preparation and integration tasks in order to minimize manual and error-prone processes and improve productivity
  • Make recommendations to improve data reliability, efficiency, and quality
  • Educate business unit leaders in leveraging data and analytics capabilities to achieve their business goals

What are the requirements:
  • Bachelor’s degree in CS, Statistics, Information Systems, or a related field
  • 5+ years of experience with data mining, integration, modeling, and/or optimization
  • Experience in the Oil & Gas Industry preferred
  • Strong experience with advanced analytics tools for object-oriented scripting, using languages such as R, Python, Java, C++, Scala, etc.
  • Strong experience with database programming languages (e.g., SQL, PL/SQL, MongoDB, Cassandra, etc.)
  • Strong experience with building and optimizing data pipelines, pipeline architectures, and integrated datasets using traditional data integration technologies
  • Strong experience in working with data science teams in refining and optimizing data science and machine learning models and algorithms
  • Knowledge and/or experience with SQL-on-Hadoop tools (e.g., Hive, Impala, Presto, Hortonworks DataFlow (HDF), Dremio, Informatica, Talend, etc.)
  • Experience working with both open-source and commercial message queuing technologies (e.g., Kafka, JMS, Azure Service Bus, Amazon Simple Queue Service, etc.)
  • Experience working with stream data integration technologies (e.g., Apache NiFi, Apache Beam, Apache Kafka Streams, Amazon Kinesis, etc.)
  • Experience working with data discovery, analytics, and BI software tools (e.g., Tableau, Qlik, Power BI)
  • Ability to design, build, and manage data pipelines for data structures encompassing data transformation, data models, schemas, metadata and workload management
  • Ability to work with both IT and business in integrating analytics and data science output into business processes and workflows
  • Ability to work across multiple deployment environments, including cloud, on-premises, and hybrid, and across multiple operating systems
  • Excellent communication skills

You would be really happy working here if:
  • Roadblocks don’t intimidate you. You understand how to successfully evaluate problems and develop appropriate solutions.
  • You can be counted on in crucial times, possessing great focus while completing projects successfully and efficiently.
