Search jobs > Houston, TX > Data science engineer

Lead Data Science and Analytics Engineer

NexTier Oilfield Solutions
Houston, TX, USA
Full-time

Lead Data Science and Analytics Engineer

Houston, TX, USA Req #1865 Wednesday, June 19, 2024 POSITION SUMMARY

The position is responsible for capturing requirements and performing data analysis in addition to providing data engineering expertise supporting the advancement of data driven solutions to improve operational efficiency and drive business value.

This hybrid role will be involved in the full project lifecycle which includes driving discovery and requirements definition, data related development, and operational support / administration as part of the data management team responsible for the cloud data platform in Google Cloud (GCP).

This position will work directly with project stakeholders and other teams on advanced analytics, BI, and data science initiatives. Key Accountabilities

  • Responsible for building and maintaining the GCP data environment and processes (BigQuery, Cloud SQL, Dataproc, BigTable, Stackdriver, etc).
  • Coordinate with business and IT partners to gather and define requirements and design data workflows.
  • Perform data analysis, drive data model design, define data mappings, and data validation.
  • Create and maintain documentation (requirements, design, data models, process diagrams, user stories, support, and training).
  • Work with data stewards to maintain the data catalog.
  • Work hand-in-hand with application development and the data science team to understand business needs and coordinate product changes.
  • Build prototypes for data collection, evaluation, and integration to assist with data discovery and analysis.
  • Build stream ingestion processes to efficiently send, process, analyze & publish data.
  • Build automated workflows to ingest and process batch data.
  • Manage workflows in support of both product and data science pipelines.
  • Build & deploy large-scale ETL, ELT, and stream processing pipelines in a serverless microservice infrastructure using industry standard technologies such as SQL, Python, Java, Kubernetes, Dataflow, Spark, etc.
  • Develop and support data integration using APIs, Pub / Sub, etc.
  • Perform analyses of large structured and unstructured data to solve multiple & complex business problems.
  • Provider operational support and build dashboards to monitor and maintain the GCP environment.
  • Develop processes to validate, report, and address data quality.
  • Responsible for setting up auditing and logging to support the data platform.
  • Develop frameworks and utilities as needed to monitor / support the data platform and facilitate user access to data.

Required Knowledge, Skills, and Abilities

  • Excellent verbal and written skills with great attention to detail.
  • Excellent documentation skills.
  • Ability to multitask and prioritize effectively for multiple projects.
  • Capability to efficiently complete tasks in a fast-paced environment.
  • Ability to form effective working relationships and collaborate with different teams.
  • Ability to troubleshoot complex and ambiguous environments in order to find and solve problems.
  • Experience with on-premise, private and / or hybrid cloud solutions.
  • Proven experience administering, monitoring, and supporting production systems and applications.
  • Proven experience with RDBMS such as PostgreSQL, SQL Server and no-SQL DBs such as HBase, BigTable.
  • Experience writing production and modular code following coding best practices with Python, Java, Spark, Scala, or related required.
  • Experience building and using RESTful APIs.
  • Experience with Pub / Sub and Dataflow or Kafka and Spark Streaming.
  • Experience with IoT, time-series, or machine generated data is a plus.
  • Experience with data processing platforms such as Spark, Hadoop, Hive, Sqoop, Airflow, Google Cloud Platform (Dataproc, Dataflow, BigQuery, Compute Engine, Data Fusion), AWS (EMR, Kinesis, Lambda, Glue), Azure (HDInsight, Data Factory).
  • Experience supporting business intelligence or large-scale data warehouses / lakes using tech such as Hadoop, Kafka, IBM Cognos, Microsoft, MicroStrategy, Oracle, Snowflake, Tableau, Power BI, Teradata and / or similar tech, Unifi Software Data Catalog is a plus.
  • Working knowledge of Linux required, RHEL / Centos preferred.
  • Experience with container technologies (Docker and Kubernetes / GKE).
  • Familiar with Agile methodologies and related methods.
  • Experience with oil and gas software systems and data formats is preferred.

Minimum Required Education

Bachelor's Degree - Required

Minimum Required Work Experience

  • More than 10 yearsSoftware Engineering - Required
  • 2-3 yearsBusiness Analyst or Project Manager - Required

Other details

  • Job Family Digital
  • Job Function Digital
  • Pay Type Salary
  • Employment Indicator NexTier
  • Houston, TX, USA

Share this job :

9 days ago
Related jobs
Promoted
NexTier Bank
Houston, Texas

The Lead Data Science and Analytics Engineer is responsible for leading our team in delivering actionable insights and innovative data science and analytics solutions. Lead, mentor, and develop a team of data scientists, analysts, and data engineers. They will be responsible for overseeing data scie...

Promoted
LivaNova
Houston, Texas

The Head of Data Management and Analytics needs to have a broad understanding of the full range of strategic data and analytics capabilities, and the ability to communicate these concepts, methods and techniques in ways easily understood by other stakeholders:. Build partnerships with executive lead...

Promoted
VirtualVocations
Pasadena, Texas
Remote

A company is looking for a Data Engineer for Enterprise Analytics. ...

Promoted
ECF Data, LLC
Houston, Texas

The Azure Engineer will provide the engineering and second level of support for all Azure platform and related services. Architecture and engineering of Microsoft Azure, providing guidance and deep technical knowledge on various Azure architecture elements deployed in an Enterprise environment. Stro...

Promoted
JP Morgan Chase & Co.
Houston, Texas

As a Lead Software Engineer at JPMorgan Chase within the Corporate Sector, Data Management, you are an integral part of an agile team that works to enhance, build, and deliver trusted market-leading technology products in a secure, stable, and scalable way. Leads communities of practice across Softw...

Promoted
Konecranes Nuclear Equip and Services LLC
Houston, Texas

Responsible for creating the Skeleton Structure in Teamcenter, 2-3 day lead time after receipt of Order Acknowledgement, and distributes Excel version to the MMP and PM. We welcome different backgrounds and skills that enrich our community and we promote a place where we can ALL be ourselves. Lead M...

Promoted
NASA
Houston, Texas

Computer Science that included 30 semester hours or 45 quarter hours of course work in any combination of mathematics, statistics and computer science with at least half of those hours in mathematics and statistics courses that included differential and integral calculus; and that provided an in-dep...

Berkley
Houston, Texas

We are in turn committed to delivering innovative products and exceptional service to them, our valued agents and brokers, Berkley Oil & Gas is dedicated in its efforts to be well-informed of the changing dynamics of the industry; support industry efforts to minimize and mitigate risks and hazards i...

Repsol
Houston, Texas

Repsol Renewables is seeking an Analyst/Associate, Transmission and Market Analytics to join our dynamic and growing team. Gather the data and intelligence from Interconnection queues and utility IRPs for developing scenarios for production cost models. Improve data management processes, process aut...

KBR
Houston, Texas

Experience required with piping and instrumentation diagrams (P&IDs) development as well as experience generating fluid hydraulic calculations for line sizing and formulation of process data for in-line instruments and loads for rotating equipment such as pumps, compressors, and turbines. Leading a ...