Lead Data Engineer - Data Lake

S&P Global
Long Island City, New York, US
Full-time

About the Role :

Find out exactly what skills, experience, and qualifications you will need to succeed in this role before applying below.

The Team :

The Data Lake team is responsible for data ingestion from internal source systems in batch / real-time modes, curation and governance of the data assets created in the platform.

The team also works towards adopting the new features of the Databricks product and optimizing the operational aspects of the platform to enhance the user community experience.

The team has a broad and expert knowledge on Ratings organization’s critical data domains, technology stacks and architectural patterns, fosters knowledge sharing and collaboration that results in a unified strategy.

Responsibilities and Impact :

  • Design & Build data pipelines with an emphasis on scale, performance and reliability.
  • Provide technical expertise in the areas of design and implementation of Data Lake solution powered by Databricks on AWS cloud.
  • Ensure data governance principles adopted, data quality checks and data lineage implemented in each hop of the data.
  • Partner with the data teams, enterprise architecture organization to ensure best use of standards for the key data domains and use cases.
  • Continuous learner with an eye on emerging trends around data lake architecture and enterprise data solutions.
  • Ensure compliance through the adoption of enterprise standards and promotion of best practice / guiding principles aligned with organization standards.

What We’re Looking For :

Basic Required Qualifications :

  • Bachelors or Masters degree in Computer Science or Information Technology.
  • 8+ years of experience building solutions in big data technologies.
  • Strong experience programming with more than one of Java , Scala , Python , and large-scale data analytics tools.
  • Hands-on experience designing and building streaming data pipelines using data stream processing tools , Confluent, etc.
  • Strongly prefer experience building Data Lake & Data warehouse solutions using ETL, ELT pipelines on Databricks, Snowflake, Azure Data Lake, etc.
  • Strong understanding of database and analytical technologies in the industry including MPP and NoSQL databases.
  • Strong understanding of cloud platforms like AWS, Azure, or GCP, and their services (e.g., EC2, S3, AKS, EKS, etc.). AWS or any public cloud certification is a must.

Additional Preferred Qualifications :

  • Experience in continuous delivery through CI / CD pipelines, containers and orchestration technologies.
  • Experience with Machine Learning Libraries and Frameworks (TensorFlow, MLlib) is an added advantage.
  • Expert knowledge of Agile approaches to software development and able to put key Agile principles into practice to deliver solutions incrementally.
  • Monitors industry trends and directions; develops and presents substantive technical recommendations to senior management.
  • Excellent analytical thinking, interpersonal, oral, and written communication skills with strong ability to influence both IT and business partners.
  • Ability to prioritize and manage work to critical project timelines in a fast-paced environment.
  • Financial services industry experience is an added advantage.
  • Databricks experience or certifications is an added advantage.

J-18808-Ljbffr

2 days ago
Related jobs
Promoted
Hispanic Technology Executive Council
New York, New York

Lead Marketing Cloud data modeling and architecture including data extension modeling and cross-product data architecture & mapping. Conduct data profiling of existing marketing dataset to define data governance and standardization rules. In fact, we are the industry-leader for building Salesforce s...

Promoted
VirtualVocations
New York, New York

A company is looking for a Data Conversion and Migration Lead to provide data migration support services for a financial system modernization project. ...

JPMorgan Chase & Co.
New York, New York

Be responsible for ingesting data into our data lake and providing frameworks and services for operating on that data including the use of Spark. As a Lead Software Engineer at JPMorgan Chase within the Consumer and Community Banking in Wealth management space, you are an integral part of an agile t...

ITE MGMT
New York, New York

We are seeking a talented Business Intelligence and Data Analytics Manager to lead our team of data analysts and data engineers to drive improved data processes and generate new insights from our data. Team Leadership: Lead a team of business intelligence and data analysts, and data engineers, provi...

Hispanic Technology Executive Council
Queens, New York

Lead work streams focusing on a broad range of Data initiatives including data requirements, issues management, Data Quality rules, lineage, and taxonomy. The Data Integration Sr Lead Analyst is responsible for overseeing daily activities that support the execution, enablement and adoption of the Da...

05218 Citigroup Global Markets Inc.
New York, New York

Develop and implement a robust ownership and governance framework for MRD oversight, specifically within the Party/Entity domain, including the application of Party/Entity data in regulatory capital and liquidity processes including impact on applicable regulatory reports. Own the BAU monitoring of ...

Google
Queens, New York

We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing ever...

Data Privacy
Queens, New York

You will also serve as a data and analytics solution architect, leading architecture initiatives encompassing data warehousing, data pipeline development, data integrations, and data modeling. Experience developing standards for database design and implementation of various strategic data architectu...

Capital One
New York, New York

We are seeking Data Engineers who are passionate about marrying data with emerging technologies. Utilize programming languages like Java, Scala, Python and Open Source RDBMS and NoSQL databases and Cloud based data warehousing services such as Redshift and Snowflake. Center 1 (19052), United States ...

JCW
New York, New York

The ideal candidate will transform complex data into actionable insights, develop comprehensive reports, and drive data-driven decisions. They are seeking a Data Analytics & Reporting Specialist with expertise in Cognos, Tableau, Informatica, SQL, and Python. Design and implemention of data reports ...