About the Role :
Find out exactly what skills, experience, and qualifications you will need to succeed in this role before applying below.
The Team :
The Data Lake team is responsible for data ingestion from internal source systems in batch / real-time modes, curation and governance of the data assets created in the platform.
The team also works towards adopting the new features of the Databricks product and optimizing the operational aspects of the platform to enhance the user community experience.
The team has a broad and expert knowledge on Ratings organization’s critical data domains, technology stacks and architectural patterns, fosters knowledge sharing and collaboration that results in a unified strategy.
Responsibilities and Impact :
- Design & Build data pipelines with an emphasis on scale, performance and reliability.
- Provide technical expertise in the areas of design and implementation of Data Lake solution powered by Databricks on AWS cloud.
- Ensure data governance principles adopted, data quality checks and data lineage implemented in each hop of the data.
- Partner with the data teams, enterprise architecture organization to ensure best use of standards for the key data domains and use cases.
- Continuous learner with an eye on emerging trends around data lake architecture and enterprise data solutions.
- Ensure compliance through the adoption of enterprise standards and promotion of best practice / guiding principles aligned with organization standards.
What We’re Looking For :
Basic Required Qualifications :
- Bachelors or Masters degree in Computer Science or Information Technology.
- 8+ years of experience building solutions in big data technologies.
- Strong experience programming with more than one of Java , Scala , Python , and large-scale data analytics tools.
- Hands-on experience designing and building streaming data pipelines using data stream processing tools , Confluent, etc.
- Strongly prefer experience building Data Lake & Data warehouse solutions using ETL, ELT pipelines on Databricks, Snowflake, Azure Data Lake, etc.
- Strong understanding of database and analytical technologies in the industry including MPP and NoSQL databases.
- Strong understanding of cloud platforms like AWS, Azure, or GCP, and their services (e.g., EC2, S3, AKS, EKS, etc.). AWS or any public cloud certification is a must.
Additional Preferred Qualifications :
- Experience in continuous delivery through CI / CD pipelines, containers and orchestration technologies.
- Experience with Machine Learning Libraries and Frameworks (TensorFlow, MLlib) is an added advantage.
- Expert knowledge of Agile approaches to software development and able to put key Agile principles into practice to deliver solutions incrementally.
- Monitors industry trends and directions; develops and presents substantive technical recommendations to senior management.
- Excellent analytical thinking, interpersonal, oral, and written communication skills with strong ability to influence both IT and business partners.
- Ability to prioritize and manage work to critical project timelines in a fast-paced environment.
- Financial services industry experience is an added advantage.
- Databricks experience or certifications is an added advantage.
J-18808-Ljbffr