AI Data Engineer Lead (Principal)

Grindr

San Francisco, CA

Full-time

This is a hybrid role based in our Palo Alto or San Francisco offices and will require you to be in office Tuesdays and Thursdays.

What’s so interesting about this role?

We at Grindr believe that AI can revolutionize the dating industry. Our Data Engineer lead is responsible for building high quality ML datasets at scale, used to train ML models that power AI-centric features.

In this pivotal role, you will have the opportunity to build foundational tools and data pipelines to ingest, normalize and clean the valuable data that would be fundamental for for our ML engineers in Grindr to build AI tools including recommendations, LLMs, ads, visual search, growth / notifications, trust and safety.

What’s the job?

We’re looking for an exceptional data engineer who is passionate about data for AI and values it can bring to Grindr, Who loves working with data ops at scale;

and who is committed to the hard work necessary to continuously improve our ML data pipelines.

In this position, you will be responsible for establishing and executing the strategy for our organization’s ML Data Engine, with an initial focus on agile ML Data OPs.

This includes identification of infrastructure components and data stack to be used, design and implementation of pipelines between data systems and teams, automation workflows, data enrichment and monitoring tools all for AI models.

As a tech lead specialized in data engineering, you are expected to code and contribute to the stack.

Responsibilities :

Dive into our dataset and design, implement and scale data pre / post processing pipelines of ML models
Work on applied ML solutions in the areas of data mining, cleaning, normalizing and modeling
Be self-motivated in seeking solutions when the correct path isn’t always known
Collaborate with engineers in conceptualizing, planning and implementing data engineering initiatives working with different stakeholders
Design and build data platforms & frameworks for processing high volumes of data, in real time as well as batch, that will be used across engineering teams
Build data processing streams for cleaning and modeling text data for LLMs
Research and evaluate new technologies in the big data space to guide our continuous improvement
Collaborate with multi-functional teams to help tune the performance of large data applications
Work with Privacy and Security team on data governance, risk and compliance initiatives
Work on initiatives to ensure stability, performance and reliability of our data infrastructure

What we’ll love about you

Bachelors in Computer Science, Mathematics, Physics, or a related fields
5+ years of experience as a data engineer building production-level pre / post-processing data pipelines for ML / DL models, including 2+ years of technical leadership experience
Experience in statistical analysis & visualization on datasets using Pandas or R
Experience designing and building highly available, distributed systems of data extraction, ingestion, normalization and processing of large data sets in real time as well as batch, that will be used across engineering teams using orchestration frameworks like Airflow, KubeFlow or other pipeline tools
Demonstrated prior experience in creating data pipelines for text data sets NLP / large language models
Ability to produce well-engineered software, including appropriate automated test suites, technical documentation, and operational strategy
Excellent coding skills in Python, Java, bash, SQL, and expertise with Git version control
Experience using big data technologies (Snowflake, Airflow, Kubernetes, Docker, Helm, Spark, pySpark)
Experience with any public cloud environment - AWS, GCP or Azure
Significant experience with relational databases and query authoring (SQL) as well as NoSQL databases like DynamoDB etc
Experience building and maintaining ETL (managing high-quality reliable ETL pipelines)

We’ll really swoon if you have

2+ years of experience of technical leadership in building data engineering pipelines for AI
Previous experience in building data pipeline for conversational AI APIs and recommender systems
Experience with distributed systems and microservices
Experience with Kubernetes and building Docker images
Experience with building stream-processing systems, using solutions such as Kafka, Storm or Spark-Streaming
Strong understanding of applied machine learning topics
Be familiar with legal compliance (with data management tools) data classification, and retention
Consistent track record of managing and implementing complex data projects

What you'll love about us

Mission and Impact : Grindr is the world-leading LGBTQ social networking service. Your role will impact the lives of millions of LGBTQ people around the world
Multiple Locations : We are hiring someone for this role to be based ideally in San Francisco or Palo Alto
Family Insurance : Insurance premium coverage for health, dental, and vision for you and partial coverage for your dependents
Retirement Savings : Generous 401K plan with 6% match and immediate vest in the US
Compensation : Industry-competitive compensation and eligibility for company bonus and equity programs
Queer-Inclusive Benefits : Industry-leading gender-affirming offerings with up to 90% cost coverage, access to Included Health, monthly stipends for HRT, and more
Additional Benefits : Flexible vacation policy, monthly stipends for cell phone, internet, wellness, and food, one-time home-office setup stipend, and company-sponsored events

30+ days ago

Related jobs

Promoted

AI/ML Data Engineering Lead (Specialist Master) - SFL Scientific

Deloitte

San Francisco, California

We are hiring an AI/ML Data Engineering Lead to support the design, development, and deployment of novel AI solutions across healthcare, life sciences, manufacturing, consumer, energy and other sectors. Leverage advanced technical skills in modern data architecture, data science engineering, data tr...

Data Engineer (Senior, Lead, Principal)

Salesforce

San Francisco, California

Previous projects should display technical leadership with an emphasis on data lake, data warehouse solutions, business intelligence, big data analytics, enterprise-scale custom data products. Note: By applying to the Data Engineer posting, recruiters and hiring managers across the organization hiri...

Lead Data Engineer, Data and Systems

Harnham

CA, United States

LEAD DATA ENGINEER, DATA AND SYSTEMS. I have partnered with an innovative healthcare start-up looking for a Lead Data Engineer to join their growing team! They are committed to developing novel inventions that target the human microbiome to improve patients' lives. Manage and optimize data warehouse...

Principal Data Engineer

hims & hers

San Francisco, California

We're looking for an experienced Principal Data Engineer to join our Data Platform Engineering team. Establish guidelines, controls, and processes to make data available for developing scalable data-driven solutions for Analytics and AI. Implement and maintain data governance practices to ensure com...

Lead Engineer, AI and Machine Learning

Sephora USA, Inc

San Francisco, California

As a Lead Machine Learning Engineer, you will operationalize innovative ML solutions and work alongside other team members like product manager, Enterprise architect, Principal ML Engineer, Data scientists and business to architect, design, build ML pipeline and productionalize ML models. This role ...

Lead Software Engineer, AI Platform

Karkidi

San Francisco, California

Humane’s AI Platform team connects information about the world around you to the cloud’s power of inference and computing. Integrate with third party machine learning APIs and train your own models. Drive new features from prototype to completion by collaborating closely with designers and infrastru...

Principal Digital Full Custom ASIC Design Engineer-AI Tensor Development

Sunlune

CA, United States

This exceptional triad not only fuels our current achievements but also positions us to pioneer the development and mass production of AI ASICs, a vital contribution to the burgeoning expanse of the AI domain. AI Tensor Development Engineer, full-time based in the Bay Area. Focus on the research of ...

Lead Real-Time Data Engineer

Unreal Gigs

San Francisco, California

Remote

As the Lead Real-Time Data Engineer, you'll lead our initiatives in designing, building, and optimizing real-time data infrastructure. If you're a seasoned engineer with expertise in real-time data technologies and a proven track record of leadership in delivering successful projects, we invite you ...

Software Engineer, Uber Tech Lead, Human Data

OpenAI

San Francisco, California

We capture data from a variety of sources to train our models, across multiple different modalities (for example ) and to train AI. Have past experience as a technical lead of a larger organization (Uber TL), serving as the technical lead for a large team (20 - 40+ engineers) or multiple engineering...

Lead Data Analytics Engineer

Maximus

San Francisco, California

Analyze requirements and apply architectural and engineering concepts to develop solutions that meet business needs while maintaining sustainability objectives, including: scalability, maintainability, security, reliability, extensibility, flexibility, availability, and manageability. Analyze requir...

AI Data Engineer Lead (Principal)

AI/ML Data Engineering Lead (Specialist Master) - SFL Scientific

Data Engineer (Senior, Lead, Principal)

Lead Data Engineer, Data and Systems

Principal Data Engineer

Lead Engineer, AI and Machine Learning

Lead Software Engineer, AI Platform

Principal Digital Full Custom ASIC Design Engineer-AI Tensor Development

Lead Real-Time Data Engineer

Software Engineer, Uber Tech Lead, Human Data

Lead Data Analytics Engineer

Related searches