Data Engineer II, Content Understanding

Spotify
New York, NY
$122.7K-$175.3K a year
Permanent

As a Software Engineer in our Content Understanding teams, you will help define and build ML deployed at scale in support of a broad range of use cases driving value in media and catalog understanding.

We are looking for engineers who are very enthusiastic about data to focus on building structured, high-quality data solutions.

These solutions will be used to evolve our products, bringing better experiences to our users and the global artist community alike.

We process petabytes of data using tools such as BigQuery, Dataflow and Pub/Sub. When needed, we also develop our own data tooling, such as Scio, a Scala API for Apache Beam, and Luigi, a Python framework for scheduling.

What You'll Do

  • Build large-scale batch and real-time data pipelines with data processing frameworks such as Scio and Spark on Google Cloud Platform
  • Leverage best practices in continuous integration and delivery
  • Help drive optimisation, testing and tooling to improve data quality
  • Collaborate with other Software Engineers, ML Engineers, Data Scientists and other stakeholders, taking on the learning and leadership opportunities that arise every day
  • Create and maintain metrics datasets as well as dashboards that power data-driven decisions
  • Work in an agile team to continuously experiment, iterate and deliver on new product objectives
  • Work on machine learning projects powering the experience that suits each user individually

Who You Are

  • You have professional data engineering experience and you know how to work with high volume, heterogeneous data, preferably with distributed systems such as Hadoop, BigTable, Cassandra, GCP, AWS or Azure
  • You know the Scala language well and are interested in spreading this knowledge within the team
  • You have experience with one or more higher-level JVM-based data processing frameworks such as Beam, Dataflow, Crunch, Scalding, Storm, Spark or Flink
  • You might have worked with Docker as well as Luigi, Airflow, or similar tools
  • You are passionate about crafting clean code and have experience in coding and building data pipelines
  • You care about agile software processes, data-driven development, reliability, and responsible experimentation
  • You understand the value of collaboration and partnership within teams

Where You Will Be

For this role, you will be based in New York City, USA.

The United States base range for this position is $122,716 - $175,308, plus equity. The benefits available for this position include health insurance, six months of paid parental leave, a 401(k) retirement plan, a monthly meal allowance, 23 paid days off and 13 paid flexible holidays.

These ranges may be modified in the future.

22 days ago