Search jobs > Seattle, WA > Principal data scientist

Principal Data Scientist - Emerging ML

Capital One
Seattle, Washington, US
Full-time

Principal Data Scientist - Emerging ML

Check all associated application documentation thoroughly before clicking on the apply button at the bottom of this description.

Data is at the center of everything we do. As a startup, we disrupted the credit card industry by individually personalizing every credit card offer using statistical modeling and the relational database, cutting edge technology in 1988! Fast-forward a few years, and this little innovation and our passion for data has skyrocketed us to a Fortune 200 company and a leader in the world of data-driven decision-making.

As a Data Scientist at Capital One, you'll be part of a team that's leading the next wave of disruption at a whole new scale, using the latest in computing and machine learning technologies and operating across billions of customer records to unlock the big opportunities that help everyday people save money, time and agony in their financial lives.

Team Description

Emerging ML is the data science and machine learning team inside Capital One's Applied Research organization. We focus on research and development of new technologies within the domain of Artificial Intelligence with a focus on Embeddings and Foundation Models.

We partner closely with our product and engineering teams to connect emerging technologies with business critical use cases across Capital One's lines of business.

As part of Emerging ML, you will work on things like :

  • Conducting research into self supervised learning, transformer models, and representation learning
  • Building customer behavioral models (using transaction, clickstream, and other data) that identify trends, patterns, and relationships related to product usage
  • Refining integration patterns for encoder and decoder models for downstream use cases to connect Applied Research products and business use cases

Role Description

This is an individual contributor position. In Emerging ML, you will work at all phases of the data science lifecycle, including :

  • Build machine learning models through all phases of development, from design through training, evaluation and validation, and partner with engineering teams to operationalize them in scalable and resilient production systems that serve 50+ million customers.
  • Partner closely with a variety of business and product teams across Capital One to conduct the experiments that guide improvements to customer experiences and business outcomes in domains like marketing, servicing and fraud prevention.
  • Write software (Python, Scala, e.g.) to collect, explore, visualize and analyze numerical and textual data (billions of customer transactions, clicks, payments, etc.

using tools like Spark and AWS.

The Ideal candidate will be :

Curious and creative. You thrive on bringing definition to big, undefined problems. You love asking questions, and you love pushing hard to find the answers.

You're not afraid to share a new idea. You communicate clearly and effectively to share your findings with non-technical audiences.

Technical : You have hands-on experience developing data science solutions from concept to production using open source tools and modern cloud computing platforms.

You are not afraid of petabytes of data.

Statistically-minded. You have built models, validated them and backtested them. You know how to interpret a confusion matrix or a ROC curve.

You have experience with clustering, classification, sentiment analysis, time series analysis and deep learning.

Customer and product oriented. You share our passion for changing banking for good.

Basic Qualifications

  • Currently has, or is in the process of obtaining a Bachelor's Degree plus 5 years of experience in data analytics, or currently has, or is in the process of obtaining a Master's Degree plus 3 years in data analytics, or currently has, or is in the process of obtaining PhD, with an expectation that required degree will be obtained on or before the scheduled start date
  • At least 1 year of experience in open source programming languages for large scale data analysis
  • At least 1 year of experience with machine learning
  • At least 1 year of experience with relational databases

Preferred Qualifications :

  • Masters in "STEM" field (Science, Technology, Engineering, or Mathematics) plus 3 years of experience in data analytics
  • Experience building transformer models at scale (>

100M parameters)

  • Understanding of self-supervised learning methods
  • Strong foundation in software engineering
  • At least 1 year of experience working with AWS
  • At least 2 years' experience in Python, Scala, or R for large scale data analysis
  • At least 2 years' experience with machine learning
  • At least 2 years' experience with SQL

J-18808-Ljbffr

7 days ago
Related jobs
Promoted
Apple
Seattle, Washington

Do you get excited by driving product impact via measurement and evaluation, for products and services used by hundreds of millions of people globally? The vision for the AIML Data organization is to improve products by using data as the voice of our customers. Proficiency in data science, machine l...

Promoted
Shelf Engine
Seattle, Washington

Senior Data Scientist (ML/Ops). Shelf Engine is searching for a talented Senior Data Scientist to report directly to our Head of Data Science. In this role, you will be responsible for designing, developing, and maintaining the data pipelines and engineering infrastructure that support data science ...

Promoted
Apple, Inc.
Seattle, Washington

Experience collecting and analyzing crowd-sourced data, language data, image data, and/or multi-modal data. Research and develop Responsible AI evaluation methods to improve the quality of Apple Intelligence's user facing products - Create evaluation data sets to solve difficult, non-routine analysi...

Promoted
Amazon
Bellevue, Washington

We are looking for an experienced Data Scientist with a strong machine learning background to lead the development of frameworks that deepen understanding of engagement at Twitch. Lead the model development process to implement production algorithms, including exploratory data analysis, data modelin...

Promoted
Microsoft
Redmond, Washington

Define and communicate a vision for an extensive data scientist effort directly managing data scientists and directly coordinating with many cross teams. We are seeking a Principal Data & Applied Scientist Manager with a passion for business impact and innovation. Develop a long-term roadmap to ...

Promoted
Amazon
Seattle, Washington

Principal Data Scientist, Prime Video - Discovery Science. The Principal Data Scientist (DS) on this team is a technical expert and strategic thought leader responsible for tackling highly complex and ambiguous problems. As a strategic Principal Scientist, you will interact frequently with senior le...

Promoted
Holy Technologies
Seattle, Washington

Utilize diverse data sources to create models predicting revenue expansion and churn, while identifying key events in the customer lifecycle. Innovate in data science and machine learning, providing thought leadership around generative AI. Collaborate with engineering to build reliable, scalable, an...

Pacific Northwest National Laboratory
Seattle, Washington

Applies knowledge of statistics, machine learning, advanced mathematics, software development, and data modeling to integrate and clean data, recognize patterns, address uncertainty, pose questions, and make discoveries from structured and/or unstructured data. Produce solutions driven by artificial...

Apple
Seattle, Washington

Research and develop Responsible AI evaluation methods to improve the quality of Apple Intelligence's user facing products - Create evaluation data sets to solve difficult, non-routine analysis problems; applying sophisticated analytical methods as needed - Conduct analyses that includes data gather...

Starbucks
Seattle, Washington
Remote

Now Brewing – principal data scientist [Seattle, WA – U. As a principal data scientist, you will…. Mastery and comprehensive proficiency across most Data ETL (Teradata, Oracle, SQL, Python, Java, Ruby, Pig). This role will serve as a technical and strategic advisor to a broader data science team res...