Search jobs > New York, NY > Machine learning engineer

Machine Learning Performance Engineer

Jane Street
New York, New York, US
Full-time

We are looking for an engineer with experience in low-level systems programming and optimization to join our growing ML team.

Machine learning is a critical pillar of Jane Street's global business. Our ever-evolving trading environment serves as a unique, rapid-feedback platform for ML experimentation, allowing us to incorporate new ideas with relatively little friction.

Your part here is optimizing the performance of our models both training and inference. We care about efficient large-scale training, low-latency inference in real-time systems, and high-throughput inference in research.

Part of this is improving straightforward CUDA, but the interesting part needs a whole-systems approach, including storage systems, networking, and host- and GPU-level considerations.

Zooming in, we also want to ensure our platform makes sense even at the lowest level is all that throughput actually goodput?

Does loading that vector from the L2 cache really take that long?

If you’ve never thought about a career in finance, you’re in good company. Many of us were in the same position before working here.

If you have a curious mind and a passion for solving interesting problems, we have a feeling you’ll fit right in.

There’s no fixed set of skills, but here are some of the things we’re looking for :

  • An understanding of modern ML techniques and toolsets
  • The experience and systems knowledge required to debug a training run’s performance end to end
  • Low-level GPU knowledge of PTX, SASS, warps, cooperative groups, Tensor Cores, and the memory hierarchy
  • Debugging and optimization experience using tools like CUDA GDB, NSight Systems, NSight Compute
  • Library knowledge of Triton, CUTLASS, CUB, Thrust, cuDNN, and cuBLAS
  • Intuition about the latency and throughput characteristics of CUDA graph launch, tensor core arithmetic, warp-level synchronization, and asynchronous memory loads
  • Background in Infiniband, RoCE, GPUDirect, PXN, rail optimization, and NVLink, and how to use these networking technologies to link up GPU clusters
  • An understanding of the collective algorithms supporting distributed GPU training in NCCL or MPI
  • An inventive approach and the willingness to ask hard questions about whether we're taking the right approaches and using the right tools

If you're a recruiting agency and want to partner with us, please reach out to .

30+ days ago
Related jobs
Promoted
ST LUKE LEARNING CENTER
Queens, New York

The Personalization team at Peloton is looking for a machine learning engineer to drive personalization and recommendations for our highly engaged members across multiple platforms. They will work closely with ML Engineers, Software Engineers, Product Managers and Product Analysts to test ideas that...

Promoted
Borderless Capital
Queens, New York

You have experience working in an end-to-end machine learning team environment: analyzing data, building and iterating on machine learning models, writing production-level code and shipping to production, monitoring performance, and A/B testing. Collaborate with cross-functional partners, such as pr...

Promoted
Vimeo Inc
Queens, New York
Remote

We are seeking a highly motivated and innovative Machine Learning Engineer to join our team. Machine Learning Engineer, Data Scientist, or in a similar role, with a focus on personalization, sales, and marketing. Vimeo is at the forefront of innovation, driving personalized experiences and cutting-e...

Promoted
Jupiter Power
Queens, New York
Remote

As an open-source Machine Learning Engineer, you will work to improve the open-source machine learning ecosystem. Here at Hugging Face, we’re on a journey to advance good Machine Learning and make it more accessible. You will mainly work with existing open-source libraries, such as Transformers, Dat...

Promoted
IDEOgram
Queens, New York

Ideogram is a new generative AI startup looking for a dedicated Machine Learning Data Engineer to join the team. As a ML Data Engineer, you will build state-of-the-art machine learning data infrastructure. Applied machine learning experience, prior experience in training ML models on large scale dat...

Promoted
runwayml.com
Queens, New York

We’re looking for Machine Learning Engineers who can help us build new multimodal AI systems for media generation. The ideal candidate for this role has a combination of strong engineering skills, extensive experience working on machine learning projects, and thrives in fast-paced, high-ownership en...

Promoted
Samsung Electronics Perú
Queens, New York

As a machine learning model engineer of the Samsung Ads Platform Intelligence (PI) team, you will have access to unique Samsung proprietary data to develop and deploy a wide spectrum of large-scale machine learning products with real-world impact. Closely work with machine learning platform and serv...

DoorDash
New York, New York

As a Staff Machine Learning Engineer, you’ll be conceptualizing, designing, implementing, and validating algorithmic improvements to the catalog system and our product knowledge graph at the heart of our fast-growing grocery and retail delivery business. We’re looking for a passionate Applied Machin...

Skale
New York, New York

Looking for a Senior ML Engineer who can train, evaluate and deploy deep learning models. NLP and/ or computer vision and machine learning. PhD in Machine Learning or Computer Vision. ...

King
New York, New York

Aggregate huge amounts of data and information from large numbers of sources to discover patterns and features necessary to build machine learning models for prediction and forecasting. Design and implement end- to- end solutions using Machine Learning, Optimization, and other advanced computer scie...