A company is looking for a Senior Software Engineer, Machine Learning Inference.
Key Responsibilities
Design, develop, and optimize NVIDIA TensorRT and TensorRT-LLM for inference applications
Develop software in C++, Python, and CUDA for deploying LLMs and Generative AI models
Collaborate with deep learning experts and GPU architects to influence hardware and software design
Required Qualifications
BS, MS, PhD or equivalent experience in Computer Science, Computer Engineering, or a related field
8+ years of software development experience on a large codebase or project
Strong proficiency in C++ (required), Rust or Python programming languages
Experience in developing Deep Learning Frameworks, Compilers, or System Software
Knowledge of Machine Learning techniques and GPU programming with CUDA or OpenCL
Senior Engineer Machine Learning • Saint Paul, Minnesota, United States