A company is looking for a Senior AI and ML Storage Engineer.
Key Responsibilities
Design, develop, and operate distributed systems for managing data, compute, and networking for large-scale AI workloads
Build software and automation to orchestrate workloads across thousands of GPUs and petabytes of storage in multi-region clusters
Collaborate with AI/ML research teams to translate their requirements into scalable, high-performance solutions
Required Qualifications
BS or equivalent experience in Computer Science, Computer Engineering, or a related technical field
8+ years of experience developing and operating large-scale distributed systems or HPC environments
Strong programming skills in C++, Python, or Go, with experience building production-quality software systems
Solid understanding of distributed systems principles and large-scale orchestration frameworks
Hands-on experience with high-performance storage and compute scheduling tools
Storage Engineer • Mobile, Alabama, United States