Talent.com
AI / HPC Network Development Engineer - Networking

AI / HPC Network Development Engineer - Networking

xAIMemphis, TN, US
job_description.job_card.30_days_ago
serp_jobs.job_preview.job_type
  • serp_jobs.job_card.full_time
job_description.job_card.job_description

Job Description

Job Description

About xAI

xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company's mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

About the Role

xAI was first in the world to build a 100k GPU cluster on an ethernet network and then did it again in 92 days, floors, walls and all. We need an engineer with deep experience in RoCEv2 that can develop at hyper scale while optimizing performance and availability.

xAI is building at a furious pace with the latest hardware to help people understand the universe. To make the next significant leap forward, we need to own our own destiny by understanding our current network performance and availability and then optimize it to our training models and how we execute customer inference queries. You will spend most of your days deep inside NCCL, building metric dashboards and tweaking configurations to ensure no performance is left on the table. You will help design the next iteration of our backend and front-end networks that will allow us to seamlessly build-out new GPU infrastructure with little to no engineering assistance.

There will be a significant amount of travel to Memphis for building more capacity as well as participating in a team on-call rotation and helping on other scaling and maintenance efforts. This will become easier as we build out the team and engineers contribute to deployment and operations frameworks to remove repetitive tasks.

Location

We have 2 openings, one based in Palo Alto, California and the other in Dublin, Ireland. There will be significant travel expected to Memphis, Tennessee for data center buildouts and to the head office in Palo Alto for team collaboration.

Required Qualifications

  • A minimum of 10 years designing and operating large scale networks with 5 years in the ethernet AI / HPC space.
  • Deep understanding of congestion control on ethernet with Infiniband an added bonus.
  • Deep understanding of AI training and inference workloads and how they operate on the network. As part of this you are able to use and debug NCCL and potentially commit to the library.
  • Expertise in creating a portfolio of metrics for performance and operations to optimize the fleet for training and inference traffic.
  • Experience with Python to automate away repetitive tasks and facilitate your daily job working with and analyzing large sets of data.

Interview Process

After submitting your application, the team reviews your CV and statement of exceptional work. If your application passes this stage, you will be invited to an initial interview (45 minutes - 1 hour) during which a member of our team will ask some basic questions. If you clear the initial phone interview, you will enter the main process, which consists of five interviews :

  • Coding assessment in a language of your choice.
  • Data center network technologies and RoCEv2.
  • Manager Interview.
  • Meet and greet with the wider team where you will run through a presentation of a body of work you are proud of.
  • Benefits

    Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

    xAI is an equal opportunity employer.

    California Consumer Privacy Act (CCPA) Notice

    serp_jobs.job_alerts.create_a_job

    Network Engineer • Memphis, TN, US

    Job_description.internal_linking.related_jobs
    • serp_jobs.job_card.promoted
    Network Development Engineer

    Network Development Engineer

    xAIMemphis, TN, US
    serp_jobs.job_card.full_time
    AI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering exc...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Travel Nurse RN - Cardiac Cath Lab

    Travel Nurse RN - Cardiac Cath Lab

    Trusted Resource Associates (TRA)Bartlett, TN, US
    serp_jobs.job_card.full_time
    Trusted Resource Associates (TRA) is seeking a travel nurse RN Cardiac Cath Lab for a travel nursing job in Bartlett, Tennessee. Job Description & Requirements.TRA RN Cardiac Cath Lab : Secure, F...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    • serp_jobs.job_card.new
    Travel Nurse RN - Cardiac Cath Lab - $2,926 per week

    Travel Nurse RN - Cardiac Cath Lab - $2,926 per week

    Trusted Resource Associates (TRA)Bartlett, TN, United States
    serp_jobs.job_card.full_time
    Trusted Resource Associates (TRA) is seeking a travel nurse RN Cardiac Cath Lab for a travel nursing job in Bartlett, Tennessee. Job Description & Requirements.TRA RN Cardiac Cath Lab : Secure, Flexi...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_hours
    • serp_jobs.job_card.promoted
    Registered Nurse

    Registered Nurse

    U.S. NavyProctor, AR, United States
    serp_jobs.job_card.full_time +1
    To be eligible to enlist in the U.Navy, candidates must be between the ages of 18-34.The greatest reward for nearly every nurse is the joy of serving others. But in the Navy Nurse Corps, when you wo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Junior AI Developer

    Junior AI Developer

    Crew Training InternationalMemphis, TN, United States
    serp_jobs.job_card.full_time
    Requisition # .Job Title .Junior AI Developer .Job Type .Location .Memphis, TN 38119 US...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    Registered Nurse (RN) - Home Health

    Registered Nurse (RN) - Home Health

    AccentCareTunica, MS, US
    serp_jobs.job_card.full_time
    AccentCare is seeking a Registered Nurse (RN) Home Health for a nursing job in Tunica, Mississippi.Job Description & Requirements. RN / Registered Nurse, Weekend Baylor.RN Weekend Baylor, Home H...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Travel Nurse RN - Med Surg - $1,761 per week

    Travel Nurse RN - Med Surg - $1,761 per week

    Slate HealthcareMason, TN, United States
    serp_jobs.job_card.full_time
    Slate Healthcare is seeking a travel nurse RN Med Surg for a travel nursing job in Mason, Tennessee.Job Description & Requirements. Profession : RN, JobSpecialty : Registered Nurse, Shift : 3x12 Days, Dur...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Looking for the ultimate side hustle?

    Looking for the ultimate side hustle?

    Survey AuthorityArlington, TN, United States
    serp_jobs.job_card.full_time
    Earn cash by matching with real companies that pay you for your opinions.serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Remote Finance Advisor - AI Trainer

    Remote Finance Advisor - AI Trainer

    Data AnnotationBartlett, Tennessee
    serp_jobs.filters.remote
    serp_jobs.job_card.full_time +1
    We are looking for a finance professional to join our team to train AI models.You will measure the progress of these AI chatbots, evaluate their logic, and solve problems to improve the q...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    staff - Registered Nurse (RN) - Home Health - $70K-75K per year

    staff - Registered Nurse (RN) - Home Health - $70K-75K per year

    AccentCareTunica, MS, United States
    serp_jobs.job_card.full_time
    AccentCare is seeking a Registered Nurse (RN) Home Health for a nursing job in Tunica, Mississippi.Job Description & Requirements. RN / Registered Nurse, Weekend Baylor.RN Weekend Baylor, Home Healt...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Travel Nurse RN - Cardiac Catheterization Lab - $2,418 per week in Bartlett, TN

    Travel Nurse RN - Cardiac Catheterization Lab - $2,418 per week in Bartlett, TN

    TravelNurseSourceBartlett, TN, US
    serp_jobs.job_card.full_time
    TravelNurseSource is working with Medical Solutions to find a qualified Cath Lab RN in Bartlett, Tennessee, 38133!.A facility in Bartlett, TN is seeking its next amazing RN (Registered Nurse) to wo...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Travel Nurse RN - Med Surg

    Travel Nurse RN - Med Surg

    Slate HealthcareMason, TN, US
    serp_jobs.job_card.full_time
    Slate Healthcare is seeking a travel nurse RN Med Surg for a travel nursing job in Mason, Tennessee.Job Description & Requirements. Nurse, Shift : 3x12 Nights, Duration : 13 weeks.Slate Healthcare J...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_1_day
    • serp_jobs.job_card.promoted
    Staff - Registered Nurse (RN) - Director Med Surg / Telemetry - $28+ per hour

    Staff - Registered Nurse (RN) - Director Med Surg / Telemetry - $28+ per hour

    Tenet MemphisBartlett, TN, United States
    serp_jobs.job_card.full_time
    Tenet Memphis is seeking a Registered Nurse (RN) Director Med Surg / Telemetry for a nursing job in Bartlett, Tennessee.Job Description & Requirements. Saint Francis Hospital Bartlett is a 196-bed h...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff - Registered Nurse (RN) - ICU - Intensive Care Unit - $31+ per hour

    Staff - Registered Nurse (RN) - ICU - Intensive Care Unit - $31+ per hour

    Tenet MemphisBartlett, TN, United States
    serp_jobs.job_card.full_time
    Tenet Memphis is seeking a Registered Nurse (RN) ICU - Intensive Care Unit for a nursing job in Bartlett, Tennessee.Job Description & Requirements. Up to 20K Sign-on Bonus Based on Eligibility.Saint...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Staff - Registered Nurse (RN) - Manager, ICU - Intensive Care Unit - $31+ per hour

    Staff - Registered Nurse (RN) - Manager, ICU - Intensive Care Unit - $31+ per hour

    Tenet MemphisBartlett, TN, United States
    serp_jobs.job_card.full_time
    Tenet Memphis is seeking a Registered Nurse (RN) Manager, ICU - Intensive Care Unit for a nursing job in Bartlett, Tennessee. Job Description & Requirements.Saint Francis Hospital Bartlett is a 196-...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Travel Nurse - RN - Cath Lab - $2145.2 / Week

    Travel Nurse - RN - Cath Lab - $2145.2 / Week

    CrossMed HealthcareBartlett, TN, US
    serp_jobs.job_card.full_time
    CrossMed Healthcare is seeking an experienced Cath Lab Registered Nurse for an exciting Travel Nursing job in Bartlett, TN. Shift : 8 hr days Start Date : ASAP Duration : 13 weeks Pay : $2145.At CrossMe...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days
    • serp_jobs.job_card.promoted
    Travel Cardiac Cath Lab RN

    Travel Cardiac Cath Lab RN

    Medical SolutionsBartlett, TN, US
    serp_jobs.job_card.full_time
    Medical Solutions is seeking a travel nurse RN Cardiac Cath Lab for a travel nursing job in Bartlett, Tennessee.Job Description & Requirements. We’re seeking talented healthcare profession...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_30
    • serp_jobs.job_card.promoted
    RN Manager Med Surg FT Days

    RN Manager Med Surg FT Days

    Saint Francis Hospital - BartlettBartlett, TN, United States
    serp_jobs.job_card.full_time
    Saint Francis Hospital Bartlett is a 196-bed hospital dedicated to providing high quality, compassionate care to the community. As a comprehensive medical center, Saint Francis Hospital Bartlett has...serp_jobs.internal_linking.show_moreserp_jobs.last_updated.last_updated_variable_days