Talent.com
NCCL/RCCL SW Engineer

Hewlett Packard Enterprise, Minneapolis, MN, United States
Job type: Full-time

This role has been designed as 'Hybrid' with an expectation that you will work on average 2 days per week from an HPE office.

Who We Are:

Hewlett Packard Enterprise is the global edge-to-cloud company advancing the way people live and work. We help companies connect, protect, analyze, and act on their data and applications wherever they live, from edge to cloud, so they can turn insights into outcomes at the speed required to thrive in today's complex world. Our culture thrives on finding new and better ways to accelerate what's next. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good. If you are looking to stretch and grow your career, our culture will embrace you. Open up opportunities with HPE.

Job Description:

An NCCL/RCCL engineer plays a crucial role in ensuring the quality and performance of applications and middleware based on the NVIDIA Collective Communications Library (NCCL) and the ROCm Communication Collectives Library (RCCL), particularly in High-Performance Computing (HPC) environments. They are responsible for testing, debugging, and validating parallel programming frameworks and their implementations to meet established standards and specifications. This involves working with both the hardware and software aspects of GPU-accelerated HPC systems, ensuring optimal functionality and efficiency for communication middleware.

Key responsibilities

  • Test plan development and execution: Designing and executing comprehensive test plans to validate MPI and SHMEM features, functionality, and performance.
  • Debugging and Root Cause Analysis: Identifying, analyzing, and resolving issues found during validation and testing, collaborating with development teams to implement corrective actions.
  • Performance Evaluation and Optimization: Evaluating and optimizing the performance of MPI and SHMEM based applications and middleware, including collective communication algorithms like AllReduce.
  • Automation and Infrastructure Development: Developing and maintaining post-silicon validation infrastructure including software, hardware, and automation environments.
  • Collaboration: Working closely with hardware teams, software developers, architects, and various stakeholders to ensure seamless integration and validation of systems.
  • Documentation: Generating and maintaining detailed documentation of validation activities, test results, and compliance reports.
  • Troubleshooting: Providing technical expertise and support for troubleshooting and resolving technical issues related to MPI and SHMEM.
  • Staying updated with technology: Maintaining knowledge of validation trends, industry standards, and new technologies in high-performance computing, parallel programming, and communication middleware.

This position will support government accounts. Therefore, due to federal export-control regulations, the selected candidate must hold U.S. citizenship.

Required skills

  • Parallel Programming and Communication: Strong understanding of parallel programming models, development, validation and performance analysis of GPU communication libraries (NCCL/RCCL) in distributed AI/HPC systems.
  • HPC Architectures: Knowledge of high-performance memory subsystems, SoC/ASIC memory architecture, high-speed I/O interfaces, and their interaction with parallel programming models.
  • Programming and Scripting: Proficiency in programming languages like C/C++, Python, and potentially others like Perl, for developing validation tests, scripts, and tools.
  • Validation Methodologies: Experience with various validation methodologies, including formal analysis and runtime instrumentation, for detecting MPI bugs and ensuring correctness.
  • Debugging Tools and Techniques: Expertise in utilizing debugging tools, methodologies, and techniques for identifying and resolving hardware and software issues at various levels.
  • Test Automation: Experience with test automation frameworks and methodologies for developing and maintaining automated regression tests and scripts.
  • Analytical and Problem-Solving Skills: Excellent analytical and problem-solving abilities to dissect complex systems, identify issues, and propose innovative solutions.
  • Communication and Collaboration: Strong communication and interpersonal skills for effective collaboration with cross-functional teams and stakeholders.
  • Attention to Detail: Meticulous attention to detail to catch discrepancies and ensure thorough validation of systems and processes.
Education and experience

  • Bachelor's or Master's degree in Computer Science, Computer Engineering, Electrical Engineering, or a related field is required.
  • 2+ years of experience in SoC/ASIC validation and debug, particularly within the context of high-performance computing and parallel programming, is highly beneficial.

Additional Skills:

Cloud Architectures, Cross Domain Knowledge, Design Thinking, Development Fundamentals, DevOps, Distributed Computing, Microservices Fluency, Full Stack Development, Security-First Mindset, Solutions Design, Testing & Automation, User Experience (UX)

What We Can Offer You:

Health & Wellbeing

We strive to provide our team members and their loved ones with a comprehensive suite of benefits that supports their physical, financial and emotional wellbeing.

Personal & Professional Development

We also invest in your career because the better you are, the better we all are. We have specific programs catered to helping you reach any career goals you have, whether you want to become a knowledge expert in your field or apply your skills to another division.

Unconditional Inclusion

We are unconditionally inclusive in the way we work and celebrate individual uniqueness. We know varied backgrounds are valued and succeed here. We have the flexibility to manage our work and personal needs. We make bold moves, together, and are a force for good.

Let's Stay Connected:

Follow @HPECareers on Instagram to see the latest on people, culture and tech at HPE.

Job: Engineering

Job Level: TCP_03

States with Pay Range Requirement

The expected salary/wage range for a U.S.-based hire filling this position is provided below. Actual offer may vary from this range based upon geographic location, work experience, education/training, and/or skill level. If this is a sales role, then the listed salary range reflects combined base salary and target-level sales compensation pay. If this is a non-sales role, then the listed salary range reflects base salary only. Variable incentives may also be offered. Information about employee benefits offered can be found at .

USD Annual Salary: $108,500.00 - $206,500.00

The estimated job application period closure is November 29, 2025; this timeline is provided for transparency and internal planning purposes.

HPE is an Equal Employment Opportunity / Veterans / Disabled / LGBT employer. We do not discriminate on the basis of race, gender, or any other protected category, and all decisions we make are made on the basis of qualifications, merit, and business need. Our goal is to be one global team that is representative of our customers, in an inclusive environment where we can continue to innovate and grow together. Please click here: Equal Employment Opportunity.

Hewlett Packard Enterprise is EEO Protected Veteran / Individual with Disabilities.

HPE will comply with all applicable laws related to employer use of arrest and conviction records, including laws requiring employers to consider for employment qualified applicants with criminal histories.
