
Data Center GPU Systems Application Engineer

AMD
Santa Clara, CA, US
Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences, the building blocks for the data center, artificial intelligence, PCs, gaming, and embedded.

Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges.

We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance

Data Center Systems Application Engineer

THE TEAM:

AMD's Data Center GPU organization is transforming the industry with our AI-based graphics processors. Our primary objective is to design exceptional products that drive the evolution of computing experiences, serving as the cornerstone for enterprise data centers, artificial intelligence (AI), HPC, and embedded systems.

If this resonates with you, come join our Data Center GPU organization, where we are building amazing AI-powered products with amazing people.

THE ROLE:

The Datacenter GPU System Engineering team is seeking a strong Data Center Systems Application Engineer. In this customer-focused role, you will work with the AI and HPC GPUs being designed into exascale-class supercomputers.

You will work directly with key external OEM partners, field application teams, internal development teams, and other stakeholders to bring a portfolio of next-generation server platforms to market using AMD Instinct™ accelerators, and to support them post-production.

KEY RESPONSIBILITIES:

  • Manage technical interaction with OEM/ODM Partners to enable deployment of AMD Instinct™ Accelerators in Partner systems.
  • Support Partners in the bring-up and validation of AMD Instinct™ GPUs in their systems, and guide them on the use of AMD tools, qualification test methods, and analysis of test results.
  • Lead the debug of Partner/Customer issues (HW, firmware, driver), working with a cross-functional team and driving the root-cause investigation.
  • Work with Partners on the development of manufacturing/screen tests to ensure reliability at scale.
  • Understand Partner requirements and schedules, identify gaps in AMD's offering, and work with other stakeholders to close them.
  • Author design guidelines, technical presentations, and training material.
  • Provide recommendations to improve the customer experience with our SW and HW.

PREFERRED EXPERIENCE:

  • 10+ years of experience in Data Center system design, board design, validation, or Application Engineering, preferably in external customer-facing roles.
  • Strong knowledge of PC/server architecture and interfaces.
  • Strong system-level debugging skills, with hands-on experience in system bring-up, HW debug, and performance optimization on various system architectures.
  • Understanding of and experience working with Enterprise Linux environments (Ubuntu, CentOS/RHEL, and SLES).
  • Excellent oral and written communication skills to communicate technical results clearly and accurately.
  • Test or design experience in system interconnect technologies (PCIe, XGMI, CXL, USB, I/O controllers).
  • Familiarity with various deployment models including cloud, virtualization and containers.
  • Experience with automation, orchestration, and delivery via Kubernetes, Docker, or Mesos.
  • Experience with or knowledge of server firmware/BIOS settings, the boot process, and server monitoring and management SW.
  • Experience relating to power and thermal management.
  • Solid knowledge of Shell/Bash, C/C++, Python, or another language or framework.
  • Experience with OpenCL, CUDA, or ROCm is a plus.

ACADEMIC CREDENTIALS:

BS in Electrical Engineering, Computer Engineering, or Computer Science. MS or PhD a plus.

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position.

You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan.

You'll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.

We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

30+ days ago