Search jobs > Santa Clara, CA > Data center engineer

Data Center GPU Systems Application Engineer

AMD
Santa Clara, CA, US
Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING

We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded.

Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world's most important challenges.

We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives.

AMD together we advance

Data Center Systems Application Engineer

THE TEAM :

AMD's Data Center GPU organization is transforming the industry with our AI based Graphic Processors. Our primary objective is to design exceptional products that drive the evolution of computing experiences, serving as the cornerstone for enterprise Data Centers, (AI) Artificial Intelligence, HPC and Embedded systems.

If this resonates with you, come and joining our Data Center GPU organization where we are building amazing AI powered products with amazing people.

THE ROLE :

The Datacenter GPU System Engineering team is seeking a strong Data Center Systems Application Engineer. In this customer-focused role, you will be working with the world's AI & HPC GPUs being designed into exascale class supercomputers.

You will be working directly with key external OEM partners, field application teams, internal development teams, and other stakeholders to bring a portfolio of next-generation Server Platforms to market using AMD's InstinctTM Accelerators and support them post-production.

KEY RESPONSIBILITIES :

  • Manage technical interaction with OEM / ODM Partners to enable deployment of AMD InstinctTM Accelerators in Partner systems.
  • Support Partners in the bring-up and validation of AMD InstinctTM GPUs in their system, guide partners on use of AMD tools, qualification test methods, and analysis of test results.
  • Lead the debug of Partner / Customer issues (HW, firmware, driver), working with a cross-functional team and driving the root cause investigation.
  • Work with Partners on the development of manufacturing / screen tests to ensure reliability at scale.
  • Understand Partner requirements and schedule, identify gaps in AMD offering and work with other stakeholders to close them.
  • Author design guideline, technical presentations, and training material.
  • Provide recommendation to improve customer experience with our SW and HW.

PREFERRED EXPERIENCE :

  • 10+ years' experience in Data Center system design, board design, validation, or Application Engineering, preferably in external customer facing roles.
  • Strong knowledge in PC / server architecture and interfaces, experience with system level debug.
  • Strong System Level debugging skills with hands-on experiences in system bring-up, HW debug, and performance optimizations on various system architectures.
  • Understanding and experience working with Enterprise Linux environment (Ubuntu, CentOS / RHEL and SLES).
  • Excellent oral and written communication skills to communicate technical results clearly and accurately.
  • Test or design experience in system interconnect technologies (PCIe, XGMI, CXL, USB, I / O Controllers)
  • Familiarity with various deployment models including cloud, virtualization and containers.
  • Automation, orchestration, delivery via Kubernetes, Docker, or Mesos.
  • Experience or knowledge of server firmware / BIOS settings, boot process, server monitoring and management SW.
  • Experience relating to power and thermal management.
  • Solid knowledge of Shell / BASH, C / C++, Python, or other framework.
  • Experience with OpenCL, CUDA, or ROCm is a plus.

ACADEMIC CREDENTIALS :

BS in Electrical Engineering, Computer Engineering, or Computer Science. MS or PhD a plus.

LI-CC2 #LI-Hybrid

At AMD, your base pay is one part of your total rewards package. Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position.

You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD's Employee Stock Purchase Plan.

You'll also be eligible for competitive benefits described in more detail here.

AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.

We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.

30+ days ago
Related jobs
Promoted
VirtualVocations
Santa Clara, California

Key Responsibilities:Assist with the development, deployment, and maintenance of automation softwareWork on DCIM asset creation and auditingCreate documentation, provide training, and participate in major initiativesRequired Qualifications:5+ years of software engineering experience3+ years combined...

Sanmina-SCI Systems de México
San Jose, California

The Field Applications Engineer is a customer and operations-facing role. They will build and further develop relationships with customer engineers and engineering management at Sr. Recognized as a technology leader, Sanmina Corporation provides end-to-end manufacturing solutions, delivering superio...

Cadence Design Systems, Inc.
San Jose, California

BS degree Computer Science/Engineering, Electrical, Engineering, or related field8+ years of design/EDA experienceStrong knowledge in Digital Design Fundamentals, Semiconductor fundamentals, and Static Timing Analysis is requiredPrior experience with IC digital implementation flows and backend EDA t...

Advanced Micro Devices, Inc.
Santa Clara, California

Business Operations Director - Data Center GPU. AMD's Data Center GPU organization is transforming the industry with our AI based Graphic Processors. Our primary objective is to design exceptional products that drive the evolution of computing experiences, serving as the cornerstone for enterprise D...

Amazon Development Center U.S., Inc.
Santa Clara, California

Our team has a great balance of research scientists and engineers working together to solve complex science and engineering challenges associated with design validation, testing, fuzzing, and runtime monitoring of large scale distributed systems. You’ll bring a passion for innovation, data, search, ...

NVIDIA
Santa Clara, California
Remote

We are looking for a Senior Technical Product Manager to take the lead on how these complex applications are deployed and monitored in the world’s most advanced data centers, helping craft technologies that allow our customers to easily deploy CUDA-enabled applications at scale. As a CUDA Product Ma...

Mediabistro
Santa Clara, California

The Market Development Manager, Data Center GPUwill play a key rolewithin the Data Center GPU Business Unit. This individual will be responsible for winning data center GPU designs around AMD's current and next generation GPUs, serving as the business development subject matter expert on our AI capa...

Element Critical
Sunnyvale, California

Oversees operation and management of routine and emergency services on a variety of critical systems such as: Stand-by diesel generators, Switchgear, Automatic Transfer Switches, Uninterrupted Power Supplies, Power Distribution Units, Air Handling Units, Computer Room Air Conditioners, Chillers, Coo...

Trilyon, Inc.
San Jose, California

Relevant experience in the mechanical design of computer/networking equipment for data center. Collaborate with thermal engineers to design and implement cooling solutions. Provide technical design guidance and mentoring for junior engineers. Bachelor’s degree in mechanical engineering or equivalent...

Apple
Cupertino, California

Join this team, and you’ll collaborate with engineers across Apple to build and deploy forward-looking prototype systems that contribute to the development of our world renowned hardware and software architecture. Apple’s Platform Architecture group is looking for talented engineers to build high pe...