Search jobs > Santa Clara, CA > Temporary > Engineer gpu server

Failure Analysis Engineer - GPU/Server

JobLookup
Santa Clara, California, US
$43-$45 an hour
Full-time

Location : Santa Clara, Ca.

The information below covers the role requirements, expected candidate experience, and accompanying qualifications.

Rate : $43 - $45 / hr

Contract to Hire - 6+ months

Summary

The Failure Analysis Engineer uses procedures and instructions to initiate the analysis process when product failure occurs.

Investigations are researched for root causes with analysis documented, recorded, and communicated internally and with the client company.

Responsible for failure analysis on Customer returned Server / GPU boards.

Essential Duties and Responsibilities include the following. Other Duties not listed may be assigned.

Data Analysis / Communication / Issue Resolution - Prevention - 90% of Job

  • Have ability to system and board level testing and debugging down to components level.
  • Have knowledge to do component swapping, removal to isolate failures and overall deeper FA
  • Visual mechanical inspection (VMI) of Server / GPU board components (Motherboards, GPU, GPU baseboard, CPU, DIMM, NIC, SSD, Power Supply, etc.

and / or electronics components

  • Completes component level trouble shooting (capacitor, resistors, fuse, IC, diode, etc.) and failure analysis
  • Complete sample analysis for equipment and process qualifications.
  • Conduct innovative use of new analytical tools, equipment, and methodologies.
  • Develop records and perform various failure analysis on systems and components to identity root cause.
  • Develop formal failure analysis data reporting and present internally and to client.
  • Requires communication with Asia team in emails and conference calls.
  • Be proficient in a large number of applications used for database management and reporting.
  • Follows procedures and diagrams in performing engineering change orders as required.
  • Other duties as assigned.

Team Leadership - 10% of Job

  • Responsible for maintaining a positive work environment while supporting the company's culture.
  • Responsible for fostering a healthy and safe work environment, focusing on the well-being of all associates.
  • Maintains and strengthens internal, external customer, and supplier relationships. Communicates and works well with

all business partners.

Ensures inventory management and merchandise allocation flows product through the warehouse in the most cost

efficient and productive manner.

Education and / or Experience

  • Bachelor's degree in Electrical Engineering
  • 4 years' experience in Failure Analysis
  • Comprehensive server knowledge is essential. (BIOS / BMC / CPLD / FPGA, etc.)
  • Ability to use electronic test equipment (oscilloscopes / multi-meters / thermal imaging camera, etc.)
  • Must be able to speak English. Able to speak Mandarin Chinese is a plus.

Essential Skills :

  • Requires excellent written and oral communication skills.
  • Knowledge of basic Linux environment and commands
  • Familiarity with Microsoft Office software (advanced excel)
  • Familiar with computer software and operating systems and possess the ability to identify and perform software updates (BIOS, BMC, Components Firmware, etc.)
  • Ability to read and interpret schematics / block diagrams with detailed understanding of server and subassembly functionality.
  • Ability to read and interpret system board views and board layout details
  • Effectively communicate concepts and solutions with various levels of the organization.
  • Effectively liaison between Company and Client.
  • Must be able to work cross-functionally with minimal supervision.
  • Requires strong analytical and statistical skills.
  • Requires flexibility to work overtime for special projects or business supports.

Competencies :

  • Shows determination to achieve excellent results
  • Finds better ways
  • Demands top performance
  • Inspires commitment

Working Condition

J-18808-Ljbffr

4 days ago
Related jobs
Ledgent Technology
Santa Clara, California

The Failure Analysis Engineer uses procedures and instructions to initiate the analysis process when product failure occurs. Responsible for failure analysis on Customer returned Server/ GPU boards. Visual mechanical inspection (VMI) of Server/GPU board components (Motherboards, GPU, GPU baseboard, ...

netPolarity
Santa Clara, California

Drive returned product Failure Analysis, characterizing failures, and escalating issues and trends to the Hardware Quality Engineering team Resolve escalated RMA's by determining the hardware root cause of the failure issue. Failure Analysis Engineer Job Code67711 Post Date06/05/2024 CitySanta Clara...

Western Digital Capital
San Jose, California

Collaborate with lab analysts/technicians to identify the root cause of failure, generate clear and concise failure analysis reports, and assist process engineering teams address the root cause of failures. Collaborate with global failure analysis team members, as well as process engineers, to captu...

Astro Digital
Santa Clara, California

Your primary role will be to document and provide in-depth root cause understanding of test failures. This hands-on position requires close collaboration with multiple teams and a strong background in Electrical Engineering. You will provide statistical data on different types of failures and recomm...

Western Digital
San Jose, California

Collaborate with lab analysts/technicians to identify the root cause of failure, generate clear and concise failure analysis reports, and assist process engineering teams address the root cause of failures. Collaborate with global failure analysis team members, as well as process engineers, to captu...

Power Integrations, Inc.
San Jose, California

The Senior Failure Analysis Engineer will perform power supply or system level failure analysis to support RMA and internal/external customer issues. Perform fault isolation and defect analysis/characterization on power ICs to identify root causes of product failures in reliability test, production ...

HCLTech
Fremont, California

Failure Analysis Engineering activities with deep technical engineering expertise in Server / Storage platforms. Perform electrical failure analysis on failing server components – motherboards, CPUs, GPUs, DIMMs, etc. Role: Failure Analysis Engineer. Support failure analysis of Open Compute Server /...

Western Digital
San Jose, California

Leading lab analysts/technicians to complete failure analysis jobs, generating clear and concise failure analysis reports, and assisting process engineering teams address the root cause of failures. Collaboration with global failure analysis team members, as well as process engineers, to capture com...

Western Digital Capital
San Jose, California

Leading lab analysts/technicians to complete failure analysis jobs, generating clear and concise failure analysis reports. Principal Media Failure Analysis Engineer. Bachelors, Masters’ or PhD Degree in Materials Engineering, Chemical Engineering, Chemistry, Mechanical Engineering, Physics, or relat...

Palo Alto Networks
Santa Clara, California

Failure Analysis Engineer to join our dynamic Hardware Quality Engineering team. Lead the failure analysis process for returned products, identify failure trends, and escalate issues to the Hardware Quality Engineering team. In this critical role, you’ll collaborate closely with HW Engineering, Cust...