Search jobs > Santa Clara, CA > Temporary > Failure engineer gpu

GPU / Server Failure Analysis Engineer

JobLookup
Santa Clara, California, US
$43-$45 an hour
Full-time

Location : Santa Clara, Ca.

The information below covers the role requirements, expected candidate experience, and accompanying qualifications.

Rate : $43 - $45 / hr

Contract to Hire - 6+ months, Onsite

Local Candidates Highly Desirable!

Summary

The Failure Analysis Engineer uses procedures and instructions to initiate the analysis process when product failure occurs.

Investigations are researched for root causes with analysis documented, recorded, and communicated internally and with the client company.

Responsible for failure analysis on Customer returned Server / GPU boards.

Essential Duties and Responsibilities include the following. Other Duties not listed may be assigned.

Data Analysis / Communication / Issue Resolution - Prevention - 90% of Job

  • Have ability to system and board level testing and debugging down to components level.
  • Have knowledge to do component swapping, removal to isolate failures and overall deeper FA
  • Visual mechanical inspection (VMI) of Server / GPU board components (Motherboards, GPU, GPU baseboard, CPU, DIMM, NIC, SSD, Power Supply, etc.

and / or electronics components

  • Completes component level trouble shooting (capacitor, resistors, fuse, IC, diode, etc.) and failure analysis
  • Complete sample analysis for equipment and process qualifications.
  • Conduct innovative use of new analytical tools, equipment, and methodologies.
  • Develop records and perform various failure analysis on systems and components to identity root cause.
  • Develop formal failure analysis data reporting and present internally and to client.
  • Requires communication with Asia team in emails and conference calls.
  • Be proficient in a large number of applications used for database management and reporting.
  • Follows procedures and diagrams in performing engineering change orders as required.
  • Other duties as assigned.

Team Leadership - 10% of Job

  • Responsible for maintaining a positive work environment while supporting the company's culture.
  • Responsible for fostering a healthy and safe work environment, focusing on the well-being of all associates.
  • Maintains and strengthens internal, external customer, and supplier relationships. Communicates and works well with

all business partners.

Ensures inventory management and merchandise allocation flows product through the warehouse in the most cost

efficient and productive manner.

Education and / or Experience

  • Bachelor's degree in Electrical Engineering
  • 4 years' experience in Failure Analysis
  • Comprehensive server knowledge is essential. (BIOS / BMC / CPLD / FPGA, etc.)
  • Ability to use electronic test equipment (oscilloscopes / multi-meters / thermal imaging camera, etc.)
  • Must be able to speak English. Able to speak Mandarin Chinese is a plus.

Essential Skills :

  • Requires excellent written and oral communication skills.
  • Knowledge of basic Linux environment and commands
  • Familiarity with Microsoft Office software (advanced excel)
  • Familiar with computer software and operating systems and possess the ability to identify and perform software updates (BIOS, BMC, Components Firmware, etc.)
  • Ability to read and interpret schematics / block diagrams with detailed understanding of server and subassembly functionality.
  • Ability to read and interpret system board views and board layout details
  • Effectively communicate concepts and solutions with various levels of the organization.
  • Effectively liaison between Company and Client.
  • Must be able to work cross-functionally with minimal supervision.
  • Requires strong analytical and statistical skills.
  • Requires flexibility to work overtime for special projects or business supports.

Competencies :

  • Shows determination to achieve excellent results
  • Finds better ways
  • Demands top perform

J-18808-Ljbffr

3 days ago
Related jobs
Ledgent Technology
Santa Clara, California

The Failure Analysis Engineer uses procedures and instructions to initiate the analysis process when product failure occurs. Responsible for failure analysis on Customer returned Server/ GPU boards. Visual mechanical inspection (VMI) of Server/GPU board components (Motherboards, GPU, GPU baseboard, ...

Promoted
Western Digital
San Jose, California

Collaboration with global failure analysis team members, as well as process engineers, to capture common defect observations and drive defect reduction improvements. Developing new failure analysis processes and capabilities required for next-generation products and unique failure events. Bache...

Western Digital
San Jose, California

Leading lab analysts/technicians to complete failure analysis jobs, generating clear and concise failure analysis reports, and assisting process engineering teams address the root cause of failures. Collaboration with global failure analysis team members, as well as process engineers, to capture com...

PEAK Technical Staffing
San Jose, California

Interfaces with customer and R&D, Product Engineering and Quality Engineering organizations to identify elevated factory and field failure symptom's requiring in depth circuit failure analysis which ultimately leads to identification of root cause, corrective action recommendations of ESSN Server Pr...

Western Digital Capital
San Jose, California

Leading lab analysts/technicians to complete failure analysis jobs, generating clear and concise failure analysis reports. Principal Media Failure Analysis Engineer. Bachelors, Masters’ or PhD Degree in Materials Engineering, Chemical Engineering, Chemistry, Mechanical Engineering, Physics, or relat...

netPolarity
Santa Clara, California

Drive returned product Failure Analysis, characterizing failures, and escalating issues and trends to the Hardware Quality Engineering team Resolve escalated RMA's by determining the hardware root cause of the failure issue. Failure Analysis Engineer Job Code67711 Post Date06/05/2024 CitySanta Clara...

Palo Alto Networks
Santa Clara, California

Failure Analysis Engineer to join our dynamic Hardware Quality Engineering team. Lead the failure analysis process for returned products, identify failure trends, and escalate issues to the Hardware Quality Engineering team. In this critical role, you’ll collaborate closely with HW Engineering, Cust...

Support Revolution
San Jose, California

High-performance product team in Supermicro is seeking talented Senior System Product Engineer who can lead the technical collateral development of in-house server system products. Diagnose the root cause of system failures and isolate the components/failure nodes. Bachelor’s degree in Electrical, C...

PEAK Technical Staffing USA
San Jose, California

Interfaces with customer and R&D, Product Engineering and Quality Engineering organizations to identify elevated factory and field failure symptom's requiring in depth circuit failure analysis which ultimately leads to identification of root cause, corrective action recommendations of ESSN Server Pr...

Net2Source
San Jose, California

Position: Failure Analysis Engineer. ...