Search jobs > Austin, TX > Ai ml engineer

AI/ML Staff Systems Design Engineer - C/C++, Kernel development for AI/ML processor, performance modeling

Advanced Micro Devices, Inc
Austin, Texas, United States
Full-time

WHAT YOU DO AT AMD CHANGES EVERYTHING We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world.

Our mission is to build great products that accelerate next-generation computing experiences the building blocks for the data center, artificial intelligence, PCs, gaming and embedded.

Underpinning our mission is the AMD culture. We push the limits of innovation to solve the world’s most important challenges.

We strive for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. AMD together we advance THE ROLE : We are looking for a dynamic, energetic candidate to join our growing team in AI Group.

In this role, the individual will be responsible for optimizing neural processor compiler, developing tools and methodologies to optimize and realize full system performance for AI workloads, responsible for architecting and defining kernel dataflow, defining block level and system level performance of Neural Processing Unit (NPU), NPU network performance modeling, and performance bottleneck analysis on pre / post silicon platforms THE PERSON : You will be tasked with analyzing AI workloads, analyzing system-level performance bottlenecks, and finding ways to achieve the best performance and power.

KEY RESPONSIBILITIES : Work with cross-functional teams to optimize various parts of the SW stack AI Compiler, AI frameworks, device drivers, and firmware.

Work on block & system level performance analysis for VLIW based AI Engine processor architecture. Bring up emerging ML models based on CNN, transformers and characterize performance.

Develop and validate VLIW-based processor systems on both pre-Silicon and post-Silicon platforms based on different use case applications Develop application specific reusable kernel code for AI Engine processors.

Bring up & debug on pre / post silicon platform. Debug the failures on pre / post-silicon platform using trace interface, waveform viewer.

Solve challenging technical problems with complex SoC based systems that integrate robust algorithms and features. Lead the discussion in the AI Engines Technical Solutions Team forum.

Be involved in all aspects of integrated product development, including design, prototyping, implementation, testing, and product demonstration.

Provide feedback on architecture, use cases, IP design, tools, and documentation. Create reference model using Matlab / Python libraries and verify functionality of kernels PREFERRED EXPERIENCE : Solid knowledge of AI and ML concepts and techniques.

Practical experience applying these concepts to solve real-world problems in the context of research or work experience.

Understanding the performance implications on AI acceleration of different compute, memory, and communication configurations and hardware and software implementation choices.

Developing and optimizing code for VLIW processors. Analyzing code for high performance CONV, GEMM and non-linear operators Deep understanding of AI frameworks, preferably ONNX.

Experience with AI / ML inference stacks such as ONNXRuntime. Proficiency in pre and post silicon performance analysis of ML models for edge and cloud based platforms.

Proficiency in C++ based kernel development for distributed processors. Excellent C / C++ coding skills Experience in processor performance and memory performance characterization Experience in system debug tool is plus e.

g. using Lauterbach, gdb, Valgrind and other debug tools is required. Experience in TensorFlow, PyTorch, Keras is a plus.

Experience with static and dynamic power characterization is a plus. Familiarity with VLIW SIMD vector processor architecture, ACADEMIC CREDENTIALS : BS or MS with industry experience PhD in Electrical Engineering or Computer Engineering Location : Austin TX #LI-RF1 #LI-HYBRID At AMD, your base pay is one part of your total rewards package.

Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position.

You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan.

You’ll also be eligible for competitive benefits described in more detail here. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.

We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

THE ROLE : We are looking for a dynamic, energetic candidate to join our growing team in AI Group. In this role, the individual will be responsible for optimizing neural processor compiler, developing tools and methodologies to optimize and realize full system performance for AI workloads, responsible for architecting and defining kernel dataflow, defining block level and system level performance of Neural Processing Unit (NPU), NPU network performance modeling, and performance bottleneck analysis on pre / post silicon platforms THE PERSON : You will be tasked with analyzing AI workloads, analyzing system-level performance bottlenecks, and finding ways to achieve the best performance and power.

KEY RESPONSIBILITIES : Work with cross-functional teams to optimize various parts of the SW stack AI Compiler, AI frameworks, device drivers, and firmware.

Work on block & system level performance analysis for VLIW based AI Engine processor architecture. Bring up emerging ML models based on CNN, transformers and characterize performance.

Develop and validate VLIW-based processor systems on both pre-Silicon and post-Silicon platforms based on different use case applications Develop application specific reusable kernel code for AI Engine processors.

Bring up & debug on pre / post silicon platform. Debug the failures on pre / post-silicon platform using trace interface, waveform viewer.

Solve challenging technical problems with complex SoC based systems that integrate robust algorithms and features. Lead the discussion in the AI Engines Technical Solutions Team forum.

Be involved in all aspects of integrated product development, including design, prototyping, implementation, testing, and product demonstration.

Provide feedback on architecture, use cases, IP design, tools, and documentation. Create reference model using Matlab / Python libraries and verify functionality of kernels PREFERRED EXPERIENCE : Solid knowledge of AI and ML concepts and techniques.

Practical experience applying these concepts to solve real-world problems in the context of research or work experience.

Understanding the performance implications on AI acceleration of different compute, memory, and communication configurations and hardware and software implementation choices.

Developing and optimizing code for VLIW processors. Analyzing code for high performance CONV, GEMM and non-linear operators Deep understanding of AI frameworks, preferably ONNX.

Experience with AI / ML inference stacks such as ONNXRuntime. Proficiency in pre and post silicon performance analysis of ML models for edge and cloud based platforms.

Proficiency in C++ based kernel development for distributed processors. Excellent C / C++ coding skills Experience in processor performance and memory performance characterization Experience in system debug tool is plus e.

g. using Lauterbach, gdb, Valgrind and other debug tools is required. Experience in TensorFlow, PyTorch, Keras is a plus.

Experience with static and dynamic power characterization is a plus. Familiarity with VLIW SIMD vector processor architecture, ACADEMIC CREDENTIALS : BS or MS with industry experience PhD in Electrical Engineering or Computer Engineering Location : Austin TX #LI-RF1 #LI-HYBRIDAt AMD, your base pay is one part of your total rewards package.

Your base pay will depend on where your skills, qualifications, experience, and location fit into the hiring range for the position.

You may be eligible for incentives based upon your role such as either an annual bonus or sales incentive. Many AMD employees have the opportunity to own shares of AMD stock, as well as a discount when purchasing AMD stock if voluntarily participating in AMD’s Employee Stock Purchase Plan.

You’ll also be eligible for competitive benefits described in more detail here. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services.

AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and / or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law.

We encourage applications from all qualified candidates and will accommodate applicants’ needs under the respective laws throughout all stages of the recruitment and selection process.

30+ days ago
Related jobs
Advanced Micro Devices, Inc
Austin, Texas

In this role, the individual will be responsible for optimizing neural processor compiler, developing tools and methodologies to optimize and realize full system performance for AI workloads, responsible for architecting and defining kernel dataflow, defining block level and system level performance...

Promoted
Raytheon
Bee Cave, Texas

Join our organization and experience the Systems V engineering life cycle while interfacing with a variety of engineering disciplines, subject matter experts, chief engineers, chief technologists, the customer and so much more to ensure we design, integrate, and build our systems to work the first t...

Promoted
University of Texas at Austin
Austin, Texas

However, employees who have access to the compensation information of other employees or applicants as a part of their essential job functions cannot disclose the pay of other employees or applicants to individuals who do not otherwise have access to compensation information, unless the disclosure i...

Promoted
Apple Inc.
Austin, Texas

If you enjoy learning new technologies, solving challenges with little mentorship, and are comfortable proposing and implementing solutions, demonstrating Software Engineering standard methodologies, you will find it exciting to work in AiDP! The ideal candidate for this position will be able to thi...

Promoted
Amazon
Austin, Texas

The Machine Learning Platform Software Team is looking for a Software Engineer who wants to develop industry leading acceleration platforms with an affinity towards efficient, robust, and highly available systems. As a member of the UC organization, you’ll support the development and management of C...

Outlier
Austin, Texas

Assessing the factuality and relevance of domain-specific text produced by AI models. Evaluating and ranking domain-specific responses generated by AI models. PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of opportunities that m...

Advanced Micro Devices, Inc
Austin, Texas

AMD GPUs in upstream open-source repositories Collaborate and interact with internal GPU library teams to analyze and optimize training and inference for deep learning Work with open-source framework maintainers to understand their requirements – and have your code changes integrated upstream Work i...

Tek Ninjas
TX, United States

NLP Natural language processing (NLP)....

Inherent Technologies
TX, United States

Position: AI Engineer/Architect </b></p> <p><b>Location: Dallas TX/Remote</b></p> <p> </p> <p><b><u>Roles and Responsibilities:</u></b></p> <p> </p> <p> Educational Qualifications: Graduate or ...

ExcelSoft
TX, United States

You will collaborate with data scientists, ML engineers, and IT professionals to ensure that machine learning models are seamlessly integrated into production systems and perform at scale. As an ML Ops Engineer, you will be responsible for managing the machine learning lifecycle, with a focus on dep...