System Level Debug Engineer - Data Center GPU

Job in Austin, Travis County, Texas, 78716, USA
Listing for: Advanced Micro Devices, Inc.
Full Time position
Listed on 2026-03-12
Job specializations:
  • IT/Tech
    Systems Engineer, IT Support
Salary/Wage Range or Industry Benchmark: 80,000 - 100,000 USD yearly
Job Description & How to Apply Below

Overview

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we push the limits of innovation to solve the world's most important challenges with bold ideas, human ingenuity, and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture: direct, humble, collaborative, and inclusive of diverse perspectives.

Join us as we shape the future of AI and beyond. Together, we advance your career.

The Team

AMD's Data Center GPU organization is transforming the industry with our AI-based Graphics Processors. Our objective is to design exceptional products that drive the evolution of computing experiences for enterprise data centers, AI, HPC, and embedded systems. If this resonates with you, join our Data Center GPU organization where we are building amazing AI-powered products with amazing people.

The Role

AMD is looking for a lead systems engineer to provide thought leadership and subject-matter expertise to our growing team. As a key contributor, you will draw on a strong technical background to contribute to all aspects of the software development process. We offer competitive benefit packages and an award-winning culture. Join us!

The Datacenter Graphics and Accelerated Computing (DCGPU) organization is looking for an experienced system-level debug engineer. The candidate will be part of a team that brings up the platform and ensures it is fully validated, including electrical, power, networking, and SOC. The individual will lead and document the plan for validating the system itself and create documentation for any unique steps needed to enable it.

The individual must drive issues encountered to root cause and closure, and communicate with functional and IP owners for resolution.

The Person

You are a highly motivated, hands-on leader with a strong development background, a problem-solving mentality, excellent communication skills, the ability to prioritize tasks, and a willingness to learn and adapt. You have excellent teamwork skills and the ability to lead a highly technical team.

Experience in debugging of complex HW/FW issues is essential. You should understand the flow of a GPU through the different layers of a system and be able to validate items connecting to the GPU SOC (PCIe, VRs, RMs, retimers, HBM, internal networking). Communication is essential when working with different owners of the functional code stack, and you should be able to drive issues via phone, chat, or email.

Hands-on experience with hardware in a data center environment is required.

Key Responsibilities
  • Debug and triage complex issues, applying an understanding of industry tools for root-cause analysis
  • Understand GPU / system-level HW and SW flow
  • Probe parts of a board; check electrical and power currents and validate a system
  • Provide leadership in driving issues to root cause
  • Communicate / document flows and methods of bring-up, boot-up, system initialization and debugging
  • Lead technical presentations demonstrating understanding of application, data, infrastructure, architecture expertise and application systems design
  • Collaborate with application and infrastructure architects; define-design-deliver technical architectures, patterns, technical quality, risks, fitness for purpose and operability of technical solutions
  • Be a leader and mentor to the operation team; be hands-on and lead by example
  • Hands-on troubleshooting and solving technical issues; own the problem and drive for resolution
  • Proactively support a team culture that fosters knowledge sharing, excellence, and collaboration
Preferred Experience
  • Significant experience in SoC and/or system debug of complex issues
  • Develop / document debug capabilities on a given SOC and system
  • Go-to person for debugging of issues for production-level platform validation
  • Collaborate with internal teams on root-cause issues, finding optimum resolutions
  • Hands-on experience using industry debug tools and board-level power analysis
  • Proven experience…