AI/DCGPU Full-Stack Solutions Architect
Listed on 2025-12-19
-
IT/Tech
AI Engineer, Machine Learning/ ML Engineer, Data Scientist, Systems Engineer
WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture.
We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.
THE ROLE
The Architecture & Strategy group is an influence‑based collection of senior technology professionals and this AI‑focused role in its Solutions Architecture sub‑group is key to the group's and AMD's continued success.
This is not a silicon‑focused role but is instead a solutions‑focused position that will work with peers within the Solutions Architecture team, across the Architecture & Strategy group in AMD's Data Center Solutions Group (DSG), and as needed across other DSG as well as AMD's Data Center GPU teams. This new role has been created to ensure our team has a seat at the table and maintains visibility to the rapid changes in the AI technology landscape both inside AMD and in the outside AI ecosystems.
THEPERSON
We are looking for a candidate with hands‑on full‑stack AI skills who both understands and can effectively communicate the areas where the current state of hardware, platform software, AI‑focused software, data, and operations meet as well as where those elements are likely to go in the future. That fluency is required to supply meaningful feedback on where AMD's AI efforts should focus in the future for the company's greatest competitive advantage.
KEY RESPONSIBILITIESWe are looking for a senior AI‑focused team member with responsibilities in three primary areas:
- Infrastructure Configuration
- CPU/GPU compute
- AI‑specific backend networking and design
- Storage technologies and tiering aligned to AI training and inference pipelines
- Bare-metal Linux with AI hw/sw tweaks and configurations
- AI software platforms like Kubernetes for Cloud‑Native implementations, and SLURM for HPC‑focused setups
- LLMs/SLMs (primary)
- Understand how to platform / tune both training‑ and inference‑focused language models on AMD Instinct and EPYC
- AI model and RAG pipelining to tailor model and agent behavior to suit data accuracy and output expectations
- Experience with large‑scale model training or inference
- MCP and data pipelining
- Remain current with traditional ML/DL and model generation outside of just LLMs
- Python
- Familiarity with AMD ROCm or NVIDIA CUDA with an ML framework (PyTorch, JAX, vLLM, Tensor
RT, NeMo, …)
- Act as a full‑stack, AI solutions‑focused resource for customer discussions and potential on‑site meetings (some travel required)
- Maintain and build AI ecosystem participation and visibility
- Including CfP submissions for AI conferences
- Experience in both current AI technologies and the technologies that preceded the rise in language models, e.g.
- Traditional Machine Learning / Deep Learning
- Experience with Data Engineering techniques and technologies
- Readying data from disparate sources (structured, unstructured, stream and batch) for ingest into data pipelines
- Experience with direct customer and partner engagement on AI technology selection, AI platform architectures
- Experience in public speaking at AI / Data‑focused technology conferences
- Bachelor's Degree in a technical field (e.g. engineering, mathematics, statistics), Masters preferred
- This is a Senior level role; no recent college graduates will be considered
T…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).