Principal Firmware Engineer, Annapurna Labs ML Acceleration Systems Software
Listed on 2026-06-03
Software Development
Software Engineer, DevOps
Overview
At Annapurna Labs, we are at the forefront of hardware/software accelerator solutions, not only for Amazon Web Services (AWS) but across the industry. The Machine Learning Acceleration Systems Firmware team is looking for candidates interested in diving deep into our designs of Machine Learning servers and developing world‑class firmware to support current and future generations of accelerator silicon.
Responsibilities
In this role, you will lead a team of software and firmware developers to build reliable server firmware deployed on millions of accelerators across EC2. You will build AI‑driven software tooling that diagnoses the root causes of system failures, work that directly impacts how our customers leverage AWS Trainium for their machine learning workloads. You will design and develop server software at AWS scale, collaborate with hardware developers and software engineers to design validation strategies that ensure reliability across the entire product line, mentor your team through complex technical challenges, establish operational procedures that scale across products, and work cross‑functionally to integrate design‑for‑excellence principles into our development process.
You will also participate in technical discussions that shape how we approach system design & validation, ensuring we catch issues before they reach customers. This is a fast‑paced, intellectually challenging position that demands high standards and a constant pursuit of performance, quality and cost improvement. Using data and key metrics, you will drive and measure process improvements that enhance operational effectiveness.
Day in the Life
Your day‑to‑day responsibilities will include interfacing with internal and external customers to understand project requirements and facilitate system development on top of your server design. You will learn operational challenges of our existing fleet with the goal of improving the current customer experience as well as developing improved systems for future designs. You will work directly with vendors and ODM/JDM design teams to develop and manufacture your product at scale.
Basic Qualifications
- 7+ years of experience working directly with engineering teams
- Experience managing programs across cross‑functional teams, building processes and coordinating release schedules
- Experience building and evaluating system‑level technical design
- Bachelor's degree in Computer Science, Computer Engineering, or related fields
- Experience managing teams, or experience as a mentor, tech lead or leading an engineering team
- Experience in software development, or experience troubleshooting and debugging technical systems, with strong analytical skills, attention to detail, and effective communication abilities
- Experience with hardware/software integration and real‑time systems
- 10+ years of systems software or firmware engineering
- Proficiency with programming languages commonly used in systems software (such as C, C++, Rust, or Python)
- 5+ years of experience in project management disciplines, including scope, schedule, budget, quality, risk, and critical path management
- Experience managing projects across cross‑functional teams, building sustainable processes and coordinating release schedules
- Experience defining KPIs/SLAs used to drive multi‑million‑dollar businesses and reporting to senior leadership
- Master's degree in Computer Science, Computer Engineering, or related fields
- Experience troubleshooting and debugging technical systems
- 5+ years of embedded firmware development experience
- Knowledge of data center infrastructure design, operations, or delivery
- Experience navigating a knowledge base and following Standard Operating Procedures (SOPs)
- Experience with AI or machine learning applications in systems engineering
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.