Network Development Engineer, Annapurna Labs Infrastructure
Listed on 2025-10-08
-
IT/Tech
Systems Engineer, Cloud Computing, Cybersecurity, Network Engineer
Overview
Join to apply for the Network Development Engineer, Annapurna Labs Infrastructure role at Amazon Web Services (AWS).
Annapurna Labs is an organization within AWS responsible for building innovation in silicon and software for AWS customers. With development centers in the U.S. and Israel, Annapurna is at the forefront of innovation by combining cloud scale with the world’s most talented engineers. The Annapurna team covers silicon engineering, hardware design and verification, software, and operations. The team has contributed to AWS cloud infrastructure in networking and security with products such as AWS Nitro, Enhanced Network Adapter (ENA), and Elastic Fabric Adapter (EFA), as well as in compute (e.g., AWS Graviton and F1 EC2 Instances), machine learning (AWS Neuron, Inferentia and Trainium ML Accelerators), and scalable NVMe storage.
As part of the Annapurna Labs Infrastructure team, you will have the opportunity to contribute to the next generation of cloud computing infrastructure. The role involves a fast-paced, innovative environment focused on delivering high-impact infrastructure for Machine Learning Accelerators, including on-premise and cloud deployments for accelerated computing.
Key responsibilities- The Network Development Engineering role involves developing a broad range of skills. Leverage Linux expertise to troubleshoot, implement fixes and workarounds, keep software up-to-date, and provide data and metrics to manage services.
- Design networks, develop network monitoring, and troubleshoot connectivity issues. Communicate clearly and collaborate with others to deliver results. Be a self-starter, comfortable with ambiguity and change. Be customer-obsessed, understanding customer pain points and delivering resolutions quickly and completely.
- Lead across teams to develop and execute infrastructure plans that enable customers and engineering teams developing the Machine Learning Acceleration product family. Dive deep to solve critical infrastructure issues involving networking, high-performance compute clusters, infrastructure automation of hardware/software/firmware testing, and ASIC/EDA development.
- Influence within your team, customers, and AWS service teams to drive and develop technical implementations for overall infrastructure designs. Identify and implement process improvements to improve agility and operations, including design, automation, development, test, or operations.
- Define new mechanisms for system health monitoring, diagnostics, repair, and automation. Develop, document, and update operational runbooks as you participate in on-call rotations.
- Work with customers to translate requirements into cloud and on-premise infrastructure solutions. Define infrastructure requirements for labs and server rooms, and liaison with contractors and vendors for infrastructure.
- Take ownership for testing, deployments and measuring infrastructure health; support silicon development workflows including ATE testers, emulators, and lab debug equipment.
- Collaborate with top engineers to develop Machine Learning Accelerators.
- Work backwards from customers to develop infrastructure requirements for cloud and on-premise environments.
- Deliver on-premises infrastructure that meets customer needs; own testing, deployments, and health metrics.
- Participate in on-call rotations and maintain runbooks.
- 4+ years of experience with major internet routing protocols.
- 4+ years of experience in a Linux/Unix environment.
- 1+ years of automation scripting using Python, Bash, Shell and/or Perl.
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit (Use the "Apply for this Job" box below).
for more information. If the country/region you’re…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).