×
Register Here to Apply for Jobs or Post Jobs. X

Network Engineer II

Job in Redmond, King County, Washington, 98073, USA
Listing for: Microsoft Corporation
Full Time position
Listed on 2026-05-21
Job specializations:
  • IT/Tech
    Systems Engineer
Job Description & How to Apply Below
** Overview*
* The HPC/AI (High performance Computing and Artificial Intelligence) team is on a mission to build the next-generation distributed AI supercomputer, enabling breakthroughs in artificial intelligence by delivering unmatched computational power, scalability and reliability. We design and develop cutting-edge infrastructure that supports high-performance AI model training at scale, laying the foundation for innovations that redefine what AI can achieve.

We are seeking passionate and innovative engineers to design, build and manage cutting-edge networking infrastructure that powers large-scale AI training. This role focuses on developing next-generation networking capabilities to ensure high performance, low latency, and minimal jitter for distributed AI workloads. You will play a critical role in enabling state-of-the-art AI systems to achieve their full potential.

As a Network Engineer on the HPC/AI team, you will play a pivotal role in shaping and managing the next-generation networking infrastructure for AI training and inference in Azure Cloud. This is a unique opportunity to work at the intersection of two of the hottest fields in technology: AI and high-performance computing. With the explosive growth of generative AI and the increasing demand for large-scale, low-latency systems, this area is at the forefront of innovation and impact.

You will work across diverse network architectures and cutting-edge processor and accelerator technologies, driving the design and delivery of a comprehensive, end-to-end solution with a relentless focus on performance, scalability, and observability. If you're passionate about groundbreaking technology, large-scale systems, and AI infrastructure, join us to build the platform that will power the future of AI supercomputing!

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

** Responsibilities*
* + Demonstrates some knowledge of data - knows what data is needed, knows how to find new or missing data, and can describe defects and their relevance to product and service targets. Identifies patterns and trends in data and interprets them to inform decisions related to products and/or services.

+ Collaborates with teams across the organization to support and manage safe and secure network deployments.

+ Works with machine-readable definitions to manage deployments.

+ Supports the management of incidents by applying technical knowledge to diagnose and triage issues with a commitment to maintaining the quality of products and services. Takes notes during incidents and participates in postmortem and root cause analysis processes.

+ Performs testing and validation of network devices, firmware, and configurations. Defines and implements test cases with existing automation tools, and exposes test coverage gaps.

+ Triages, troubleshoots, and repairs live site issues by applying an understanding of network components and features (e.g., device operating systems) as well as problem management tools (e.g., root cause analysis, trend analysis, postmortems), to discover and drive solutions with minimal or no disruption to customers. Actively participates in on-call/DRI duties to troubleshoot and may actively resolve incidents in production.

+ Monitors network telemetry and performs analyses to identify patterns that reveal errors and unexpected problems. Makes suggestions on improvements to monitoring based on observations and experience.

+ Provides instructions to datacenter or network site staff/technicians on how to securely repair, replace, and maintain physical network hardware and components deployed in production. Identifies gaps and inefficiencies in processes related to securely installing and deploying new hardware and components and provides instructions to address gaps.

** Qualifications*
* *
* Required Qualifications:

*
* + Master's…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary