×
Register Here to Apply for Jobs or Post Jobs. X

Principal Group Software Engineering Manager

Job in Redmond, King County, Washington, 98053, USA
Listing for: Microsoft Corporation
Full Time position
Listed on 2026-05-23
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing
Job Description & How to Apply Below
Overview

M365 Copilot inference is a high-impact engineering team advancing applied AI and large-scale machine learning across Microsoft. We design and operate the platform powering Microsoft 365 Copilot experiences.

Our team is operating at massive GPU (Graphics Processing Unit) scale across multiple regions and SKUs in global datacenters. We build the core LLM (Large Language Model) API (Application Programming Interface) , routing, capacity, and control plane services that turn that fleet into Copilot experiences.

We are hiring a Principal Group Software Engineering Manager to own GPU fleet health, capacity intake and planning, and automated model deployment for Copilot. This is one of the most strategic leadership roles in Copilot: every feature, experiment, and model launch flows through the systems this leader owns. You will lead existing teams, grow the org, and build the control plane that turns capacity management from a manual, ticket-driven process into an automated, self-driven platform.

You will own end‑to‑end GPU fleet health and capacity platform, establishing a single source of truth with strong observability across hardware, hosts, and workloads to drive utilization and reliability. Design and scale capacity intake, planning, and deployment reducing models time‑to‑production and meeting SLAs (service level agreement) for priority workloads through automation and data‑driven operations.

Build a unified control plane that connects intake, planning, deployment, and fleet operations, enabling global optimization across cost, latency, compliance, and flexible model scaling (0→1 platform ownership).

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

* Build and lead a high-performing organization of engineering managers and senior engineers across capacity buildouts/automation, capacity planning, and the control plane.

* Set the strategy and roadmap for Copilot capacity management and the control plane.

* Drive execution across existing teams today, with a clear plan to grow the org as control plane scope expands.

* Partner deeply with Copilot, AI Core, Azure to align demand, supply, and COGs (cost of goods sold) for Copilot workloads.

* Own live-site, reliability, and operational excellence for the capacity surface area.

* Establish metrics and SLAs for intake latency, fleet utilization, automation coverage, and time-to-deploy; use them to guide investment decisions.

* Coach and grow managers and senior ICs (individual contributor); raise the engineering bar; recruit experienced platform leaders into the team.

* Represent capacity in executive reviews and cross-org leadership forums; communicate trade-offs between cost, speed, and reliability with clarity.

Qualifications

Required Qualifications:

* Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python

* OR equivalent experience.

Other Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:

* Microsoft Cloud Background Check:
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

* Master's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python

* OR Bachelor's Degree in Computer Science or related technical field AND 15+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python

* OR…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary