Software Engineering Manager, Site Reliability Engineering
Listed on 2026-06-26
-
IT/Tech
SRE/Site Reliability, Cloud Computing: Infrastructure & Operations, Systems Engineer
Software Engineering Manager, Site Reliability Engineering
Sunnyvale, CA, USA
Qualifications- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- 8 years of experience in software engineering, systems engineering, or site reliability engineering.
- 5 years of experience building and developing large‑scale infrastructure or distributed systems.
- 2 years of experience with people management.
Master's degree in Computer Science or Engineering, or a related field.
About the jobSite Reliability Engineering (SRE) combines software and systems engineering to build and run large‑scale, massively distributed, fault‑tolerant systems. The SRE team ensures that Google Cloud’s services—both internally critical and externally visible—maintain reliability, uptime appropriate to customer needs, and continuous improvement. SREs also monitor system capacity and performance.
In this role you will manage the complex challenges of scale unique to Google Cloud while applying expertise in coding, algorithms, complexity analysis, and large‑scale system design. You will lead a small team, foster a culture of curiosity and open problem‑solving, and enable the team to take ownership of meaningful projects.
Key responsibilities include:
- Managing the scale and availability of next–generation Workspace GenAI features in collaboration with the Workspace AI SRE team, ensuring model‑based features remain fast, reliable, and degrade gracefully under load.
- Operating a newly partitioned Spanner storage topology, completing physical isolation of Spanner allocations per Editor and shard, and managing elastic resource capacity through autoscaling.
- Orchestrating large‑scale, multi‑system restore operations for critical customer data, contributing to tools and playbooks that coordinate data recovery across dependencies, validate data correctness, and restore integrity after complex platform incidents.
- Directing the resource headroom and efficiency roadmap for the Editors portfolio.
US: $207,000 – $301,000 (USD) + 20% bonus target + equity + benefits.
Equal‑Employment OpportunityGoogle is an equal opportunity and affirmative action employer. We are committed to building a workforce representative of the users we serve and to fostering a culture of belonging. Our hiring decisions are made without regard to race, creed, color, religion, gender, sexual orientation, gender identity or expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, criminal history, or any other basis protected by law.
Please see our EEO and hiring policies for additional information.
Google is a global company and requires English proficiency for all roles unless otherwise specified.
To all recruiters:
Google does not accept agency resumes. Please do not forward resumes for this position.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).