Sr. Site Reliability Engineer
Listed on 2026-05-19
-
IT/Tech
Cloud Computing, Systems Engineer
Onwards Together!
Illumio is the leader in ransomware and breach containment, redefining how organizations contain cyberattacks and enable operational resilience. Powered by the Illumio AI Security Graph, our breach containment platform identifies and contains threats across hybrid multi‑cloud environments – stopping the spread of attacks before they become disasters.
Recognized as a Leader in the Forrester Wave for Microsegmentation, Illumio enables Zero Trust, strengthening cyber resilience for the infrastructure, systems, and organizations that keep the world running.
Location: 5 on-site days a week in Sunnyvale, CA Headquarters.Our Team's Vision:
Our Engineering team is shaping the future of cybersecurity. We thrive on visionary leadership, autonomy, and ownership, fostering a culture of innovation that propels us forward in the ever‑evolving cybersecurity landscape.
As a leader in Zero Trust Segmentation, we are redefining security for a world facing unprecedented cyber threats. You’ll work with a highly scalable SaaS service built using cloud‑native technologies while simultaneously shipping the solution on‑premises.
Our guiding philosophy in Engineering is to get things right through practicing disciplined engineering, focusing, not cutting corners, and of course having fun while we are believe in enabling ownership at all levels of the organization and empowering teams. If you thrive in this culture, come join us!
Your Impact:We are looking for an experienced Senior Site Reliability Engineer (SRE) with a strong background in AWS & Azure cloud platforms to play a key role in ensuring the reliability, scalability, and performance of our cloud‑based systems and applications.
The ideal candidate will have hands‑on experience in supporting, and managing AWS and Azure infrastructure, along with a passion for automation, continuous improvement, and collaboration with cross‑functional teams.
If you are passionate about AWS and/or Azure cloud platform and have a track record of driving reliability, scalability, and performance in cloud‑based environments, we’d love to hear from you.
- Monitor system performance, application health, and infrastructure metrics using monitoring and logging services, and implement proactive measures to optimize performance and availability
- Oncall duty for production uptime and support for customer escalations
- Release upgrades and maintenance activities including hotfixes and infrastructure updates
- Lead incident response and resolution efforts, conducting root cause analysis, implementing corrective actions, and documenting post‑incident reviews
- Implement security best practices and controls in the cloud environments to protect data, applications, and infrastructure, and ensure compliance with regulatory requirements
- Drive continuous improvement initiatives to enhance reliability, scalability, and efficiency of infrastructure and services, leveraging automation and emerging technologies
- Bachelor’s degree in computer science, Engineering, or related field; or equivalent work experience
- 5+ years of experience working as a Site Reliability Engineer (SRE) or similar role, with a focus on AWS and/or Azure cloud platform
- Hands‑on experience in designing, deploying, and managing AWS and/or Azure infrastructure, including compute, storage, networking, and security services
- Proficiency in scripting and programming languages such as Power Shell, Python, or Go for automation and infrastructure management tasks
- Strong understanding of CI/CD principles and experience with tools such as Azure Dev Ops, Jenkins, or Git Lab CI/CD
- Experience with containerization technologies (e.g., Docker, Kubernetes) and microservices architecture in AWS and Azure environments is a plus
- Excellent analytical, problem‑solving, and communication skills, with the ability to collaborate effectively with cross‑functional teams
- AWS or Azure certifications such as AWS/Azure Solutions Architect, Azure Dev Ops Engineer, or Azure Security Engineer are preferred
For roles in San Francisco and Los Angeles:
Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Illumio will consider for employment qualified applicants with arrest and conviction records.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).