Site Reliability Engineer II
Listed on 2026-03-11
-
IT/Tech
Cloud Computing, Systems Engineer, Cybersecurity, SRE/Site Reliability
Onwards Together! Illumio is the leader in ransomware and breach containment, redefining how organizations contain cyberattacks and enable operational resilience. Powered by the Illumio AI Security Graph, our breach containment platform identifies and contains threats across hybrid multi-cloud environments – stopping the spread of attacks before they become disasters. Recognized as a Leader in the Forrester Wave™ for Microsegmentation, Illumio enables Zero Trust, strengthening cyber resilience for the infrastructure, systems, and organizations that keep the world running.
Location5 on-site days a week in Sunnyvale, CA Headquarters.
Our Team's VisionOur Engineering team is driven by a culture that thrives on visionary leadership, autonomy, and ownership, creating a dynamic synergy that drives us forward in the ever-evolving landscape of cybersecurity.
When you join our team, you become part of the leader in Zero Trust Segmentation. You'll work with a cutting-edge technology stack that spans operating systems, distributed applications, and immersive UI/visualization tools.
We're shaping the future of cybersecurity. And together, we will continue to build world-class products—led by people with different perspectives, backgrounds, and a commitment to innovation in a time when the world faces its greatest cybersecurity threats in history.
Your ImpactAs an SRE Engineer II, you will be responsible for managing our multi-cloud infrastructure on Azure, AWS and/or GCP. As and when required, you will be responsible for designing new services and applications in the cloud(s) and take them from development to production while working closely with Engineering, SRE/OPS, and Security teams.
On a day-to-day basis, you will work on enhancing system reliability and scalability of Illumio SaaS products, and drive continuous improvement initiatives.
The ideal candidate will have a passion for cloud technology, automation, and collaboration, along with a solid foundation in Azure cloud platform and related Dev Ops practices.
- Design, deploy, and maintain cloud infrastructure solutions on Azure, AWS, and/or GCP to support our applications and services
- Implement infrastructure as code (IaC) principles using tools such as Terraform, ARM templates, or Cloud Formation to automate provisioning and configuration management
- Develop and maintain CI/CD pipelines for automated software delivery and deployment, leveraging tools such as Azure Dev Ops, AWS Code Pipeline, or Jenkins
- Monitor system performance, application health, and infrastructure metrics using cloud monitoring and logging services, and implement proactive measures to optimize performance and availability
- Support incident response and resolution efforts, conduct root cause analysis, implement corrective actions, and document post-incident reviews
- Collaborate with Engineering teams to design and implement scalable and reliable architectures, providing guidance on best practices for cloud-native application development
- Implement security best practices and controls in cloud environments to protect data, applications, and infrastructure, and ensure compliance with regulatory requirements
- Drive automation initiatives to streamline operational tasks, reduce manual effort, and improve overall efficiency in cloud operations
- Stay current with cloud platform updates, trends, and best practices, and evaluate emerging technologies for potential adoption to drive innovation and efficiency
- Provide support and guidance to junior team members, fostering a culture of learning, collaboration, and continuous improvement within the SRE/Dev Ops team
- Bachelor's degree in Computer Science, Engineering, or related field; or equivalent work experience
- 2+ years of experience working as an SRE, Dev Ops Engineer, or similar role, with hands‑on experience in Azure cloud platform in a production environment setting
- Exposure to AWS and/or GCP cloud platforms is preferred
- Proficiency in scripting and programming languages such as Power Shell, Python, or Go for automation and infrastructure management tasks
- Experience with CI/CD tools and methodologies, containerization technologies, and…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).