Senior Site Reliability Engineer
Job in
Sunnyvale, Santa Clara County, California, 94087, USA
Listed on 2026-06-26
Listing for:
Illumio
Full Time
position Listed on 2026-06-26
Job specializations:
-
IT/Tech
Cloud Computing: Infrastructure & Operations, SRE/Site Reliability, Systems Engineer, Azure
Job Description & How to Apply Below
- We are looking for an experienced Senior Site Reliability Engineer (SRE) with a strong background in AWS & Azure cloud platforms to play a key role in ensuring the reliability, scalability, and performance of our cloud-based systems and applications.
- The ideal candidate will have hands‑on experience in supporting, and managing AWS and Azure infrastructure, along with a passion for automation, continuous improvement, and collaboration with cross‑functional teams.
- Monitor system performance, application health, and infrastructure metrics using monitoring and logging services, and implement proactive measures to optimize performance and availability.
- On‑call duty for production uptime and support for customer escalations.
- Release upgrades and maintenance activities including hotfixes and infrastructure updates.
- Lead incident response and resolution efforts, conducting root cause analysis, implementing corrective actions, and documenting post‑incident reviews.
- Implement security best practices and controls in the cloud environments to protect data, applications, and infrastructure, and ensure compliance with regulatory requirements.
- Drive continuous improvement initiatives to enhance reliability, scalability, and efficiency of infrastructure and services, leveraging automation and emerging technologies.
- Health and Dependent Savings Accounts
- Life and Disability Programs
- Paid Parental Leave
- Voluntary Benefit Programs
- Company Sponsored Wellness Program
- Wellness Reimbursement Program
- Retirement Savings
- Equity Opportunities
- Paid time off and Paid Holidays
- Employee Incentive Program
- Medical, Dental, Vision Coverage
- Proficiency in scripting and programming languages such as Power Shell, Python, or Go for automation and infrastructure management tasks.
- Experience with containerization technologies (e.g., Docker, Kubernetes) and microservices architecture in AWS and Azure environments is a plus.
- Strong understanding of CI/CD principles and experience with tools such as Azure Dev Ops, Jenkins, or Git Lab CI/CD.
- 5+ years of experience working as a Site Reliability Engineer (SRE) or similar role, with a focus on AWS and/or Azure cloud platform.
- Hands‑on experience in designing, deploying, and managing AWS and/or Azure infrastructure, including compute, storage, networking, and security services.
- Bachelor’s degree in computer science, Engineering, or related field; or equivalent work experience.
- AWS or Azure certifications such as AWS/Azure Solutions Architect, Azure Dev Ops Engineer, or Azure Security Engineer are preferred.
- Excellent analytical, problem‑solving, and communication skills, with the ability to collaborate effectively with cross‑functional teams.
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×