Senior Site Reliability Engineer
Listed on 2026-06-04
-
IT/Tech
Systems Engineer, Cloud Computing: Infrastructure & Operations
US Corp. is seeking a Lead Site Reliability Engineer to spearhead our mission of delivering highly available and performant systems. With an average of over 12 years of industry experience, the successful candidate will bridge the gap between software development and systems engineering. You will be responsible for designing and implementing automated infrastructure using Terraform, managing containerized workloads within Kubernetes, and refining our CI/CD pipelines to ensure seamless code deployment.
This role requires a deep dive into system internals, identifying bottlenecks, and implementing robust monitoring and observability solutions using Prometheus and Grafana. As a technical leader, you will define and maintain SLIs, SLOs, and SLAs while leading incident response and post-mortem analyses to prevent recurrence. You will work closely with product teams to ensure architectural scalability and reliability from the ground up.
The ideal candidate is an automation enthusiast with a proactive mindset toward security, scalability, and system resilience in a cloud-native environment.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).