AWS Cloud Site Reliability Engineer
Listed on 2026-06-13
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability, IT Support
Optum is a global organization that delivers care, aided by technology, to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data, and resources they need to feel their best. Here, you will find a culture guided by diversity and inclusion, talented peers, comprehensive benefits, and career development opportunities.
Come make an impact on the communities we serve as you help us advance health equity on a global scale. Join us to start Caring. Connecting. Growing together.
You will be part of a world‑class identity matching solution team, building a state‑of‑the‑art application at the center of identity management for Optum Technology. The role requires providing 24×7 operational support for all production practices, including on holidays and weekends; coordinating with various teams; raising support tickets for all issues; analyzing root causes; and enabling efficient resolution across all production processes. You must maintain logs of all issues, ensure that resolutions pass quality assurance tests for all production processes, and have a strong understanding of the business processes within the various systems the application uses.
Primary Responsibilities
- Lead and mentor a team of SREs to ensure high‑quality delivery and professional growth
- Design, build, and maintain scalable and reliable systems using cloud‑native technologies
- Develop and implement monitoring, alerting, and observability strategies to ensure optimal system performance and user experience
- Automate operational tasks and drive infrastructure‑as‑code (IaC) adoption
- Proactively identify and resolve reliability risks, bottlenecks, and performance issues
- Leverage AI tools to streamline workflows, automate tasks, and drive continuous improvement
- Collaborate with engineering and product teams on architecture, code reviews, and incident response
- Lead post‑incident reviews (blameless retrospectives), root cause analysis, and continuous improvement initiatives
- Streamline migration processes, ensure consistency and enhance efficiency through automation and innovative solutions
- Define SLOs/SLIs, track error budgets, and report on system health to stakeholders
- Ensure compliance and security standards are integrated into system operations
- Stay current with emerging technologies and SRE best practices
Qualifications
- Bachelor’s degree in a CS/IT‑related field
- 3+ years of experience with AWS Cloud SDKs using Java (Spring Boot microservices), Scala, and Python
- 3+ years of experience with distributed data services (DynamoDB, Athena, or similar)
- 3+ years of experience with AWS Cloud services such as S3, CloudWatch, ECS, Lambda, RDS, and EMR
- 3+ years of experience with CI/CD using GitHub Actions or similar
- All telecommuters will be required to adhere to UnitedHealth Group’s Telecommuter Policy
- Experience in Unix, Hadoop, HBase, and Hive
- Experience working with offshore and onsite teams as part of job requirements
- Proven good communication skills
- 3+ years of experience with Elastic APM
- 3 years of experience with Scala
- 3 years of experience with Kubernetes clusters
Pay for this role ranges from $72,800 to $130,000 annually based on full‑time employment. The salary is based on local labor markets, education, work experience, certifications, and other factors. In addition to salary, we offer a comprehensive benefits package, incentive and recognition programs, equity stock purchase, and 401(k) contribution (subject to eligibility). We comply with all applicable minimum wage laws.
Pursuant to the San Francisco Fair Chance Ordinance, we will consider qualified applicants with arrest and conviction records. UnitedHealth Group is an Equal Employment Opportunity employer under applicable law; qualified applicants will receive consideration for employment without regard to race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations.
UnitedHealth Group is a drug‑free workplace; candidates must pass a drug test before beginning employment.