Site Reliability Engineer
Job in
Irvine, Orange County, California, 92713, USA
Listed on 2026-02-10
Listing for:
Swoon
Contract position
Job specializations:
- IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability, IT Support
Job Description & How to Apply Below
Swoon is actively seeking a Product Platform Site Reliability Engineer to join the team!
Basics of the role:
- Location – Irvine, CA
- On-Site – 5 days a week – Monday – Friday
- 3-6 month contract role
- Hourly Rate - $50-$60 an hour
- Must be a US Citizen or Permanent Resident – due to the security requirements of this role, no other work authorizations can be accepted.
Responsibilities:
- Design, build, and operate shared multi-cloud platform infrastructure (AWS, GCP, Azure) to support secure, scalable, and highly available healthcare applications.
- Develop and manage Kubernetes-based platform services, including multi-cluster environments and service mesh (Istio), to ensure resilient application delivery.
- Implement and maintain Infrastructure-as-Code and automation frameworks (Terraform, Helm, CloudFormation, Ansible) to standardize and streamline platform environments.
- Build and operate CI/CD and GitOps pipelines (Bitbucket Pipelines, ArgoCD) to enable reliable, repeatable, and zero-downtime application deployments.
- Architect and maintain high-availability, disaster recovery, and cross-region deployment solutions for mission-critical services.
- Establish and manage platform-wide monitoring, observability, and alerting systems (Prometheus, Grafana, OpenTelemetry) to proactively ensure reliability and performance.
- Enforce security, compliance, and cost-optimization practices (HIPAA, SOC 2, ISO 27001, FinOps) while reducing operational toil through automation and continuous improvement.
Qualifications:
- Bachelor’s degree in Computer Science (or related field)
- 4+ years of hands-on experience operating production-grade cloud and platform infrastructure
- Demonstrated expertise in Kubernetes, cloud platforms, and Infrastructure-as-Code, including CI/CD, GitOps, and automated environment management.
- Strong background in monitoring, observability, and reliability engineering, with experience supporting highly available, distributed systems.
- Proven ability to diagnose, troubleshoot, and resolve complex platform and infrastructure issues, including participation in on-call rotations and incident response.
- Proficiency in at least one scripting or programming language (Python, Go, or Bash) for automation, tooling, and operational support.