×
Register Here to Apply for Jobs or Post Jobs. X

Systems Reliability Engineer

Job in Mission, Hidalgo County, Texas, 78512, USA
Listing for: El Camino Health
Full Time position
Listed on 2025-12-01
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, Cybersecurity, IT Support
Job Description & How to Apply Below
Staff Systems Reliability Engineer page is loaded## Staff Systems Reliability Engineer remote type:
Fully Remote locations:
Remote - UStime type:
Full time posted on:
Posted Yesterday job requisition :
JR645
** Career-defining. Life-changing.
** At iRhythm, you’ll have the opportunity to grow your skills and your career while impacting the lives of people around the world. iRhythm is shaping a future where everyone, everywhere can access the best possible cardiac health solutions. Every day, we collaborate, create, and constantly reimagine what’s possible. We think big and move fast, driven by our commitment to put patients first and improve lives.

We need builders like you. Curious and innovative problem solvers looking for the chance to meaningfully shape the future of cardiac health, our company, and your career
*
* About This Role:

** We are seeking a highly experienced and strategic Staff System Reliability Engineer V to lead the design, scalability, and resilience of our cloud infrastructure. This role is ideal for someone with deep expertise in AWS, infrastructure automation, and observability who thrives in complex, high-availability environments. As a senior technical leader, you’ll work closely with engineering and security teams to optimize performance, improve deployment pipelines, and uphold service reliability across mission-critical systems.
** What You Will Be Doing
*** Design and implement scalable, fault-tolerant AWS-based infrastructure using Terraform and/or Cloud Formation for regulated workloads (e.g., HIPAA, FDA CFR Part 11, EU MDR).
* Develop and maintain CI/CD pipelines using tools like Git Lab CI, ArgoCD, or similar.
* Write automation tools and scripts in Python and/or Go to support operations, monitoring, and self-healing systems.
* Lead incident response efforts, root cause analysis, and postmortem documentation for system failures.
* Git Lab pipeline authoring
* Kubernetes (EKS) cluster management support.
* Ability to migrate applications from ELB/ALB EC2 instances to k8s using Helm for configuration management.
* Define and monitor SLOs, SLAs, and error budgets across key services.
* Implement and manage observability tools (e.g., Prometheus, Grafana, Cloud Watch, Open Telemetry).
* Collaborate with software engineers to ensure systems are designed for reliability and security from the ground up.
* Harden system security by implementing least privilege IAM, automated patching, and vulnerability management.
* Evaluate and onboard new technologies to improve infrastructure efficiency and resilience.
* Mentor junior SREs and promote best practices in reliability engineering across the organization.
** What We Need to See
*** Requires a minimum of 12 years of related experience with a Bachelor’s degree; or 8 years and a Master’s degree; or a PhD with 5 years’ experience; or equivalent experience.
** Ways To Stand Out
*** Expert-level knowledge of AWS services (EC2, Lambda, VPC, IAM, RDS, ECS/EKS, etc.).
* Helm, Argo CD
* Git Lab: ability to abstract complexity to templated pipeline archetypes for similar development projects.
* Strong proficiency in Python and/or Go for automation and tooling.
* Deep understanding of infrastructure-as-code and Git Ops workflows.
* Experience managing observability and alerting systems at scale.
* Strong grasp of Linux systems, networking, and distributed architecture principles. Familiarity with regulatory requirements such as FDA 21 CFR Part 11, HIPAA, ISO 13485, and EU MDR as they relate to infrastructure and Dev Ops.
* Strong written and verbal communication skills, including documentation and incident reporting.
** Work Environment / Other Requirements:
*** Occasional travel to office if in Bay area.
*
* What’s In It for You:

*** Competitive compensation including base salary, annual performance bonus, and stock/equity opportunities.
* Outstanding benefits package with comprehensive medical, dental, vision, and wellness programs.
* Generous paid time off including vacation, holidays, and sick leave — because work/life balance matters.
* Flexible work options including hybrid and remote arrangements, depending on your location.
* 401(k) with company match and financial…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary