×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineering II

Job in Dallas, Dallas County, Texas, 75215, USA
Listing for: Innovaccer
Full Time position
Listed on 2025-10-25
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, SRE/Site Reliability, IT Support
Salary/Wage Range or Industry Benchmark: 125000 - 150000 USD Yearly USD 125000.00 150000.00 YEAR
Job Description & How to Apply Below
Position: 3510- Site Reliability Engineering II

Overview

We at Innovaccer are looking for a Site Reliability Engineer-II to build secured modern healthcare cloud infrastructure and a massive data stack and aim to write everything as code.

Responsibilities
  • Take ownership of SRE pillars:
    Deployment, Reliability, Scalability, Service Availability (SLA/SLO/SLI), Performance, and Cost
  • Lead production rollouts of new releases and emergency patches using CI/CD pipelines while continuously improving deployment processes
  • Establish robust production promotion and change management processes with quality gates across Dev/QA teams
  • Roll out a complete observability stack across systems to proactively detect and resolve outages or degradations
  • Analyze production system metrics, optimize system utilization, and drive cost efficiency
  • Manage autoscaling of the platform during peak usage scenarios
  • Perform triage and RCA by leveraging observability tool chains across the platform architecture
  • Reduce escalations to higher-level teams through proactive reliability improvements
  • Participate in the 24x7 OnCall Production Support team
  • Lead monthly operational reviews with executives covering KPIs such as uptime, RCA, CAP (Corrective Action Plan), PAP (Preventive Action Plan), and security/audit reports
  • Operate and manage production and staging cloud platforms, ensuring uptime and SLA adherence
  • Collaborate with Dev, QA, Dev Ops, and Customer Success teams to drive RCA and product improvements
  • Implement security guidelines (e.g., DDoS protection, vulnerability management, patch management, security agents)
  • Manage least-privilege RBAC for production services and tool chains
  • Build and execute Disaster Recovery plans and actively participate in Incident Response
  • Work with a cool head under pressure and avoid shortcuts during production issues
  • Collaborate effectively across teams with excellent verbal and written communication skills
  • Build strong relationships and drive results without direct reporting lines
  • Take ownership, be highly organized, self-motivated, and accountable for high-quality delivery
Qualifications
  • 4-7 years in production engineering, site reliability, or related roles
  • Solid hands-on experience with at least one cloud provider (AWS, Azure, GCP) with automation focus (certifications preferred)
  • Strong expertise in Kubernetes and Linux
  • Proficiency in scripting/programming (Python required)
  • Observability is very critical for the scale of our systems and ability to find insights/behavior, detect problem/failures. Looking for leads to drive this charter spanning across logs, metrics, mesh, tracing etc.
  • Knowledge of CI/CD pipelines and tool chains (Jenkins, ArgoCD, Git Ops)
  • Familiarity with persistence stores (Postgres, Mongo

    DB), data warehousing (Snowflake, Databricks), and messaging (Kafka)
  • Exposure to monitoring/observability tools such as Elastic Search, Prometheus, Jaeger, New Relic, etc
  • Proven experience in production reliability, scalability, and performance systems
  • Experience in 24x7 production environments with process focus
  • Familiarity with ticketing and incident management systems
  • Security-first mindset with knowledge of vulnerability management and compliance
  • Advantageous: hands-on experience with Kafka, Postgres, and Snowflake
  • Excellent judgment, analytical thinking, and problem-solving skills
  • Ability to quickly identify and drive optimal solutions within constraints
  • Lead least privilege based RBAC for various production services and tool chains
  • Able to perform with cool head under pressure situations without taking any shortcuts
  • Collaboration with solid verbal and oral communication skills are very critical to this role. Strong cross-functional collaboration skills, relationship building skills, and ability to achieve results without direct reporting relationships
  • Ability to quickly identify and drive to the optimal solution when presented with a series of constraints
  • Excellent judgment, analytical thinking, and problem-solving skills
  • Self-motivated individual that possesses excellent time management and organizational skills
  • Strong sense of personal responsibility and accountability for delivering high quality work.
Benefits
  • Generous Paid Time Off: 22 days per year plus company holidays
  • Best-in-Class Parental Leave
  • Recognition & Rewards: monetary incentives and company-wide recognition
  • Comprehensive Insurance Coverage: medical, dental, vision; 100% company-paid disability and basic life insurance

Innovaccer Inc. is an equal opportunity employer. We celebrate diversity and are committed to fostering an inclusive workplace where all employees feel valued and empowered regardless of protected characteristics. Innovaccer Inc. participates in the E-Verify program to confirm employment eligibility of all newly hired employees based out of the U.S. and employed by Innovaccer Inc.

Disclaimer:
Innovaccer does not charge fees or require payment from individuals or agencies for securing employment with us. We do not guarantee job spots or engage in any financial transactions related to employment. If you encounter any…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary