×
Register Here to Apply for Jobs or Post Jobs. X

Lead Site Reliability Engineer

Job in Austin, Travis County, Texas, 78716, USA
Listing for: Cox Communications
Full Time position
Listed on 2025-12-31
Job specializations:
  • IT/Tech
    Systems Engineer, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below

Company

Company Cox Automotive - USA

Job Family Group

Job Family Group Engineering / Product Development

Job Profile

Job Profile Lead Software Engineer

Management Level

Management Level Manager - Non People Leader

Flexible Work Option

Flexible Work Option Hybrid - Ability to work remotely part of the week

Travel %

Travel % No

Work Shift

Work Shift Day

Compensation

Compensation Compensation includes a base salary of $ - $. The base salary may vary within the anticipated base pay range based on factors such as the ultimate location of the position and the selected candidate’s knowledge, skills, and abilities. Position may be eligible for additional compensation that may include an incentive program.

Job Description

The Lead Site Reliability Engineer will be part of the Site Reliability Engineering (SRE) team. The SRE team drives reliability, observability, and engineering practice maturity across over 150 teams made up of over a thousand engineers in our part of Cox Automotive. We build processes, documentation, and tools that scale: deep observability to detect and diagnose issues faster, engineering maturity assessments that drive measurable improvement, reusable golden paths that accelerate delivery, and trusted advisory relationships that align reliability with business priorities.

Much of our work focuses on eliminating toil through automation and establishing self-service capabilities that multiply our impact.

If you love building monitoring systems that reveal truth, evaluating engineering practices to raise the bar organization-wide, and acting as a trusted advisor to engineers and leadership, we want to talk to you.

Responsibilities
  • Define and drive adoption of SLIs, SLOs, error budgets, and high-quality alerting standards across the organization
  • Architect end-to-end observability strategies (metrics, logs, traces, business signals) with consistent taxonomy and discoverability
  • Build centralized dashboards, reliability scorecards, and runbooks used by engineering teams and leadership
  • Establish engineering practice maturity baselines and partner with teams on measurable improvement plans
  • Create golden paths—standardized pipelines, infrastructure modules, and service templates that enable rapid, consistent delivery
  • Lead internal workshops, game days, and learning programs to spread operational excellence
  • Act as a trusted advisor to product and engineering leadership, providing-driven insights on reliability risk and trade-offs
  • Guide post-incident reviews toward systemic remediation (guardrails, automation, design changes) rather than superficial fixes
  • Design and extend self-service platforms for deployment, progressive delivery, and automated recovery
  • Reduce MTTR through better telemetry, automation, and resilience patterns
  • Mentor engineers across teams to become local reliability champions, scaling SRE impact without adding headcount
Qualifications
  • Experience programming in at least one of the following languages:
    Python, Typescript, or Java.
  • Bachelor’s degree in a related discipline and 6 years’ experience in a related field. The right candidate could also have a different combination, such as a master’s degree and 4 years’ experience; a Ph.D. and 1 year of experience; or 18 years’ experience in a related field.
  • Applicants must currently be authorized to work in the United States for any employer without current or future sponsorship. No OPT, CPT, STEM/OPT or visa sponsorship now or in future.
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems.
  • Deep hands-on experience with modern observability tools (Cloud Watch and New Relic)
  • Proven ability to assess engineering practices and drive measurable improvements across multiple teams.
  • Experience establishing SLIs/SLOs, managing error budgets, and improving alert signal-to-noise ratios.
  • Strong background in release engineering, CI/CD, and progressive deployment strategies.
  • Deep expertise in AWS, Terraform, AWS CDK, and Git Hub/Git Hub Actions.
  • Track record reducing MTTR and improving availability through automation and architectural improvements.
  • Excellent written and verbal communication skills tailored to both engineers and executives.
  • Systema…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary