×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer II

Job in New York, New York County, New York, 10261, USA
Listing for: 5014 Disney Entertainment & Sports LLC
Full Time position
Listed on 2026-06-02
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Cloud Computing
Salary/Wage Range or Industry Benchmark: 123000 - 165000 USD Yearly USD 123000.00 165000.00 YEAR
Job Description & How to Apply Below
Location: New York

Site Reliability Engineer II

Job Posting :

Department:
Engineering Fleet – Reliability Engineering & Operational Support to backend service development teams.

We build world‑class products that enable Disney, ESPN, Hulu, and other media brands to reach millions of people worldwide.

Job Description

The Streaming SRE squad drives improvements in performance, resiliency, and operational excellence. We partner with cross‑functional teams to provide guidance, automation, education, and best practices that elevate the reliability and scalability of services that support our products and brands. We are seeking a Site Reliability Engineer to contribute to the stability and scalability of critical systems by building automation, improving operational workflows, enhancing observability, and participating in incident response.

Responsibilities
  • Contribute to the design, implementation, and improvement of systems to enhance reliability, scalability, and performance.
  • Build and maintain automation for deployment, monitoring, alerting, and operational workflows.
  • Collaborate with software engineering teams to implement SRE best practices, including SLIs, SLOs, error budgets, and automated remediation.
  • Support CI/CD pipelines and participate in optimizing the software delivery lifecycle.
  • Develop tools, dashboards, and instrumentation to improve observability across metrics, logs, and distributed tracing.
  • Participate in incident response, root cause analysis, and corrective actions to prevent recurrence.
  • Assist in capacity planning, performance tuning, and scaling strategies for distributed systems.
  • Maintain and improve Infrastructure‑as‑Code (IaC) definitions and cloud environment configurations.
  • Contribute to documentation, runbooks, architectural diagrams, and operational standards.
  • Collaborate with cross‑functional teams to identify reliability risks and recommend improvements.
  • Participate in incident‑based escalations and rotations to support high‑availability production systems.
  • Continuously evaluate system architecture, tools, and practices to drive operational excellence and efficiency.
Basic Qualifications
  • Bachelor’s degree in computer science, engineering, or related field (or equivalent experience).
  • 3+ years of experience in Site Reliability Engineering, Dev Ops, Platform Engineering, or related discipline.
  • Hands‑on experience with cloud platforms – AWS (preferred), GCP, Azure.
  • Proficiency in Python, Go, JavaScript, Bash, or equivalent scripting languages.
  • Working knowledge of Linux or Unix‑based systems.
  • Experience with CI/CD systems (Git Hub Actions, Git Lab CI, Jenkins).
  • Familiarity with Infrastructure‑as‑Code (Terraform, Cloud Formation, etc.).
  • Experience with containerization technologies such as Docker and Kubernetes.
  • Understanding of networking fundamentals, distributed systems, and system design basics.
  • Strong analytical and troubleshooting skills, including the ability to diagnose complex system issues.
  • Ability to work both independently and collaboratively.
  • Strong communication skills and the ability to collaborate effectively with cross‑functional teams.
Preferred Qualifications
  • Hands‑on experience with observability stacks (Prometheus, Grafana, ELK/EFK, Datadog, Splunk, New Relic).
  • Exposure to Git Ops tooling (Argo CD, Flux).
  • Experience contributing to SLO/SLI frameworks and implementing error budgets.
  • Knowledge of service mesh architectures (Istio, Linkerd).
  • Familiarity with performance testing and load testing tools.
  • Experience with message queues, event‑driven systems, or distributed data platforms.
  • Cloud or Dev Ops‑related certifications (AWS Associate/Specialty, GCP Professional, Kubernetes CKA/CKS).
  • Experience working in large‑scale enterprise environments or with distributed global teams.
  • Experience using modern AI‑assisted development tools (Copilot, Cursor, or similar).
  • Understanding foundational AI/ML concepts, familiarity with cloud‑native AI services such as model hosting, or ability to use AI tools to automate cloud operations tasks.
Compensation & Benefits

The hiring range for this position in New York City is $123,000 - $165,000. The base pay actually offered will take into account internal equity and may vary depending on the candidate’s geographic region, job‑related knowledge, skills, and experience among other factors. A bonus and/or long‑term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and other benefits, dependent on the level and position offered.

Location

& Employment

New York, NY, USA – Full Time. Job Posting Primary Business: PE – Sports, News & Entertainment, Enablement – Infrastructure Engineering. Primary City, State, Region, Postal Code:
New York, NY, USA.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary