Sr. Site Reliability Engineer; SRE
Listed on 2026-05-26
-
IT/Tech
Cloud Computing, SRE/Site Reliability
Scientific Games:
Scientific Games is the global leader in lottery games, sports betting and technology, and the partner of choice for government lotteries. From cutting‑edge backend systems to exciting entertainment experiences and trailblazing retail and digital solutions, we elevate play every day. We push game designs to the next level and are pioneers in data analytics and iLottery. Built on a foundation of trusted partnerships, Scientific Games combines relentless innovation, legendary performance, and unwavering security to responsibly propel the global lottery industry ever forward.
PositionSummary
We are looking for a skilled Site Reliability Engineer (SRE) to enhance the stability, performance, and reliability of our production systems. The SRE will work closely with development, Dev Ops, and security teams, ensuring production readiness, managing on‑call responsibilities, and improving observability across applications and infrastructure.
- Monitoring & Observability
- Maintain and enhance observability using New Relic, Graylog, or other monitoring tools.
- Establish actionable alerting and dashboards for service health and performance metrics.
- Reliability Engineering
- Implement and maintain reliable systems, focusing on capacity planning, performance optimization, and fault tolerance to ensure high availability and scalability.
- Collaborate with teams to define and implement Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs), and monitor their performance.
- Automation & Infrastructure Operations
- Automate operational processes, reducing manual interventions.
- Manage Kubernetes workloads on AWS EKS, ensuring secure and stable deployments.
- Work with Hashi Corp Vault for secrets management and security compliance.
- Incident & Problem Management
- Participate in on‑call rotation to handle production incidents and ensure rapid resolution.
- Troubleshoot production issues, identify root causes, and implement permanent fixes.
- Lead post‑incident reviews, create action items, and follow through on remediation.
- Collaboration
- Work closely with Dev Ops to improve CI/CD pipelines for production readiness.
- Partner with development teams to embed resilience and observability into applications.
- Documentation & Knowledge Sharing
- Document operational runbooks, escalation procedures, and production playbooks.
Required Skills
- Bachelor’s degree in computer science or related field, or equivalent work experience.
- Experience:
6+ years as an SRE, Dev Ops Engineer, or similar role. - Cloud:
Strong experience with AWS (EKS, EC2, S3, Route
53, IAM). - Kubernetes: 6+ years managing production Kubernetes workloads.
- Monitoring & Observability:
Hands‑on with New Relic, Graylog, or similar. - Secrets Management:
Experience with Hashi Corp Vault or equivalent. - Automation & CI/CD:
Proficiency with Git Hub Actions, Git Lab CI/CD, Helm and ArgoCD. - IaC:
Hands‑on experience with Terraform. - Scripting:
Proficiency in Python, Bash, or equivalent scripting languages. - Incident Management:
Strong debugging, troubleshooting, and root‑cause analysis skills. - On‑Call Readiness:
Willingness to participate in 24x7 on‑call rotation.
Desired Skills
- AWS certification.
- Familiarity with .NET application stack.
- Multi‑cloud exposure.
- Experience managing Kubernetes clusters with Rancher in on‑prem environments.
- Familiarity with Packer for building Golden AMIs.
The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions. While performing the duties of this job, the employee is regularly required to sit, stand, walk, bend, use hands, operate a computer, and have specific vision abilities to include close and distance vision, and ability to adjust focus working with computer and business equipment.
WorkConditions
Scientific Games Corporation and its affiliates (collectively, SG) are engaged in highly regulated gaming and lottery businesses. As a result, certain SG employees may, among other things, be…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).