Senior Site Reliability Engineer

Job in Dallas, Dallas County, Texas, 75215, USA
Listing for: Las Vegas Sands Corp.
Full Time position
Listed on 2026-06-23
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Cloud Computing: Infrastructure & Operations, Azure, IT Support
Salary/Wage Range or Industry Benchmark: 125000 - 150000 USD Yearly
Job Description & How to Apply Below

Position Overview:

The primary responsibility of the Senior Site Reliability Engineer (SRE) is to lead reliability engineering initiatives across our Azure estate and Command Center operations. This role focuses on scripting, automation, and observability to ensure uptime, performance, and rapid incident response.

All duties are performed in accordance with departmental and company policies, practices, and procedures. All team members are expected to conduct themselves in a professional manner at all times and observe the company’s standards, work requirements and rules of conduct.

Responsibilities
  • Architect end‑to‑end monitoring using Azure Monitor, Log Analytics, Application Insights, and ITRS Geneos.
  • Implement monitoring‑as‑code with Terraform/Bicep, including alerts, dashboards, and diagnostic settings.
  • Create actionable dashboards (Azure Workbooks, Grafana) for SLIs/SLOs and real‑time service health.
  • Design alert taxonomies with severity mapping (P0–P4), dynamic thresholds, and escalation policies.
  • Reduce alert noise and ensure 100% alert‑to‑runbook mapping.
  • Support Major Incident Command (MIC) during P0/P1 bridges with technical expertise and rapid remediation.
  • Build automation using PowerShell, Python, and Azure Functions for alert lifecycle, runbooks, and self‑healing workflows.
  • Integrate with ITSM (ServiceNow/Jira) for automated ticket enrichment and routing.
  • Eliminate repetitive operational tasks and reduce toil through automation‑first practices.
  • Define and enforce SLIs/SLOs, error budgets, and resilience patterns (bulkheads, retries, timeouts).
  • Conduct production readiness reviews, chaos drills, and failover rehearsals.
  • Partner with application teams to embed instrumentation and structured logging.
  • Enforce desired state with Azure Policy, DSC/Guest Configuration, and drift detection.
  • Harden networking (VNet, NSGs, Private Link, Firewall), identity (Entra), and secrets (Key Vault).
  • Ensure auditability and compliance across environments.
  • Be on site within the IT Command Center.
Minimum Qualifications
  • At least 21 years of age.
  • Proof of authorization to work in the United States.
  • Bachelor’s degree in Computer Science or an IT field, or equivalent experience.
  • Must be able to obtain and maintain any required certification or license.
  • 7+ years of experience in SRE/DevOps/Platform roles, with 4+ years focused on Azure at scale.
  • Expert knowledge in infrastructure as code (Terraform or Bicep) and Git‑based workflows (GitHub Actions/Azure DevOps).
  • Proficiency in CI/CD, deployment strategies (canary, blue‑green), and automated rollbacks.
  • Proficiency in PowerShell and Python for automation; experience building reusable modules.
  • Demonstrated experience with AKS, App Services, Functions, VM Scale Sets, and Azure networking/security.
  • Deep knowledge of Azure services: AKS, App Services, Functions, VMSS, Storage, Front Door, API Management, Load Balancers, Monitor, Log Analytics, App Insights, Key Vault, Policy, Defender Automation.
  • Proficiency in IaC tools: Terraform/Bicep, PowerShell, Python, GitHub Actions/Azure DevOps.
  • Observability: Azure Monitor, Log Analytics, App Insights, Prometheus/OpenTelemetry, ITRS Geneos.
  • Service Management: ServiceNow, Jira.
  • Proficiency in SRE fundamentals: SLIs/SLOs, error budgets, capacity planning, chaos testing, toil reduction.
  • Demonstrated experience leading incidents and collaborating across teams.
  • Strong interpersonal skills with the ability to communicate effectively across levels.
  • Must be available to work varied shifts including nights, weekends, and holidays to ensure 24/7 coverage; provide off‑hours support as needed during critical incidents.
  • Team members must be on site within the IT Command Center.
Preferred Qualifications
  • AZ‑400: Azure DevOps Engineer Expert
  • AZ‑305: Azure Solutions Architect Expert or AZ‑104: Azure Administrator
  • AZ‑500: Azure Security Engineer Associate
  • ITIL v4 for operational rigor
  • SRE Foundation/Practitioner Certification (DevOps Institute or equivalent)
Physical Requirements
  • Must be able to lift or carry 50 pounds unassisted during assigned tasks.
  • Physically access assigned workspace areas with or without reasonable accommodation.
  • Work indoors, with possible exposure to environmental factors such as noise and dust.
  • Utilize laptop and standard keyboard to perform essential functions.
Position Requirements
10+ Years work experience