×
Register Here to Apply for Jobs or Post Jobs. X

Senior Site Reliability Engineer

Job in Dallas, Dallas County, Texas, 75215, USA
Listing for: Sands Corp
Full Time position
Listed on 2026-02-15
Job specializations:
  • IT/Tech
    Systems Engineer, IT Support, Cloud Computing, Cybersecurity
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Job Description:

** Position Overview
** The primary responsibility of the Senior Site Reliability Engineer (SRE) to lead reliability engineering initiatives across our Azure estate and Command Center operations. This role focuses on scripting, automation, and observability to ensure uptime, performance, and rapid incident response. The Senior SRE will design and implement monitoring-as-code, optimize alerting, and build self-healing automation that reduces toil and accelerates recovery.

As part of our journey from traditional operations toward a mature SRE model, the Senior SRE will partner with product engineering, platform teams, and the Command Center including Service Desk and Major Incident Command (MIC) to deliver measurable improvements in service reliability.

All duties are to be performed in accordance with departmental and Las Vegas Sands Corp.’s policies, practices, and procedures. All Las Vegas Sands Corp. Team Members are expected to conduct and carry themselves in a professional manner at all times. Team Members are required to observe the company’s standards, work requirements and rules of conduct.
** Essential

Duties & Responsibilities **
* ** Observability & Monitoring**  + Architect end-to-end monitoring using Azure Monitor, Log Analytics, Application Insights, and ITRS Geneos.  + Implement monitoring-as-code with Terraform/Bicep, including alerts, dashboards, and diagnostic settings.  + Create actionable dashboards (Azure Workbooks, Grafana) for SLIs/SLOs and real-time service health.
* ** Alerting & Incident Response**  + Design alert taxonomies with severity mapping (P0–P4), dynamic thresholds, and escalation policies.  + Reduce alert noise and ensure 100% alert-to-runbook mapping.  + Support Major Incident Command (MIC) during P0/P1 bridges with technical expertise and rapid remediation.
* ** Automation & Tooling**  + Build automation using Power Shell, Python, and Azure Functions for alert lifecycle, runbooks, and self-healing workflows.  + Integrate with ITSM (Service Now/Jira) for automated ticket enrichment and routing.  + Eliminate repetitive operational tasks and reduce toil through automation-first practices.
* ** Reliability Engineering**  + Define and enforce SLIs/SLOs, error budgets, and resilience patterns (bulkheads, retries, timeouts).  + Conduct production readiness reviews, chaos drills, and failover rehearsals.  + Partner with app teams to embed instrumentation and structured logging.
* ** Governance & Compliance**  + Enforce desired state with Azure Policy, DSC/Guest Configuration, and drift detection.  + Harden networking (VNet, NSGs, Private Link, Firewall), identity (Entra ), and secrets (Key Vault).  + Ensure auditability and compliance across environments.
* Perform job duties in a safe manner.
* Attend work as scheduled on a consistent and regular basis.
* Perform other related duties as assigned.
** Minimum Qualifications
*** At least 21 years of age.
* Proof of authorization to work in the United States.
* Bachelor’s degree in Computer Science or IT field, or equivalent experience.
* Must be able to obtain and maintain any certification or license, as required by law or policy.
* 7+ years of experience in SRE/Dev Ops/Platform roles, with 4+ years focused on Azure in production at scale.
* Expert knowledge in Infrastructure as Code (Terraform or Bicep) and Git-based workflows (Git Hub Actions/Azure Dev Ops).
* Proficiency in CI/CD, deployment strategies (canary, blue-green), and automated rollbacks.
* Proficiency in Power Shell and Python for automation; experience building reusable modules.
* Demonstrated experience with AKS, App Services, Functions, VM Scale Sets, and Azure networking/security.
* Deep knowledge of:  +
** Azure:
** AKS, App Services, Functions, VMSS, Storage, Front Door, API Management, Load Balancers, Monitor, Log Analytics, App Insights, Key Vault, Policy, Defender  +
** Automation & IaC:
** Terraform/Bicep, Power Shell, Python, Git Hub Actions/Azure Dev Ops  +
** Observability:
** Azure Monitor, Log Analytics, App Insights, Prometheus/Open Telemetry; experience with ITRS Geneos.  +
** Service Management:
** Service Now, Jira
* Proficiency in SRE…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary