Site Reliability Engineer - Data Infrastructure
Listed on 2026-06-05
IT/Tech
Cloud Computing, SRE/Site Reliability
Location: Seattle
Employment Type: Regular
Job Code: A32205
Responsibilities:
- Incident response and triage: Serve as a first responder for production alerts and incidents, execute established runbooks to mitigate issues, and accelerate escalation when necessary.
- Operational excellence and change management: Perform routine operational tasks such as deployments, configuration changes, and system maintenance following established change‑control processes to minimize production risk.
- Iterative automation and AI augmentation: Identify and automate repetitive tasks using scripting (Python, Go, Bash) and AI agents to reduce toil, enhance consistency, and boost overall productivity.
- Observability and monitoring: Refine monitoring dashboards, tune alert thresholds, and ensure sufficient instrumentation for rapid problem detection and diagnosis.
- Data Center and AI Infrastructure: Support daily operations, construction, and maintenance of data‑center environments and AI infrastructure for large‑scale data processing.
Minimum Qualifications:
- Bachelor’s degree in Computer Science, a related technical field, or equivalent practical experience.
- 2+ years of experience in an SRE, DevOps, Systems Administration, or similar role.
- Experience with at least one scripting language (Python, Bash, Go).
- Solid understanding of Linux operating systems and networking concepts.
Preferred Qualifications:
- Familiarity with container technologies such as Docker and Kubernetes.
- Hands‑on experience with a common data store (MySQL, Redis, PostgreSQL).
- Experience with monitoring and observability tools (Prometheus, Grafana, ELK Stack).
- Strong desire to learn, proactive problem‑solving attitude, and excellent communication skills.
- Experience in the operation and construction of Data Centers is a big plus.
The base salary range for this position in this city is $148,200 – $300,960 annually.
Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies, experience, and location.
Benefits:
Employees have day‑one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short‑term and long‑term disability coverage, life insurance, wellbeing benefits, and more. Employees also receive 10 paid holidays per year, 10 paid sick days per year, and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).
The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
Legal Compliance (EEO):
For Los Angeles County (unincorporated) candidates: qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws, including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse, and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:
- Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;
- Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems;
- Exercising sound judgment.