Site Reliability Engineer
Listed on 2026-05-19
-
IT/Tech
Systems Engineer, SRE/Site Reliability
Career Opportunities with Peerless Technologies
A great place to work.
Current job opportunities are posted here as they become available.
The Senior Sight Reliability Engineer role owns the reliability of systems they don't write. They bring the experience to spend most of their time preventing fires rather than fighting them. They remain in the on‑call rotation and are effective incident responders, but their primary contribution is the foundational work that raises the reliability floor for everyone. Working primarily with:
Kubernetes infrastructure, Git Ops patterns, progressive delivery, rollback automation, capacity planning, and disaster recovery (DR) strategy. Our Senior SREs are less focused on keeping platforms running and more focused on resilience and reliability outcomes – building the internal capability that lets SREs and dev teams ship with confidence. Their success is measured by the same standard, at a higher bar: most of their time is in planned, high‑leverage work.
Looking for a US citizen who can be in person in Dayton Ohio, Huntsville AL, or St. Louis MO.
- 3-7+ years of experience in Operations, Sys Admin, Dev Ops, or Software engineering
- Bachelor’s Degree in CS, Computer Engineering, or related technical field
- US Citizenship & must have or be able to obtain a Top Secret Clearance
- Systems thinking – understanding how systems fail together, blast radius, and more
- Observability Fundamentals – not just the 3 signals, but knowing why and how to use telemetry to optimize services and engineering quality of life
- Software engineering – building automation & non‑trivial APIs, git workflows, leading code reviews, Linux/networking fundamentals
- Strong Communication, Collaboration, and Organizational Skills
- Specialty
Skills:
(2 or more) - Platform & Infrastructure – Kubernetes, ArgoCD/Git Ops, DR, capacity planning
- Observability – OTel standards, Grafana/Perses, Tempo, Clickhouse, Victoria Metrics
- Automation & Toil Reduction – scripting, CI/CD, runbook automation, “Dev Ops”
- Data & Alerting – dashboard quality, alert design, anomaly detection
- SRE Certifications from The Dev Ops Institute, AWS Solution Architect, or similar
- Hands‑on experience with:
Python, Go, Kubernetes, Argo CD, Git Lab/Git Hub, Jenkins, Docker, Locust/Gatling, Prometheus, Grafana/Perses
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).