Senior Site Reliability Engineer
Listed on 2026-05-30
-
IT/Tech
Cloud Computing, SRE/Site Reliability, Systems Engineer
About the Team
We are looking for a highly motivated Site Reliability Engineer to join our growing Infrastructure and Platform Engineering team. You will play a critical role in operating, monitoring, automating, maintaining, and providing metrics and observability for our Kubernetes-based platform and enabling our engineering organization to deliver capabilities and incremental products quickly and reliably. You and the team will have the opportunity to work with cutting-edge technologies, solve complicated problems, and contribute to the foundation of our infrastructure.
This role requires a strong technical background, a passion for automation, and a collaborative mindset.
This role involves collaborating with teams across multiple locations and time zones. You will need to be adept at communicating efficiently and building positive relationships with colleagues in diverse locations.
About the RoleMay be required to be on site at client locations in the DC, MD, and VA (DMV) area.
You have a growth mindset and will be part of a team promoting a diverse and inclusive environment where you and your workmates are happy, energized and engaged, and who are excited to come to work every day.
Responsibilities- Ensuring the Workday Kubernetes based platform is maintained, healthy, and ensures high availability for our customers through infrastructure automation, CI/CD pipelines, reporting, incident handling and response, and observability tools.
- Maintain the overall platform: maintain core platform components, ensuring high availability, scalability, and security.
- Automate and optimize:
Automate infrastructure provisioning, configuration management, and application deployments using tools like Terraform and Argo CD. - Troubleshooting and support:
Provide support and solve for platform-related issues, working closely with development teams to resolve problems. - Security and compliance:
Implement and maintain security standard methodologies for the platform, ensuring compliance with industry standards. - Documentation and knowledge sharing:
Build and maintain comprehensive documentation for platform components and processes. Actively participate in knowledge sharing within the team. - Collaborate effectively with other engineers and development teams across multiple locations and time zones.
- Stay up-to-date with the latest technologies and trends in the platform engineering space.
This role may require a security clearance at the TS/SCI w/CI Poly level. Applicants must have the ability to obtain and maintain a U.S. government issued security clearance. An active TS/SCI w/CI Poly is preferred.
Basic Qualifications- A minimum of 5 years of hands‑on experience working with large scale cloud infrastructure, automation, and overall Dev Ops methodologies
- Bachelor’s degree in a computer related field or equivalent work experience
- Infrastructure as code:
Proficiency in infrastructure automation tools like Terraform. - CI/CD:
Experience with building, maintaining, and consuming CI/CD pipelines and tools like Argo CD. - Problem‑solving:
Strong analytical and problem‑solving skills. - Communication:
Excellent communication and collaboration skills.
- Strong understanding of Kubernetes
- Amazon Web Services proficiency working in a production environment
- Proficiency in at least one programming language such as C#, Python, Ruby, Rust, or Go programming language proficiency
- Experience with security auditing and compliance frameworks.
- Experience working in air gapped cloud regions
The annualized base salary ranges for the primary location and any additional locations are listed below. Workday pay ranges vary based on work location. As a part of the total compensation package, this role may be eligible for the Workday Bonus Plan or a role‑specific commission/bonus, as well as annual refresh stock grants. Recruiters can share more detail during the hiring process.
Each candidate’s compensation offer will be based on multiple factors including, but not limited to, geography, experience, skills, job duties, and business need, among other things. For more information regarding Workday’s…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).