More jobs:
Software Development Engineer, DevOps; US Federal
Job in
McLean, Fairfax County, Virginia, USA
Listed on 2025-11-19
Listing for:
Workday, Inc.
Full Time
position Listed on 2025-11-19
Job specializations:
-
IT/Tech
Cloud Computing, SRE/Site Reliability
Job Description & How to Apply Below
**** Your work days are brighter here.
**** At Workday, we value our candidates’ privacy and data security. Workday will never ask candidates to apply to jobs through websites that are not Workday Careers.
Please be aware of sites that may ask for you to input your data in connection with a job posting that appears to be from Workday but is not. In addition, Workday will never ask candidates to pay a recruiting fee, or pay for consulting or coaching services, in order to apply for a job at Workday.
** About the Team
** We’re the team that deploys, operates and supports our cloud native technology platform that was designed from scratch for the cloud. We lead the reliability for the complete stack and tools that delivers and supports Workday products across public clouds (e.g. AWS and GCP).
The platform is built using Cloud Native technologies (CNCF), on a foundation of Kubernetes in Public Cloud environments. This provides a secure platform on which Workday service teams, and Platform development teams can build and test their pre-release code, through deployment to production on a continuous basis.
** About the Role
**** This role will support one or more direct or indirect contracts with the U.S. Federal Government which, due to federal government security requirements, mandates that all Workday personnel working on the contracts be United States citizens (naturalized or native).
** The primary function of the Dev Ops/SRE team is to ensure the reliability and availability of the platform to meet the desired SLAs, reduce operational load and to scale sustainably in alignment with business growth.
· Be a key member of team of dedicated Dev Ops/SREs responsible for software engineering and operations, with an emphasis on reducing operational toil.
· Automation and improvements are planned by following scrum practices with two week sprints.
· The scrum team is autonomous - on-call function is follow-the-sun
· Tech stack is Cloud Native (Kubernetes, Istio, OPA, GoLang, Ruby/Groovy, ArgoCD, Jenkins, Prometheus, Grafana etc)
· Responsible for the safe change and reliability of customer environments, with SLO gated multi-stage deployment automation. Mission is to improve platform reliability, observability and overall customer satisfaction.
· Develop and launch effective SLIs to ensure that SLOs are achieved through building an extendable Observability architecture, runbook automation, and establishing new processes.
· Partner with platform service teams to craft and implement a range of SRE standards for their respective services to meet. Define benchmarks and automation to qualify services to move to production environments.
** About You
** Your passion for identifying and solving problems on distributed environments scaling across configuration, Linux Operating System and network. You have hands-on experience handling distributed environments (Kubernetes experience or Certified Kubernetes administrator certification is a big plus). You have a keen interest in improving operational efficiency, and believe that automation is the key to operating large-scale systems. You are driven to ensure customer success.
*
* Basic Qualifications:
*** BS/MS in Computer Science or related field or equivalent degree
* 4+ years of Dev Ops or SRE experience in a distributed systems environment.
* 4+ yrs experience with AWS, GCP, or Azure
* 4+ yrs experience with Kubernetes
* Proficiency with a programming language such as GoLang, Python, or Ruby (preferably GoLang (Go))
** Other
Qualifications:
*** 4+ years in handling and solving distributed systems in a public cloud, Passionate automator, with a track record of referenceable examples.
* Experienced with software development standard methodologies such as code management, CI/CD (GIT/Jenkins/ArgoCD) , testing
* Can work independently and with the demeanor that everything can be automated.
* Skills and enthusiasm to operate, maintain, support and sustain the platform.
* Excited by working in a fast-paced environment. Experience collaborating with multi-functional global and remote teams with a diverse set of backgrounds.
* Excellent documentation skills, experience with developing detailed…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×