DevOps/SRE Manager; USAF Cloud One
Listed on 2026-02-16
-
IT/Tech
Cloud Computing, Systems Engineer
Leidos was awarded the U.S. Air Force Cloud One Architecture and Common Shared Services contract and currently has an opening for a Dev Ops/SRE Manager supporting AWS, Azure, Google, and Oracle clouds. This is an exciting opportunity to use your experience to modernize a leading, global-scale multi-cloud environment in support of a critical mission, supporting USAF system resiliency, security, and cost effectiveness.
Location: These positions will be hybrid remote. Candidates will be required to work onsite as needed. Preferred candidates will be located near Hanscom AFB (Boston, MA) or work in Huntsville, AL.
OverviewAs the Dev Ops/SRE manager, you will develop in scalable cloud-native solutions, ensure best practices across architecture, development, deployment, and security, and lead a group of Dev Ops/SRE engineers. This role is essential to ensuring secure, scalable, and resilient connectivity across hybrid and multi-cloud environments. You’ll work closely with cloud engineers, cybersecurity analysts, and program leadership to drive continuous improvement and deliver value to the mission.
ResponsibilitiesLead a group of 5-15 Dev Ops and SRE engineers to fulfill the requirements for the program
Provide leadership using software engineering principles to build and maintain scalable, highly reliable, and performant large-scale systems
Design, implement, and maintain CI/CD pipelines for secure, automated software delivery
Design and implement highly available, fault-tolerant systems for AWS, Azure, GCP, OCI
Define and monitor SLIs, SLOs, and SLAs to ensure service reliability and performance
Implement robust monitoring, logging, and alerting using tools such as Prometheus, Grafana, Azure Monitor, and Cloud Watch
Lead incident response and postmortem processes to drive continuous improvement
Collaborate with development teams to embed reliability into application design and deployment
Lead capacity planning including forecasting resource needs to ensure systems can scale effectively
Ensure compliance with security best practices, including IAM, VPC design, and encryption standards
Implement Dev Sec Ops pipelines for a variety of technical stacks on AWS, Azure, GCP, and OCI
Develop infrastructure as code (IaC) using tools such as Terraform, Ansible, or Cloud Formation
Deploy and manage applications on cloud platforms such as AWS, Azure, Google Cloud or OCI
Configure and optimize container orchestration platforms (e.g., Kubernetes, Docker)
Maintain high availability, scalability, and performance of cloud-based systems
Configure and maintain virtualized environments to ensure performance, scalability, and security
Support infrastructure modernization efforts by integrating virtualization solutions into hybrid cloud environments
Implement automated security tools for vulnerability scanning, SAST/DAST, and container security
Drive consistency for deployment and build processes
Establish proactive monitoring solutions to ensure system reliability and availability
Respond to and troubleshoot production incidents, performing root cause analysis and resolution
Embed security best practices into the SDLC and CI/CD processes
Develop strategy and integration methodology around design, development and implementation of cloud-based solutions
Enable quick development and release of changes and bug-fixes on an as-required basis and incorporate feedback from developers/users
Prepare detailed technical documentation to support development and operational processes
Partner with business stakeholders to understand requirements and translate them into technical solutions
Present architectural designs and recommendations to executive leadership
Mentor, guide and supervise teams for related activities
Lead reviews and provide guidance on complex technical decisions
Prepare detailed technical documentation to support development and operational processes
Collaborate with team members and provide mentorship to junior staff, fostering a learning environment
Act as the Dev Ops/SRE manager to assess employee performance, hire new employees, and ensure compliance with corporate training requirements
Bachelors…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).