Site Reliability Engineer, US Gov
Listed on 2025-12-13
-
IT/Tech
Systems Engineer, Cloud Computing, Cybersecurity, IT Support
What You’ll Be Doing
Architect, automate, test, deploy, and maintain a well-designed, highly available cloud infrastructure in AWS Gov Cloud and AWS C2E with a strong focus on security, compliance, and operational excellence. You will ensure adherence to SOC2, NIST 800-171, and FedRAMP Moderate controls while supporting cleared environments and workflows.
You will also manage the deployment and ongoing maintenance of Quindar deployments at government sites. As the team grows, you will define best practices for availability, latency, and performance across all services.
A key mission focus will be driving Quindar’s readiness and deployment onto AWS C2E, including control implementation, continuous container hardening, deployment configuration, and networking requirements.
Some of your responsibilities will include incident management, supporting a 24/7 on-call rotation, and technical troubleshooting of complex enterprise and mission-critical systems. Ideally, the systems and automation you create significantly reduce the need for manual intervention.
You will work closely with engineering teams (frontend, backend, and flight/mission operations engineers) to ensure the Quindar system is operational to all required performance metrics.
Technical Skills- Deep experience with Kubernetes, containerized workloads, and serverless architectures
- Expertise managing Kubernetes clusters with AWS EKS or Rancher as well as with integrating AWS cloud-specific services
- Hands-on experience supporting Gov Cloud, IL-enclave deployments, or C2E environments
- Experience managing and using observability stacks such as Grafana LGTM, Datadog, etc.
- Proficiency in Python and Terraform (or similar IaC tooling)
- Strong understanding of networking: VPNs, NLB/ALBs, HTTPS, TLS, VPC peering, and CDN integration
- Strong knowledge of designing, analyzing, and troubleshooting API services, distributed No
SQL/relational databases, caching systems, event-driven architectures, and multi-tier systems - Strong background in task automation and CI/CD pipeline development (Git Lab Workflows preferred but not required)
- Understanding of Unix/Linux operating systems
- Knowledge of cloud security best practices, enclave boundary protections, enclave-to-enclave interconnects, and cost-efficient architectures
- Experience with identity and access management (Auth0, Keycloak, AWS IAM, or ICAM patterns)
- Strong git fundamentals and preferred experience managing software product deployment across multiple classification levels.
- Bachelor’s degree in Computer Science or related field
- 3+ years of professional experience as an SRE, Dev Ops, reliability, infrastructure, or platform engineer
- Active U.S. Security Clearance (Secret or higher required; TS/SCI preferred)
- U.S. Citizenship required
- Experience working toward ATO/authorization in federal, DoD, or IC environments preferred
- Experience supporting deployments in Gov Cloud, C2S/C2E, or IL-enclave environments highly desirable
- To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. § 1157, or (iv) Asylee under 8 U.S.C. § 1158, or be eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITAR here.
- We work in a cutting edge industry and you will get the opportunity to be part of a small team with a large direct impact on the success of our customers’ space missions!
- We take work life balance very seriously. We require employees to take 15 days off but provide unlimited PTO and follow most US federal government holidays.
- Mental health is just as important as physical so we provide quarterly health & wellness benefits.
- Comprehensive health insurance for you and your family with 100% coverage for employees.
- We encourage employees to save for retirement and provide 4% 401(k) matching.
- Each quarter we have a 4-day company offsite. Previous locations include San Francisco, Nashville, Denver, Santa Fe, New Orleans, San Diego, Bozeman, and New York City.
- Our culture and company is evolving. You will be key in creating the next major or minor version!
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).