Site Reliability Engineer
Listed on 2026-01-12
IT/Tech
Cybersecurity, Systems Engineer
We’re seeking a Site Reliability Engineer to support secure, mission‑critical systems at Fort Meade. You’ll ensure uptime and performance of AI‑powered cyber applications in closed AWS environments, working closely with remote engineering teams and on‑site stakeholders.
You’ll be responsible for ensuring the performance, reliability, and operational readiness of advanced AI‑powered applications running in secure, closed‑cloud AWS environments. You’ll support a powerful microservices‑based platform used for cyber operations, working closely with engineering teams and government stakeholders to keep systems running smoothly and securely, even in the high‑pressure context of real‑time cyber warfare.
Key Responsibilities
- Maintain and troubleshoot Docker‑based microservices in secure AWS enclaves
- Build monitoring, logging, and alerting systems
- Automate deployments using IaC tools (Terraform, CloudFormation)
- Lead incident response and root cause analysis
- Support real‑time cyber ops systems in high‑security environments
Qualifications
- 5+ years in SRE, DevOps, or cloud operations
- Strong AWS (EC2, ECS, VPC), Docker, and Linux experience
- Proficient in scripting (Python, Bash, or Go)
- Familiar with secure/air‑gapped environments and compliance
- TS/SCI clearance and ability to work on‑site full‑time
- Experience with Neo4j, GraphQL, NATS, or AI/ML systems
- AWS/Kubernetes certifications
- Background in defense or cyber operations
Posted By: Patrick Fuller