AWS GovCloud SRE
Listed on 2026-04-12
-
IT/Tech
Systems Engineer, Cloud Computing
Space Ground System Solutions (SGSS), a Parsons Corporation company, has an immediate full‑time opening for a Site Reliability Engineer (SRE) on its IT Support team in Alexandria, Virginia. In this role, you will support the Naval Research Laboratory in advancing the National Security Space mission by contributing to the expansion of cloud‑based infrastructure. The SRE will play a key role in evolving satellite ground system software to operate within hybrid and private cloud environments, enabling greater scalability, resilience, and mission readiness.
This position will also be responsible for managing, supporting, and facilitating infrastructure operations for development teams building systems that integrate government‑off‑the‑shelf (GOTS), commercial, and open‑source software. This SRE position requires close collaboration with engineers to ensure reliable, secure, and efficient system performance across mission‑critical environments.
- Identify appropriate cloud based (i.e., AWS Gov Cloud) infrastructure to meet mission requirements, including the specification, acquisition, configuration, dynamic provisioning and maintenance of servers
- Design, implement, manage, and automate application and infrastructure security tools (containerized scanning and VM tools) along with integrations to CI/CD pipelines, automated workflows, script‑based integrations, etc.
- Specify and configure physical and virtual machines with Red Hat Enterprise Linux with a heavy focus on stable and supported operating systems. This includes proactive and consistent maintenance to ensure systems platforms (i.e., Open Stack, Kubernetes, etc.) are up and available 98% of time
- Perform current state analysis of an organization’s system security controls and measures against DISA STIG standards, and provide recommendations for enhancement
- Implement configuration management automation (e.g., Ansible) to maintain configuration
- Assist the development team with requirements verification. The SRE’s holistic view should include, but is not limited to, capacity planning, system‑platform performance metrics, change management, high quality of services, promoting first‑class automation and reasonable cost implementation options
- Assist the organization and technical lead(s) in identifying technical problems, perform root cause analysis and corrective actions follow‑up, develop managerial summaries and technical steps for implementing software updates, fixes, and/or replacements
- Conduct post‑incident reviews. Identify what’s working and what’s not. Develop revised response plans that improve the software development life cycle, revise documentation, implement engineering processes that positively impact IT service delivery and build customer confidence after system maintenance & provisioning
- Fix support escalation issues; serve on the tier 3 support team for integrated product support and proactive response to complex support challenges
- Develop and maintain Infrastructure‑as‑Code (IaC) with security embedded using such technologies as Terraform
- Document, train, and operate a software assurance capability at multiple security levels
- Document tribal knowledge and integrate into practical use – documentation, automation, monitoring and remediation. Support feedback from practical experience to software development, support, IT operations and on‑call process improvement
- Develop workflow in Python or similar scripting languages as needed
- Build software for support team; build and implement services to improve the quality of support team delivery; improve monitoring and alerting internally and at customer sites in integration, test, and ops
Skills and Qualifications
- Must be a US citizen
- Must meet eligibility requirements to obtain a TS/SCI clearance within 18 months of hire
- B.S. in Computer Science/Engineering or other relevant field from an accredited university or equivalent combination of formal education
- Minimum 3–5+ years in SRE, Dev Ops, or systems engineering roles
- Infrastructure capabilities: operating systems, networking, identity management and access control
- 3+ years of experience with designing solutions in…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).