Senior Cloud Systems Engineer
Listed on 2026-06-04
IT/Tech
Systems Engineer, SRE/Site Reliability
Are you passionate about shaping the future of humanity's presence in space? Lunar Outpost, an industry leader in space robotics and planetary vehicles, invites you to join our team! Lunar Outpost is dedicated to creating a permanent presence in space, while also driving positive impacts here on Earth. Currently, we are seeking a Cloud Systems Engineer to contribute to our mission in a dynamic startup environment.
The main responsibilities of this role include managing Stargate deployments in production, ensuring high availability and uptime, executing reliable releases, and driving operational excellence through comprehensive monitoring, metrics, and infrastructure management. Stargate is a next-generation Command and Control (C2) platform - the ground software that enables and empowers all Lunar Outpost missions, including the Lunar Terrain Vehicle (LTV) program. As mission-agnostic software used by all operators in mission control, Stargate's reliability and uptime are critical to mission success.
Key Responsibilities
- Own and manage Stargate production releases and deployment pipelines using GitOps practices
- Drive operational excellence initiatives including metrics collection, log aggregation, uptime monitoring, KPI tracking, and SIM Integration
- Achieve and maintain 99.99% (four nines) to 99.999% (five nines) uptime SLAs
- Design, develop, and maintain Helm charts for Stargate and related infrastructure components
- Implement and manage progressive deployment strategies including canary deployments and blue-green deployments
- Oversee critical Kubernetes infrastructure including volume management, DNS configuration, load balancer provisioning, and secret monitoring/management
- Manage and optimize Kubernetes deployments and related AWS services
- Implement and maintain an observability stack using OpenTelemetry for comprehensive monitoring and alerting
- Collaborate with engineering teams to establish and enforce operational best practices and reliability standards
- 5+ years of production DevOps/SRE experience with a demonstrable track record of maintaining high-availability systems
- Kubernetes administration experience with elevated cluster access in production environments
- Strong proficiency writing and maintaining Helm charts for complex, multi-component applications
- Hands‑on experience implementing canary deployments, blue‑green deployments, and other progressive delivery patterns
- Deep knowledge of Kubernetes infrastructure management: persistent volumes, DNS/networking, load balancers, and secrets management
- Production experience with GitOps workflows and Flux CD
- Proven track record maintaining 99.99%+ uptime in production environments
- Excellent judgment and decision‑making skills when working with production systems
- Experience with AWS cloud services, particularly EKS (Elastic Kubernetes Service), Secrets Manager, VPC networking, IAM, and AWS Load Balancers
- Experience with Karpenter for Kubernetes node autoscaling and cluster optimization
- Experience with OpenTelemetry instrumentation and observability platforms
- Kubernetes certifications (CKA, CKAD, or CKS)
- Experience building and maintaining CI/CD pipelines (GitHub Actions, Jenkins, GitLab CI, etc.)
- Knowledge of infrastructure-as-code tools (Terraform, CDK)
- Experience implementing SRE practices including SLIs, SLOs, and error budgets
Any offer of employment for this position is conditional upon Lunar Outpost receiving the LTVS SubCLIN 1C Award from NASA. If the contract is not awarded to Lunar Outpost, this offer will be considered null and void.
Compensation & Benefits
- Comprehensive health coverage: Medical, dental, and vision benefits, with 70% of premiums covered by the employer
- Paid time off: Three (3) weeks per year of vacation
- Retirement plan: Up to 4% employer match on 401(k) contributions
- Paid holidays: 11 company‑recognized holidays
- Parental leave
- Educational reimbursement opportunities to support company objectives, continued learning, and career development
Lunar Outpost Inc. is an equal opportunity employer. Lunar Outpost Inc. does not discriminate on the basis of race, color, religion, sex (including pregnancy, sexual orientation, and gender identity), national origin, ethnicity, age, disability, veteran status, genetic information, or any other characteristic protected by applicable law. All employees, including executives and human resources personnel, are expected to conduct themselves with professionalism and treat others with dignity and respect in accordance with this policy.