Cloud Infrastructure Engineer
Listed on 2026-05-29
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability, Network Engineer
Location
Rosslyn, VA
Employment Type: Full time
Location Type: On-site
Department: Engineering
About Rune
Rune Technologies is here to revolutionize the status quo of military logistics and sustainment through the deployment of AI-enabled solutions. Combining elite Silicon Valley software expertise with deep operational experience working in and with the Department of Defense, Rune builds cutting-edge software to solve the most critical logistics challenges faced by the U.S. military and its allies. Rune’s flagship product is TyrOS, an advanced software platform to enhance logistics at tactical and operational echelons, providing unified, comprehensive management of inventory, personnel, equipment and distribution.
TyrOS integrates critical information for holistic, data‑driven logistics decisions, leveraging AI for decision support, predictive analytics and optimization at machine speed. Rune’s mission is to support and enable the military logistics and sustainment communities with software to meet needs for the next fight.
At Rune, we are building the infrastructure that powers logistics operations across cloud and edge environments, from centralized systems to distributed nodes operating in constrained and disconnected conditions. As a Cloud DevOps Engineer, you will be responsible for scaling and operating the cloud backbone that supports this system.
This is not a traditional cloud role. You will take a functional, early‑stage platform and scale it into a highly available, multi‑environment system, capable of coordinating with edge deployments and supporting distributed workloads. You’ll work on infrastructure that must remain reliable while interacting with thousands of nodes across heterogeneous environments, including systems with intermittent connectivity.
You will design and operate cloud infrastructure across AWS and multi‑cloud environments, build and maintain Kubernetes‑based deployment systems, and ensure that our platform can scale, recover, and evolve as usage grows. You’ll also help bridge the gap between cloud and edge — enabling synchronization, deployment, and observability across a distributed mesh of systems.
This role combines platform engineering, deployment engineering, and SRE, with a strong emphasis on building systems that actually run in production — not just in ideal cloud conditions.
What You’ll Do
- Design, build, and operate cloud infrastructure across AWS and multi‑cloud environments.
- Deploy and manage applications on Kubernetes (EKS and other environments), ensuring scalability, reliability, and performance.
- Build and maintain infrastructure‐as‐code using Terraform, enabling repeatable and auditable deployments.
- Develop and maintain CI/CD pipelines (e.g., CircleCI) to support rapid and reliable software delivery.
- Implement and operate observability systems (metrics, logs, tracing) to monitor system health and performance.
- Collaborate with backend and infrastructure teams to ensure services are designed for scalability, resilience, and operability.
- Support integration with edge‑deployed systems, including handling synchronization, connectivity, and deployment workflows across distributed environments.
- Debug and resolve production issues across cloud infrastructure, networking, and application layers.
- Contribute to best practices around security, reliability, and performance across the platform.
- 5+ years of experience in DevOps, SRE, infrastructure, or software engineering roles.
- Strong experience with AWS (e.g., EC2, RDS, autoscaling, networking) in production environments.
- Deep experience deploying and operating applications on Kubernetes.
- Proficiency with Terraform or similar infrastructure‑as‑code tools.
- Experience building and maintaining CI/CD pipelines (e.g., CircleCI or similar).
- Strong scripting skills in Bash; proficiency in Python or Go is a plus.
- Experience operating production systems with a focus on reliability, observability, and performance.
- Strong understanding of networking fundamentals and distributed systems behavior.
- Experience working in multi‑cloud environments or supporting deployments across heterogeneous…