Manager, AI Infrastructure
Listed on 2026-05-16
-
Software Development
Cloud Engineer - Software, DevOps
About the Opportunity
We are seeking an Infrastructure Manager with deep expertise in Kubernetes, Terraform, and Ansible to help scale Seekr’s AI platform across on‑premises, cloud, and SaaS environments. You’ll be highly hands‑on, juggling multiple projects, mentoring engineers, and driving complex initiatives to deliver robust, scalable, and reliable systems. On‑prem experience is highly preferred. This role demands a strong foundation in Linux, networking (both traditional and Kubernetes), container technologies, and automation.
You’ll collaborate closely with software engineering teams, own critical infrastructure, and solve challenging operational and scalability problems in fast‑paced, dynamic environments. From your first day, you will make a valuable – and valued contribution. We are a fast‑growing company where no one is a bystander. We offer you the opportunity to delight millions of consumers around the world while gaining meaningful experience across a variety of disciplines.
and Responsibilities
- Lead development of solutions to complex reliability, performance, and scaling challenges.
- Design, architect, and implement systems, networks, and services powering Seekr’s platform.
- Provide hands‑on leadership and mentorship to the team.
- Partner with software engineering teams to build scalable, efficient, and reliable services.
- Identify and resolve operational inefficiencies through automation.
- Troubleshoot and lead response to deployment and production incidents.
- Implement and enforce security best practices, ensuring infrastructure, deployments, and data are protected at every stage.
- Technical Leadership: 12+ years experience, Proven ability to deliver results in a high‑pressure/dynamic environment, Communication Skills, Roadmap & long‑term strategy, mentoring senior engineers.
- Kubernetes & Distributed Systems: Enterprise‑scale K8s with custom operators/controllers, multi‑platform clusters, hybrid fleet orchestration across cloud & edge, K8s control plane, k8s upgrades, Docker, containerd, CRI‑O, Ingress Controllers (Istio, NGINX, Traefik), K8s Databases, Helm charts.
- Database Management: Postgres, Elastic Search/Open Search, Kubernetes databases, Stateful sets.
- Networking: L2/L3 protocols (BGP, OSPF, VLANs, IPSec), VPNs, firewalls, redundancy paths, bare‑metal Linux networking, CoreDNS, Calico, K8s service mesh (Istio).
- Infrastructure Automation: Ansible, Terraform, CI/CD Pipelines, Git Lab, ArgoCD, MAAS, scripting (Python, Golang, Bash), AWS, Azure.
- Observability: Grafana, Prometheus, Loki, Tempo, ELK, OTEL.
- Security: Zero‑trust architecture, PKI, mTLS, SPIFFE/SPIRE, certificate automation, CVE remediation, secrets management, IAM.
- Incident Management & RCA: End‑to‑end incident lifecycle, root cause analysis, corrective action ownership.
- Meaningful Mission & Impact - Work with a deeply talented, collaborative team solving some of the toughest AI challenges that matter.
- Equity Ownership – RSUs that let you share directly in Seekr’s long‑term success and growth.
- Time Off That Respects Real Life – Unlimited PTO plus 14 paid company holidays to truly recharge.
- Work Your Way – A flexible hybrid work environment with offices in Reston, VA and Austin, TX, plus remote options and flexible working hours.
- Competitive Total Rewards – A role‑appropriate compensation structure that supports long‑term growth, including base salary, bonuses, or commission plans depending on role.
- 401(k) with Company Match – Build your future with a retirement plan that includes employer matching.
- Comprehensive Health & Wellness – Medical, dental, vision, and life insurance coverage starting day one—for you and your family.
- Parental Leave – Paid parental leave to support employees as they welcome a new child through birth, adoption, or foster placement.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).