Senior DevOps Engineer - Kubernetes
Listed on 2026-01-02
IT/Tech
Systems Engineer, Cloud Computing
Overview
We are seeking a Kubernetes Engineer with experience building resilient, scalable container‑based platforms in dynamic environments. You’ll play a central role in implementing our container orchestration strategy, optimizing Kubernetes clusters for reliability, performance, and developer velocity. This is a hands‑on role where architectural insight meets operational excellence—ideal for an engineer who wants to leave their fingerprints on the core of how things run.
Key Responsibilities
Cluster Management: Design, build, and maintain Kubernetes clusters across development, staging, and production environments (EKS a strong plus).
Platform Engineering: Build tooling and abstractions that streamline application deployment and service discovery for developers.
Autoscaling & Performance: Optimize pod scheduling, resource allocation, and horizontal / vertical scaling for high‑performance services.
Security & Policy Enforcement: Implement RBAC, network policies, and runtime security tools to enforce safe, compliant workloads.
Deployment Enablement: Enhance Helm charts, Kustomize workflows, and GitOps processes to support fast, safe, and reliable deployments.
Observability: Own the integration and tuning of observability stacks (e.g., Prometheus, Grafana, Loki) for visibility into cluster and application health.
Resilience & Recovery: Support fault‑tolerant architectures, runbooks for failover, and high availability strategies.
Collaboration: Partner with developers, QA, and platform teams to evolve infrastructure‑as‑code and self‑service systems that reduce friction and boost autonomy.
About You
Experience
- 5+ years in Site Reliability, Infrastructure, or DevOps roles with a clear Kubernetes focus
- Deep experience running production workloads on Kubernetes (especially on AWS / EKS)
- Solid understanding of container lifecycle, networking, and orchestration internals
- Strong with tools like Helm, Kustomize, ArgoCD, or Flux
- Proficiency with Terraform or Pulumi for provisioning EKS and supporting infrastructure
- Competence in at least one scripting language (Python, Bash, or Go)
- Familiarity with service meshes (Istio, Linkerd) and Kubernetes‑native security tools (OPA / Gatekeeper, Kyverno)
- You are proactive and enjoy taking ownership of infrastructure challenges
- You value automation and reducing manual toil wherever possible
- You are comfortable working in fast‑paced, collaborative environments
- You communicate clearly and can explain complex infrastructure topics to different audiences
- Experience implementing security controls in regulated or compliance‑focused environments
- Familiarity with service mesh architectures or advanced Kubernetes networking
- Background supporting multi‑region or multi‑cloud deployments