×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer - Kubernetes

Job in Bellevue, King County, Washington, 98009, USA
Listing for: Okta
Full Time position
Listed on 2026-04-23
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below
Position: Staff Site Reliability Engineer - Kubernetes

Secure Every Identity, from AI to Human Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.

This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.

Workforce Identity Cloud

Okta Workforce Identity Cloud (WIC) provides easy, secure access for your workforce so you can focus on other strategic priorities like reducing costs, and doing more for your customers.

If you like to be challenged and have a passion for solving large-scale automation, testing, and tuning problems, we would love to hear from you. The ideal candidate is someone who exemplifies the ethics of "If you have to do something more than once, automate it" and who can rapidly self-educate on new concepts and tools.

Position Overview

The Site Reliability Engineer (SRE) will play a key role in building and managing Kubernetes platforms that support cloud-native applications and services. This position focuses on architecting and managing reliable, scalable, and secure Kubernetes-based platforms on AWS, ensuring high availability and performance while optimizing costs and automation. The ideal candidate will have hands‑on experience with AWS infrastructure, Kubernetes platform creation, Helm charts, Karpenter scaling, and Istio service mesh.

Key Responsibilities
  • Design, implement, and maintain highly available, scalable, and fault‑tolerant Kubernetes platforms. Ensure clusters are optimized for production workloads, providing high resilience and operational efficiency.
  • Build, manage, and optimize AWS cloud infrastructure, including EKS, ECS, S3, VPCs, RDS, IAM, and more. Implement best practices for cost management, scaling, and security within AWS.
  • Utilize Helm to automate and streamline the deployment of applications and services to Kubernetes clusters. Create, maintain, and manage Helm charts for production‑ready deployments.
  • Implement and manage Karpenter to dynamically scale Kubernetes clusters in response to workload demands.
  • Configure and manage Istio to provide service‑to‑service communication, security, and observability within the Kubernetes clusters. Enable fine‑grained traffic management, service discovery, and policy enforcement.
  • Automate the deployment, scaling, and management of infrastructure and applications. Work with CI/CD pipelines to ensure a seamless flow from development to production with minimal downtime.
  • Respond to incidents, troubleshoot, and resolve system issues related to performance, availability, and security in a timely and effective manner.
  • Design and implement secure cloud infrastructure with appropriate access controls, network security, and compliance frameworks.
  • Create and maintain detailed documentation for Kubernetes platform setup, operational procedures, and best practices. Promote knowledge sharing across teams.
Required Qualifications
  • 4+ years of experience with Kubernetes/Helm.
  • 4+ years of experience with Terraform.
  • 5+ years of experience with AWS.
  • Experience with multi‑region cloud environments.
  • Proven experience with AWS (EC2, RDS, S3, Cloud Formation, IAM, etc.) and a solid understanding of cloud‑native architectures.
  • Strong expertise in Kubernetes platform creation, management, and optimisation (e.g., setting up highly available clusters, networking, and storage).
  • Hands‑on experience with Helm for Kubernetes application deployment and management.
  • Practical experience with Karpenter for dynamic scaling of Kubernetes clusters and optimising resource usage. Expertise in managing and securing Istio for service mesh, including traffic management, security, and observability features.
  • Proficiency in CI/CD pipelines and automation tools (e.g., Jenkins, Git Lab, Circle

    CI, Terraform, Ansible, Spinnaker). Strong scripting and automation skills in Python, Bash, or Go for infrastructure management and platform automation.
  • Experience with monitoring,…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary