Senior Software Engineer, Observability
Listed on 2026-06-28
Software Development
DevOps, Cloud Engineer - Software
Secure Every Identity, from AI to Human
Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.
This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.
The Auth0 Platform Observability team owns the observability tooling that monitors the Auth0 Platform, and we are looking for an Observability Engineer to help ensure that our Product and Platform Engineers can monitor and observe our platform while continuing to rapidly ship software that our customers love. Our engineers maintain and automate observability tooling across the entire platform, including metrics, logs, and traces.
We are looking for engineers passionate about monitoring, observing, measuring uptime and availability, and ensuring platform stability. If you have experience in Site Reliability Engineering (SRE) or as a DevOps engineer, and you have a passion for observability tooling, this position will allow you to further your learning and development in these areas.
As a Senior Engineer on this team, you will act as a core technical leader. You will work cross-functionally to help integrate services with our instrumentation libraries, support product teams, and actively investigate incidents to identify our observability gaps.
Responsibilities
- Champion observability best practices, acting as an educator who can effectively correct anti-patterns and teach other engineering teams how to build robust, standardized instrumentation.
- Be an expert in running services in production environments.
- Contribute to the process of designing services for high growth and high availability.
- Provision, configure, and monitor cloud-native infrastructure and services.
- Design, build, and maintain scalable observability infrastructure using tools like Terraform.
- Troubleshoot performance and operational issues.
- Automate operational tasks and improve scripts.
- Assist with and provide feedback for performance testing and automation.
- Actively participate in major incident response to diagnose root causes and identify critical gaps in current telemetry tooling.
- Act as a technical leader, driving cross-team initiatives to improve instrumentation and observability standards across the organization.
Qualifications
- 5+ years of platform engineering, SRE, or DevOps experience.
- Experience with cloud infrastructure such as AWS, Google Cloud, or Azure.
- Expertise in the Datadog ecosystem (Metrics, Logs, Traces, and Error Tracking), including establishing alerting standards, implementing tagging taxonomies, and managing Datadog configurations via Terraform.
- Strong coding skills in Node.js or Golang.
- Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
- A data-driven approach to debugging complex, cross-service performance bottlenecks.
- Deep understanding of microservice architecture and best practices.
- Experience coaching and mentoring junior engineers.
- Proven ability to lead cross-functional technical initiatives and collaborate seamlessly with multiple engineering teams.
- Hands-on experience with OpenTelemetry (OTel), Vector, or similar frameworks for instrumenting applications.
Below is the annual base salary range for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies.
Annual base salary range: $147,000 to $202,000 USD.
The Okta Experience
- Supporting Your Well-Being
- Driving Social Impact
- Developing Talent and Fostering Connection + Community
O…