Site Reliability Engineer Job Denver area,Colorado USA,IT/Tech

Say hello to opportunities.

It's not every day that you consider starting a new career. We're Ring Central, and we're happy that someone as talented as you is considering this role. First, a little about us, we're a $2 Billion annual revenue company with double digit Annual Recurring Revenue (ARR) and a $93 Billion market opportunity in UCaaS, Contact Center and AI‑powered adjacencies. We invest more than $250 million annually to ensure our AI‑enabled technology and platforms meet or exceed the needs of our customers.

Ring Sense AI is our proprietary AI solution. It's designed to fit the business needs of our customers, orchestrated to be accurate and precise, and built on the same open platform principles we apply to our core software solutions.

We are seeking a Site Reliability Engineer to help operate and improve large scale communications infrastructure powering SIP based voice and media services.

Our platform handles high volumes of realtime sessions globally across distributed infrastructure. You would have the opportunity to work on an experienced team that operates across SIP signaling, media processing, networking, and infrastructure.

The team deploys, monitors, and troubleshoots distributed services while contributing to the observability and automation that keeps them reliable. The work is technically challenging and contributions to tooling and process have a real impact on how the platform operates.

Responsibilities

Operate and maintain Linux based telephony and platform services in production
Troubleshoot SIP signaling and RTP media flows, including call routing, provisioning, registration, and signaling behavior
Diagnose issues across the network stack affecting realtime voice and media traffic, including analysis of packet and signaling flows
Deploy and administer Kubernetes services using Helm and Git Ops workflows (e.g. CI/CD, FluxCD), including tracing and debugging configuration through layered rendering pipelines
Manage stateful and pinned workloads, including understanding of Kubernetes scheduling primitives such as taints, tole rations, and node affinity
Monitor systems and participate in on‑call incident response for production infrastructure
Implement production changes using testing, rollback planning, and risk mitigation practices
Contribute to automation, observability, and operational tooling improvements
Coordinate with infrastructure, network, storage, and platform teams to resolve cross‑domain issues and maintain highly available global services

Requirements

Strong background administering UNIX/Linux systems and troubleshooting via the command line
Solid grasp of TCP/IP networking fundamentals including routing, NAT, load balancing, and container networking
Experience supporting SIP based VoIP or realtime communications systems, with strong understanding of SIP proxies, applications, session border controllers, RTP media servers
Familiarity with Git based version control systems (e.g. Git Lab) and common repository workflows
Experience deploying services using Git Ops automation (e.g. FluxCD, CI/CD)
Skilled in analyzing network traffic using Wireshark, tcpdump, or similar tools
Comfortable working with observability platforms including Prometheus, Grafana, Loki, and the ELK stack
Hands on experience operating containerized platforms using Kubernetes, including interaction with the Kubernetes API, container registries, and Helm-based deployments
Working knowledge of persistent storage in Kubernetes (e.g. PVCs)
Experience operating infrastructure in AWS or GCP, including services such as S3, EC2, Cloud Front, and certificate/secret management
Familiarity with change management processes and production change approval workflows
Experience automating operational tasks using Python or shell scripting

Preferred Experience

Knowledge of FoIP signaling flows, t.30/t.38 protocols, and related troubleshooting
Experience with message brokers, media routing, or session‑state management in distributed systems
Operating large scale distributed systems across multiple regions
Familiarity with hybrid cloud/baremetal environments
Exposure to infrastructure automation tools such as Ansible or Terraform
Understa…