Site Reliability Engineer
Listed on 2026-05-12
IT/Tech
Systems Engineer, Cloud Computing, Network Engineer, IT Support
Say hello to opportunities.
It's not every day that you consider starting a new career. We're RingCentral, and we're happy that someone as talented as you is considering this role. First, a little about us: we're a $2 Billion annual revenue company with double-digit Annual Recurring Revenue (ARR) and a $93 Billion market opportunity in UCaaS, Contact Center and AI‑powered adjacencies. We invest more than $250 million annually to ensure our AI‑enabled technology and platforms meet or exceed the needs of our customers.
RingSense AI is our proprietary AI solution. It's designed to fit the business needs of our customers, orchestrated to be accurate and precise, and built on the same open platform principles we apply to our core software solutions.
We are seeking a Site Reliability Engineer to help operate and improve large-scale communications infrastructure powering SIP-based voice and media services.
Our platform handles high volumes of real-time sessions globally across distributed infrastructure. You would have the opportunity to work on an experienced team that operates across SIP signaling, media processing, networking, and infrastructure.
The team deploys, monitors, and troubleshoots distributed services while contributing to the observability and automation that keeps them reliable. The work is technically challenging and contributions to tooling and process have a real impact on how the platform operates.
Responsibilities
- Operate and maintain Linux-based telephony and platform services in production
- Troubleshoot SIP signaling and RTP media flows, including call routing, provisioning, registration, and signaling behavior
- Diagnose issues across the network stack affecting real-time voice and media traffic, including analysis of packet and signaling flows
- Deploy and administer Kubernetes services using Helm and GitOps workflows (e.g. CI/CD, FluxCD), including tracing and debugging configuration through layered rendering pipelines
- Manage stateful and pinned workloads, including understanding of Kubernetes scheduling primitives such as taints, tolerations, and node affinity
- Monitor systems and participate in on‑call incident response for production infrastructure
- Implement production changes using testing, rollback planning, and risk mitigation practices
- Contribute to automation, observability, and operational tooling improvements
- Coordinate with infrastructure, network, storage, and platform teams to resolve cross‑domain issues and maintain highly available global services
- Strong background administering UNIX/Linux systems and troubleshooting via the command line
- Solid grasp of TCP/IP networking fundamentals including routing, NAT, load balancing, and container networking
- Experience supporting SIP-based VoIP or real-time communications systems, with strong understanding of SIP proxies, applications, session border controllers, and RTP media servers
- Familiarity with Git-based version control systems (e.g. GitLab) and common repository workflows
- Experience deploying services using GitOps automation (e.g. FluxCD, CI/CD)
- Skilled in analyzing network traffic using Wireshark, tcpdump, or similar tools
- Comfortable working with observability platforms including Prometheus, Grafana, Loki, and the ELK stack
- Hands-on experience operating containerized platforms using Kubernetes, including interaction with the Kubernetes API, container registries, and Helm-based deployments
- Working knowledge of persistent storage in Kubernetes (e.g. PVCs)
- Experience operating infrastructure in AWS or GCP, including services such as S3, EC2, CloudFront, and certificate/secret management
- Familiarity with change management processes and production change approval workflows
- Experience automating operational tasks using Python or shell scripting
- Knowledge of FoIP signaling flows, T.30/T.38 protocols, and related troubleshooting
- Experience with message brokers, media routing, or session‑state management in distributed systems
- Operating large-scale distributed systems across multiple regions
- Familiarity with hybrid cloud/bare-metal environments
- Exposure to infrastructure automation tools such as Ansible or Terraform
- Understa…