More jobs:
Job Description & How to Apply Below
PD) is a global leader in digital operations management. Trusted by nearly half of both the Fortune 500 and the Forbes AI 50, as well as approximately two-thirds of the Fortune 100, Pager Duty is essential for delivering always‑on digital experiences to modern businesses.
Join us. At Pager Duty, you'll tackle complex problems, collaborate with kind and ambitious people, and help build a more equitable world—all in a flexible, award‑winning workplace.
As an intermediate Site Reliability Engineer on the Core Infrastructure team in our Toronto office, you'll help build and operate the foundational infrastructure that powers Pager Duty's real‑time digital operations platform. Our systems support millions of events and alerts daily, enabling customers to detect, respond to, and resolve incidents quickly and reliably.
You'll work at the intersection of platform evolution and operational excellence, building and evolving foundational network, compute, and ingress infrastructure while scaling and hardening existing systems. Your work will directly impact the reliability, scalability, and security of the services our customers rely on to keep their businesses running as Pager Duty continues to grow across products, regions, and customer use cases.
Key Responsibilities
Support and improve foundational infrastructure, including networking, compute platforms, Kubernetes clusters, and ingress/traffic management systems.
Contribute to the reliability and scalability of Pager Duty's core platform by hardening existing systems and supporting the rollout of new infrastructure capabilities.
Participate in agile rituals (standups, planning, retros) and communicate progress/risks early.
Stay current on technical trends to suggest innovative tools and approaches to interesting problems.
Monitor system health using metrics, logs, and alerts, and participate in 24/7 on‑call rotations to help detect, respond to, and resolve incidents.
Basic Qualifications
3+ years of experience in Site Reliability Engineering, Dev Ops, or Platform Engineering roles.
Hands‑on experience operating Linux‑based systems in production environments.
Working knowledge of networking fundamentals, such as load balancing, DNS, TLS, and ingress traffic flow.
Experience with container orchestration (e.g., EKS, Kubernetes).
Experience working on cloud‑native infrastructure (e.g., AWS, GCP, Azure), including networking and compute concepts.
Proficiency in at least one programming language (e.g., Python, Ruby, Go, etc.).
Experience with Infrastructure as Code (e.g., Terraform, Cloud Formation).
Preferred Qualifications
Experience with AWS cloud networking concepts such as VPCs, subnets, routing, security groups, and load balancers.
Experience operating or contributing to production Kubernetes platforms (e.g., EKS), including cluster upgrades, networking, or ingress configuration.
Experience with monitoring, observability, and logging platforms (e.g., Data Dog, New Relic, Sumo Logic, Splunk, Prometheus, Grafana).
Familiarity with service meshes, ingress controllers, or API gateways (e.g., Envoy, Istio, NGINX).
The base salary range for this position is 115, CAD. This role may also be eligible for bonus, commission, equity, and/or benefits.
Our base salary ranges are determined by role, level, and location. The range, which is subject to change based on primary work location, reflects the minimum and maximum base salary we expect to pay newly hired employees for the position. Within the range, we determine pay for an individual based on a number of factors including market location, job‑related knowledge, skills/competencies and experience.
Your recruiter can share more about the specific offerings for this role, as well as the salary range for your primary work location during the hiring process.
Pager Duty is a flexible, hybrid workplace. We embrace and encourage in‑person working as an integral part of our culture. Both our employees and external research tell us that co‑located collaboration strengthens connections, drives innovation, and accelerates learning.
This role is expected to come into our Toronto office 2 days per week , so you can…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×