Engineering Manager,Site Reliability Job Toronto area,Ontario Canada,IT/Tech

As Relay continues to scale, the reliability, performance, and resilience of our platform are no longer just technical concerns. They are core to our customer experience and business success.

Relay is building a platform that SMBs can rely on every day, and our Site Reliability Engineering team is at the core of making that possible. We’re looking for an Engineering Manager to lead our SRE team, responsible for the scalability, reliability, and robustness of Relay’s systems. This role is about more than running infrastructure or responding to incidents. It’s a leadership role at the intersection of technology, people, and the business.

You’ll guide a highly capable SRE team, help shape reliability shows up across the organization, and ensure our systems scale safely as customer impact and complexity increase.

If you’re excited by technically demanding environments and energized by building strong teams, healthy culture, and effective cross-functional partnerships, this role is built for you.

What You'll Be Doing

Lead, coach, and grow a high-performing Site Reliability Engineering team; support career development, technical excellence, and ownership
Own the reliability, scalability, and performance of Relay’s platform, ensuring our systems are resilient as the business grows
Lead and evolve SRE best practices, including incident management, on-call operations, SLIs/SLOs, and error budgets
Partner closely with Engineering,, and Data teams to ensure reliability and scalability are built into every feature we ship
Drive continuous improvement through post-incident reviews, root cause analysis, and preventative action
Guide infrastructure and platform investments to support long-term scalability, security, and operational efficiency
Define and track key reliability KPIs (e.g., uptime, latency, incident frequency, MTTR) and use data to inform priorities and decisions
Champion a culture of learning, operational excellence, and “running towards problems” across the engineering organization

Who You Are

You have 3+ years of experience managing or leading engineers and 6+ years of experience in Site Reliability Engineering, Platform Engineering, or Infrastructure roles
You have a strong track record of owning and improving system reliability, scalability, and performance in production environments
Experienced in improving observability, performance, or operational maturity at growing companies
You’ve led teams through incident response, postmortems, and reliability improvements, using data and clear accountability to drive better outcomes
You have a strong foundation in operating and scaling production systems in cloud environments (e.g. AWS), and modern infrastructure practices (IaC, CI/CD, monitoring, alerting)
You have a proven record of partnering with Product and Engineering leaders to balance delivery velocity with long-term reliability and operational excellence
You’re a leader who knows how to coach, motivate, and grow engineers while setting a high bar for ownership, quality and technical excellence
You’re highly collaborative and experienced in leading cross-functional initiatives that span engineering, product, and operations
You thrive in fast-paced, ambiguous environments and are comfortable leading through change as the platform and organization scale

Bonus Points

You’ve built or scaled an SRE or platform team at a growing fintech startup
You have experience supporting high-availability, customer-facing systems in fintech or other regulated environments
You’re familiar with SRE best practices such as SLOs, error budgets, and capacity planning at scale

The Interview Process:

Stage 1: A1-hour Google Meet call with a member of the People team
Stage 2: A 1-hour Google Meets video call with the hiring manager (VP of Engineering)
Stage 3: A 1-hour in-person interview with a member of Relay’s senior leadership team
Stage 4: A 1-hour live technical assessment via Google Meet, or in person, with a panel of interviewers (technical leaders & stakeholders)

Our Commitment To You:

Competitive salary and meaningful equity: Relay employees are Relay owners, complete with equity and a competitive salary.
Comprehensive health benefits: …


Increase/decrease your Search Radius (miles)



Job Posting Language

Engineering Manager, Site Reliability