Manager of Site Reliability Engineering
Job Description & How to Apply Below
Elevate platform performance as the Manager of Site Reliability Engineering at Moneris, based in Toronto with hybrid work options. Lead a dedicated team to enhance service reliability and operational excellence.
In this crucial leadership role, you will oversee SRE teams ensuring the availability, performance, and resiliency of Moneris’ platforms. Collaborate closely with Development, Dev Ops, Infrastructure, and Security teams to drive automation and reduce operational toil through effective incident response and observability. Your advanced leadership will shape SRE maturity and impactful engineering standards.
Key Responsibilities:
• Manage SRE engineers across critical applications and platforms
• Implement SRE practices like SLIs, SLOs, and incident response
• Oversee operations including on-call rotations and problem management
• Collaborate with Development and Dev Ops for system designs
• Lead observability strategy with monitoring and logging standards
Requirements:
• 8+ years in senior technical roles with distributed systems
• 3+ years of experience leading technical teams
• Strong knowledge of SRE principles and cloud platforms
• Experience with observability tools like Dynatrace or Datadog
• Solid background in Linux-based systems and automation
This role is key in enhancing Moneris’ reliability and performance through strategic leadership and technical expertise.
#J-18808-Ljbffr
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×