×
Register Here to Apply for Jobs or Post Jobs. X

Lead Site Reliability Engineer

Job in Toronto, Ontario, M5A, Canada
Listing for: 0000050007 Royal Bank of Canada
Full Time position
Listed on 2026-02-26
Job specializations:
  • IT/Tech
    Cloud Computing, IT Support
Job Description & How to Apply Below

Job Description

What is the Opportunity?City National Bank (CNB), an RBC company, is seeking a Lead Site Reliability Engineer, who will be responsible for supporting CNB digital and corporate applications along with the implementation of Site Reliability Engineering solutions.

As a Lead SRE, you will play a critical role in ensuring the reliability, scalability, and performance of key applications, balancing production support responsibilities with continuous improvement initiatives. The ideal candidate will have expertise in agile application development, operations, technology lifecycle management, infrastructure, automation, to reduce toil, improve observability, resolve complex production incidents, address underlying root causes while fostering a culture of continuous learning through blameless postmortems, building resilience, reliability, and operational excellence and should be able to take on a production support role and part with the SRE teams in Digital, Corporate Functions and Data Services.
What will you do?
  • Advocate for automation and Dev Ops best practices, fostering an SRE mindset within the team.
  • Lead the development of SRE solutions, focusing on automation, monitoring, alerting, machine learning-based anomaly detection, self-healing, and reliability testing.
  • Implement advanced monitoring, alerting, and automated remediation strategies to prevent incidents before they impact business operations.
  • Collaborate with teams to enhance platform infrastructure, improving service resilience, reliability, quality, and time-to-market for software solutions.
  • Improve and optimize Incident, Problem, and Change management processes, to improve MTTR, Incident avoidance and resilience.
  • Oversee technology lifecycle management (server patching, certificate renewals, risk remediation) with a strong focus on automation-first principles.
  • Define and maintain Service Level Objectives (SLOs) and ensure availability targets for mission-critical applications.
  • Ensure compliance with regulatory and security requirements, including segregation of duties for sensitive environments.
  • Stay ahead of emerging technologies, leveraging continuous learning opportunities to drive innovation and efficiency.
  • Provide hands-on application production support, including off-hours coverage as needed.
  • What do you need to succeed?
    Must-have:
  • 5+ years of experience in Application Support, Software Development (SDLC), and Operations.
  • Strong proficiency in at least two programming languages (Java, Python, .NET, C, C++, C#).
  • Experience with SQL and No SQL database technologies.
  • Expertise in SRE, Dev Ops, OnPrem, Hybrid, Cloud native platforms
  • Exposure to Job Scheduling, Managed File Transfers, and Data Services.
  • Proven track record of implementing resilient IT solutions, driving continuous service improvements, and enhancing production reliability through automation and best practices.
  • Advanced experience in a variety of environments (Linux, Windows, Databases, Cloud, distributed and mainframe, business workflows, and Services/APIs).
  • Hands-on experience in a variety of Dev Ops / SRE tools (Ansible, Dynatrace, Moogsoft, Pager Duty, Service Now, Elastic, Logstash, Kibana, Logic Monitor, Jenkins, Cucumber, CA Work Automation, Power BI, ETL related tools etc.).
  • Must have excellent communication, analytical and problem-solving skills to diagnose, resolve complex production incidents and lead blameless postmortems to identify & address root causes.
  • A self-starter in taking lead roles for internal initiatives for operations excellence
  • Nice-to-have:
  • Prior experience leading SRE functions in the financial services industry.
  • Knowledge of Digital Identity Access Management, Internet / Mobile Banking Platforms, Microservices, Data Services, Test Automation and Corporate applications (HR, Finance, Risk, Compliance etc) is preferred.
  • Basic or advanced knowledge of Artificial Intelligence tools and techniques.
  • What’s in it for you?We thrive on the challenge to be our best, progressive thinking to keep growing, and working together to deliver trusted advice to help our clients thrive and communities prosper. We care about each other, reaching our potential, making a…
    Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
    To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
     
     
     
    Search for further Jobs Here:
    (Try combinations for better Results! Or enter less keywords for broader Results)
    Location
    Increase/decrease your Search Radius (miles)

    Job Posting Language
    Employment Category
    Education (minimum level)
    Filters
    Education Level
    Experience Level (years)
    Posted in last:
    Salary