×
Register Here to Apply for Jobs or Post Jobs. X

Infrastructure Site Reliability Engineer

Job in North Battleford, Saskatoon, Saskatchewan, S7W, Canada
Listing for: Remoteworldwide
Full Time position
Listed on 2026-02-20
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, SRE/Site Reliability, Network Engineer
Job Description & How to Apply Below
Position: Staff Infrastructure Site Reliability Engineer
Location: North Battleford

Staff Infrastructure Site Reliability Engineer

Staff Infrastructure Site Reliability Engineer Posted: 04/05/2025

Anywhere in the world

Remote Senior

About the Team:

Netlify’s SRE team is scaling to meet the demands of our rapidly growing platform and user base. Our SRE team is responsible for ensuring the reliability, scalability, and efficiency of Netlify’s infrastructure while maintaining a focus on innovation and operational excellence. As a Staff Site Reliability Engineer, you will be at the forefront of driving organizational-level reliability strategies, shaping the direction of Netlify’s systems, and tackling complex, systemic challenges.

You will collaborate across teams to build a culture of operational excellence and deliver impactful solutions that support our mission to empower the next generation of web developers.

We are a remote-first, globally distributed group that values asynchronous communication, documentation, and a culture of transparency, empowerment, and collective ownership. Diversity and inclusion are at the heart of what we do, and we welcome team members from all backgrounds to bring their unique perspectives to our mission. Whether you’re launching a new phase of your career or growing an established one, Netlify offers a supportive environment where you can thrive while maintaining a healthy work-life balance.

What You’ll Do:

  • Lead high-impact reliability and infrastructure initiatives across the platform.
  • Drive the adoption of Infrastructure-as-Code and champion reliability-focused tooling and frameworks.
  • Manage all cloud infrastructure components, including instances, networking, DNS, Terraform automation, and Kubernetes.
  • Define and uphold architectural standards, best practices, and technical strategy for reliability at scale.
  • Provide mentorship to senior engineers and tech leads, fostering systems thinking and operational excellence.
  • Partner with Engineering, Product, and Executive teams to embed reliability into company-wide strategy.
  • Lead architecture reviews and provide oversight for critical infrastructure projects.
  • Develop and advocate for reliability metrics and SLO frameworks that align with business goals.
  • Participate in an on-call rotation and occasionally act as Incident Commander, providing technical leadership and system-level decision-making.

What You’ll Bring:

  • Deep expertise in cloud architecture, with hands-on experience designing and deploying global-scale solutions on AWS, Azure, or GCP.
  • Strong proficiency with Kafka or similar messaging systems, including deployment, scaling, and maintenance in multi-cloud environments.
  • Solid experience in database design, performance tuning, and maintenance for both relational and No

    SQL systems in high-throughput environments.
  • Skilled in programming and scripting languages such as Go or Python, with a focus on automation and infrastructure tooling.
  • A proven track record of leading large-scale, cross-team technical initiatives and delivering impactful infrastructure outcomes.
  • Proficiency in configuration management tools like Ansible, Chef, or Puppet.
  • Experience in managing CI/CD pipelines using tools such as Jenkins, Git Lab CI, Circle

    CI, or similar.
  • We welcome candidates based in Spain, Canada, or the UK for this position.
  • Excellent communication skills, with the ability to articulate complex technical strategies to executives and build consensus across diverse teams.
  • Demonstrated success in setting and scaling technical standards and best practices across large engineering organizations.

This role is a great fit if:

  • You think in systems. You’re curious about how infrastructure, networking, observability, and security connect—and enjoy breaking down complex challenges into clear, actionable strategies.
  • You’re comfortable writing code (especially in Go) and enjoy automating infrastructure workflows, building tools to reduce manual effort, and supporting reliable operations at scale.
  • You’ve collaborated on cross-functional initiatives—like operational readiness reviews, cloud migrations, or introducing monitoring standards—and know how to communicate clearly with both technical and non-technical teammates.
  • You take a thoughtful,…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary