×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer

Job in Toronto, Ontario, C6A, Canada
Listing for: ContactMonkey
Full Time position
Listed on 2025-12-14
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, SRE/Site Reliability, IT Support
Job Description & How to Apply Below
Position: Staff Site Reliability Engineer

Apply for the Staff Site Reliability Engineer role at Contact Monkey.

Hey there! We're Contact Monkey đź‘‹

Our mission? To power measurable employee engagement worldwide. And we'd love for you to join us!

About the job - Staff Site Reliability Engineer

You are not just building infrastructure—you are radically improving the daily lives and productivity of every engineer. As our Staff Engineer, Developer Experience, your primary mandate is to unlock engineering velocity. You will leverage your expertise in SRE and cloud infrastructure to eliminate friction, automate toil, and build the self‑service platforms that allow developers to ship code faster, test more robustly, and troubleshoot independently.

While you will ensure stability, security, and compliance (the essential housekeeping), your success will be measured by your measurable impact on the team's agility and innovation pipeline. You are the champion who enables engineers to focus on product, not platform.

Your impact
  • Engineering Velocity:
    Deliver robust, production‑mirroring environments that empower engineering teams to confidently test, develop, and experiment—reducing environment‑related defects and accelerating iteration cycles.
  • Cost & Performance Optimization:
    Lead infrastructure optimization with autoscaling and rightsizing initiatives to achieve significant cost savings and enhance service performance.
  • Release Automation:
    Automate build, test, and deployment pipelines to increase deployment frequency, reduce manual effort, and establish a high‑velocity CI/CD practice (Git Hub Actions).
  • Reliability & Insights:
    Build and maintain comprehensive observability platforms (Grafana, Datadog, Sentry, Prometheus, Loki) that provide actionable insights and improve service reliability and uptime.
  • Security & Agility:
    Own implementation of security best practices for identity and access management, and secrets management, ensuring agility and security in the absence of dedicated ops or security teams.
  • Infrastructure Governance:
    Champion Infrastructure‑as‑Code (Terraform, Terragrunt) and container orchestration best practices (Kubernetes) for stable, maintainable infrastructure.
  • Organizational Growth:
    Mentor peers, lead by example, and foster a culture of shared ownership, proactive problem‑solving, and continuous knowledge sharing.
  • Compliance:
    Embed regulatory controls (SOC2, GDPR) directly into infrastructure and operational processes for audit readiness.
  • Technical Modernization:
    Modernize legacy infrastructure with scalable, maintainable solutions aligned with long‑term business and technical goals.
About you
  • 7+ years in SRE, cloud infrastructure, or Dev Ops roles supporting complex distributed systems.
  • Expertise in observability, metrics, alerting, and incident response with a data‑driven approach.
  • Proficient with AWS, Terraform, Terragrunt, Kubernetes, and scripting.
  • A strong collaborator and product thinker who views developer experience as their primary product, focused on measurably improving the daily workflow and speed of feature teams.
  • Skilled in balancing rapid innovation with robust security and compliance.
  • Experience implementing compliance frameworks within infrastructure and operational processes.
How you can stand out
  • Exceptional Cross‑Functional Leadership: A highly open, approachable, and non‑confrontational collaborator who can engage effectively with individual engineers, engineering leaders, and confidently present and discuss strategic platform direction with senior leadership (VP and C‑level) at the appropriate business altitude.
  • Proven ability to lead platform initiatives with a documented track record of unlocking developer productivity and accelerating time‑to‑market.
  • Experience migrating legacy systems to cloud‑native, scalable platforms.
  • Track record of mentoring engineers and building collaborative, high‑impact teams.
  • Strong operational excellence and security mindset in cloud environments.
  • Experience with Go and/or Nix.
What we bring to the table
  • 100% employer‑paid benefits and a health spending account from day one
  • Work from anywhere in the world for up to 4 weeks
  • Stock option plan for a stake in our success
  • Generous vacation package to take…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary