DevOps Engineer; Waterloo
Listed on 2026-06-27
IT/Tech
SRE/Site Reliability, IT Infrastructure, Cloud Computing: Infrastructure & Operations
Location: Southwestern Ontario
Role Overview
Proxylink is partnering with a global leader in data analytics and large-scale software platforms to hire a DevOps Platform Manager. This role will lead the evolution of cloud infrastructure and platform capabilities that support mission-critical, high-scale systems used by enterprise and government customers worldwide.
This is a hands‑on leadership role combining technical depth with team leadership. You will shape platform strategy, mentor engineers, and directly contribute to architecture, automation, and reliability initiatives.
The Opportunity
You will oversee the DevOps and platform engineering function responsible for delivering secure, scalable, and highly reliable infrastructure. The platform supports complex distributed systems, large-scale data processing, and mission‑critical applications operating across multiple cloud environments.
Key Responsibilities
Leadership & Team Development
- Lead, mentor, and grow a high‑performing DevOps and platform engineering team.
- Foster a culture of ownership, collaboration, and continuous improvement.
- Drive hiring, performance management, and career development initiatives.
- Own the roadmap for infrastructure, platform tooling, and deployment pipelines.
- Define best practices for reliability, observability, scalability, and security.
- Lead modernization and automation initiatives across cloud environments.
- Participate directly in architecture design, coding, and complex troubleshooting.
- Establish standards for Infrastructure‑as‑Code, CI/CD, and automation.
- Conduct design and code reviews to maintain high engineering quality.
- Manage major infrastructure and platform initiatives from planning to delivery.
- Balance priorities across engineering, product, and business stakeholders.
- Identify risks, manage dependencies, and ensure successful execution.
- Partner with product, security, and engineering teams to align platform initiatives with business objectives.
- Communicate technical strategy and trade‑offs to technical and non‑technical audiences.
- Monitor and optimize cloud infrastructure usage and performance.
- Implement cost‑effective scaling strategies while maintaining reliability and security.
- Experience leading DevOps, SRE, or platform teams in high‑growth or enterprise environments.
- Proven track record delivering cloud infrastructure at scale.
Strong experience with several of the following:
- Services written in Go or Python
- Infrastructure‑as‑Code (Terraform, Pulumi, or similar)
- Container orchestration platforms (Kubernetes, Nomad, or equivalent)
- Serverless technologies (AWS Lambda or similar)
- CI/CD pipeline design and automation tooling
- Monitoring, logging, and incident response practices
- Deep experience designing and operating secure, scalable distributed systems.
- Strong understanding of reliability engineering and cloud security principles.
- Experience managing projects across the full software lifecycle.
- Strong stakeholder management and communication skills.
- Self‑motivated and comfortable in fast‑paced, high‑impact environments.
- Passionate about operational excellence, automation, and continuous learning.