Cloud Operations Engineer
Job in
Toronto, Ontario, M5A, Canada
Listing for:
Extreme Networks
Full Time
position
Listed on 2026-02-16
Job specializations:
-
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability
Job Description & How to Apply Below
Position: Cloud Operations Engineer (9986)
We are seeking a highly skilled and experienced
Staff Cloud Operations Engineer to join our growing Cloud Operations team. In this critical role, you will be responsible for designing, implementing, and optimizing our comprehensive monitoring and alerting strategy across our cloud infrastructure and applications. You will drive proactive identification of issues, ensure system health, and contribute significantly to our operational excellence and reliability goals.
We're looking for the best and the brightest 'A' players who want to
make a difference doing a job they love.
Responsibilities
Manage and maintain Extreme Cloud product and servicesParticipate in developing the Edge Cloud Support SystemTroubleshoot and follow up on Cloud infrastructure / application related issuesParticipate in continuous cloud service operations with US, EU, and China teams.Communicate with Dev/QA as well as external carriers to resolve and prevent issues.Improve and implement deployment automation platform for Kubernetes based microservices.Improve service availability and scalability through tuning, automation, tools, and process.Analyze service performance, identify bottleneck, and provide actionable improvement plans.Provide 24
* 7 support for Edge Cloud products and servicesParticipate in cloud security and compliance implementation.Ideal Qualifications
BS level technical degree required;
Computer Science or Engineering background preferred.5+ years of experience in a Cloud Ops / Dev Ops role.Hands-on experience with AWS or any public cloud (Azure, GCP etc).Hands-on experience with container-based architecture and deployment (Docker, Kubernetes.)Hands-on experience with deployment automation development (ArgoCD, Terraform, Helm).Experience in diagnosing and resolving complex application problems.Working Knowledge of Linux, security, and networking fundamentals.Working knowledge of Elasticsearch, Postgre
SQL, Redis, Ignite, Kafka and Rabbit
MQ.Comfortable working within a distributed team located in multiple time zones.We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here: