Senior SRE: AI/ML HPC Infra & GPU Cluster

Job in Toronto, Ontario, C6A, Canada
Listing for: Boson AI
Full Time position
Listed on 2026-02-13
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 80,000 - 100,000 CAD yearly
Job Description & How to Apply Below
A leading technology company in Toronto is seeking a Senior Site Reliability Engineer to manage one of the most advanced GPU clusters, ensuring optimal performance and reliability. The ideal candidate will have over 5 years of experience and proficiency in Linux systems, Kubernetes, and Ceph management, alongside capabilities in automation and infrastructure-as-code tools like Ansible or Terraform. If problem-solving and continuous learning excite you, we want to hear from you.
Position Requirements
10+ years of work experience