Senior Site Reliability/Gitops Engineer
Listed on 2025-12-07
-
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability
Senior Site Reliability / Git Ops Engineer
Canonical – Cape Town, Western Cape, South Africa
About CanonicalCanonical is a leading provider of open source software and operating systems. Our platform, Ubuntu, is widely used in public cloud, data science, AI, engineering innovation, and IoT. We are a pioneer of global distributed collaboration with 1200+ colleagues in 75+ countries.
Job SummaryThe Information Systems (IS) team supports & maintains Canonical’s IT production services that are used by over 60 million Ubuntu users. As a Senior Site Reliability / Git Ops Engineer, you will drive operations automation, both in private and public clouds, using infrastructure as code and CI/CD practices.
Responsibilities- Drive automation, Git Ops in the team as an embedded tech lead.
- Collaborate with the IS architect to align solutions with the IS architecture vision.
- Design & architect services that IS can offer as products.
- Apply IaC to develop and improve automation practices within IS.
- Automate software operations for re‑usability across private and public clouds, considering distributed systems complexity.
- Maintain operational responsibility for Canonical’s core services, networks, and infrastructure.
- Develop skills in troubleshooting, capacity planning, performance investigation; set up and maintain observability tools such as Prometheus, Grafana, Elasticsearch, and build monitoring and alerting.
- Provide assistance and collaborate with globally distributed engineering, operations, and support peers.
- Receive uninterrupted development time for larger projects & automation of manual tasks.
- Share best practices with the team through design sessions, mentorship and collaborative work.
- Carry final responsibility for time‑critical escalations.
- Modern view on hosting architecture driven by IaC on private & public clouds.
- Product mindset focused on developing products rather than solutions.
- Experience with Python software development on large projects.
- Experience with Kubernetes or other container orchestration systems.
- Proven exposure to managing & deploying cloud infrastructure with code.
- Practical knowledge of Linux networking, routing, firewalls.
- Affinity with various Linux storage solutions from Ceph to databases.
- Hands‑on experience administering enterprise Linux servers.
- Extensive knowledge of cloud computing concepts and technologies.
- Bachelor’s degree or greater, preferably in computer science or related field.
- Excellent communication in English.
- Motivated to troubleshoot across the stack and ready to ask for help.
- Willingness to learn quickly and adapt to fast‑changing environments.
- Comfortable working in a distributed team.
- Passionate about open‑source, especially Ubuntu or Debian.
- Distributed work environment with twice‑yearly team sprints in person.
- Personal learning & development budget of US $ 2,000 per year.
- Annual compensation review.
- Recognition rewards.
- Annual holiday leave.
- Maternity and paternity leave.
- Team Member Assistance Program & Wellness Platform.
- Opportunities to travel to new locations to meet colleagues.
- Priority Pass and travel upgrades for long‑haul company events.
Mid‑Senior level
Employment TypeFull-time
Job FunctionEngineering and Information Technology
IndustriesSoftware Development
EEO StatementCanonical is proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and backgrounds creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.
#J-18808-LjbffrTo Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: