Senior Site Reliability Engineer
Listed on 2026-02-03
IT/Tech
Cloud Computing, SRE/Site Reliability, Systems Engineer, Data Engineer
Overview
Senior Site Reliability Engineer, Data Platform Engineering
The Wikimedia Foundation is looking for a Senior SRE to join our team, reporting to the Engineering Manager of the Data Platform Engineering SRE team. As a Senior SRE, you will be responsible for operating the systems supporting our data-oriented teams (Kubernetes, >6PB Hadoop, OpenSearch, Airflow, Superset, Kafka, etc.), helping design and implement new systems and solutions, and ensuring that our systems scale to meet demand.
In this role, you will interact with our client teams, support them in whatever adventure they are on, investigate incidents, migrate services to Kubernetes, …
You are responsible for:
- Simplifying our operations by standardizing how we deploy services and how we benefit from virtualizing and containerizing our applications
- Supporting our users, removing roadblocks, and making them more productive!
- Monitoring systems and services, and optimizing performance and resource utilization
- Proactively identifying sources of instability in distributed systems and analyzing how complex systems fail from a reliability and resilience perspective.
- Automation and streamlining of tasks, as well as identifying process gaps
- Collaborating with a global and asynchronously communicating team (don’t worry if you have never worked remotely, we’ll help you get used to it)
- Mentoring peers in your areas of technical and operational strength
- Traveling domestically, and potentially internationally, 2-3 times a year for team gatherings and conferences
Our backlog has even more details.
Skills and Experience:
- 5+ years of experience in an SRE/Operations/DevOps or software engineering role
- Experience with running applications and services at scale
- Proficiency with shell and a programming language used in an SRE/Operations engineering context (Python, Go, Ruby, etc.)
- Comfort with open source configuration management and orchestration tools (Puppet, Ansible, Terraform, etc.)
- Communicative technical English
- Virtualization of data and compute
Qualities that are important to us:
- Share our values, appreciate our code of conduct, support our team norms, and work in accordance with all three
- Customer-oriented. We’re here to help, not to block.
- Strong English language skills and ability to work independently, as an effective part of a globally distributed team
- Comfortable working in the open
- Passionate about supporting our communities
Additionally, we’d love it if you have:
- Experience with Kubernetes and Ceph
- Experience with operating a data platform
About the Wikimedia Foundation
The Wikimedia Foundation is the nonprofit organization that operates Wikipedia and the other Wikimedia free knowledge projects. Our vision is a world in which every single human can freely share in the sum of all knowledge. We believe that everyone has the potential to contribute something to our shared knowledge, and that everyone should be able to access that knowledge freely.
We host Wikipedia and the Wikimedia projects, build software experiences for reading, contributing, and sharing Wikimedia content, support the volunteer communities and partners who make Wikimedia possible, and advocate for policies that enable Wikimedia and free knowledge to thrive.
The Wikimedia Foundation is a charitable, not-for-profit organization that relies on donations. We receive donations from millions of individuals around the world, with an average donation of about $15. We also receive donations through institutional grants and gifts. The Wikimedia Foundation is a United States 501(c)(3) tax-exempt organization with offices in San Francisco, California, USA.
As an equal opportunity employer, the Wikimedia Foundation values having a diverse workforce and continuously strives to maintain an inclusive and equitable workplace. We encourage people with a diverse range of backgrounds to apply. We do not discriminate against any person based upon their race, traits historically associated with race, religion, color, national origin, sex, pregnancy or related medical conditions, parental status, sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an…