Director, Site Reliability Engineer
Listed on 2026-06-28
-
IT/Tech
Systems Engineer, Cloud Computing: Infrastructure & Operations, SRE/Site Reliability, IT Project Manager
Director, Site Reliability Engineering Technology at Mastercard
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
About the Role
The Business Operations (Biz Ops) team is seeking a Director, Site Reliability Engineer (SRE). The role of Business Operations Organization is to be the production readiness steward for Mastercard products. As a Business Operations SRE, we are responsible for ensuring that our platform is stable and healthy. We break down barriers to run our products by fostering developer run ownership and empowering developers to build resilient products.
We support our developers during the application build phase in software run principals that includes operational design, automation, capacity planning, monitoring that leads to fault-tolerant, scalable products. We see the big picture and help create and enforce operations standards while facilitating an agile and learning culture. We are seeking a highly motivated and experienced Site Reliability Engineer (SRE) Director to join our growing team.
You will play a critical role in ensuring the reliability, scalability, and performance of our applications, supporting essential services that power Mastercard's global operations. As a thought leader in your field, you will bring technical expertise, a passion for automation, and the ability to mentor.
Team Specific Skills
It is not expected that any single candidate would have expertise across all these areas, but a Biz Ops engineer will spend a bit of time throughout their career with all of these aspects of the role:
- Operational Readiness Architect:
Serve as the primary contact responsible for the overall application health, performance, and capacity. Support services before they go live through activities such as system design consulting, capacity planning and launch reviews. Partner with the development and product team of a new application to establish the right monitoring and alerting strategy and create the framework to achieve zero downtime during deployment. - Site Reliability Engineering:
Performs operability and resilience design and implements and maintains highly reliable and scalable infrastructure. Perform root cause analysis of incidents and collaborate with development teams to resolve issues. Stay up to date with the latest technologies and trends in SRE and cloud computing. Participate in on-call rotations and be available to respond to critical incidents. Complete end-to-end run ownership of the product.
Practice sustainable incident response and blameless post-mortems while taking a holistic approach to problem solving and optimizing time to recover. Automate data-driven alerts to proactively escalate issues. Work with development teams to establish SLOs and improve reliability. - Dev Ops/Automation:
Tackle complex development, automation, and business process problems. Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement. Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in Dev Ops automation and best practices. Performs operational and resilience Design and implements solutions for capacity planning and performance optimization.
Increase automation and tooling to reduce toil and manual intervention. - ITSM Practices:
Analyses ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns.
Role Qualifications
The ideal candidate will have experience in many of these areas:
- BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).