Senior Site Reliability Engineer
Job in
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-06-14
Listing for:
LiveRamp
Full Time
position Listed on 2026-06-14
Job specializations:
-
IT/Tech
Systems Engineer, SRE/Site Reliability, Cloud Computing: Infrastructure & Operations
Job Description & How to Apply Below
## Senior Staff Site Reliability Engineer Apply locations:
San Francisco time type:
Full time posted on:
Posted Yesterday job requisition :
JR012202
** Live Ramp is the data collaboration platform of choice for the world’s most innovative companies. A groundbreaking leader in consumer privacy, data ethics, and foundational identity, Live Ramp is setting the new standard for building a connected customer view with unmatched clarity and context while protecting precious brand and consumer trust. Live Ramp offers complete flexibility to collaborate wherever data lives to support the widest range of data collaboration use cases—within organizations, between brands, and across its premier global network of top-quality partners.
**** Hundreds of global innovators, from iconic consumer brands and tech giants to banks, retailers, and healthcare leaders turn to Live Ramp to build enduring brand and business value by deepening customer engagement and loyalty, activating new partnerships, and maximizing the value of their first-party data while staying on the forefront of rapidly evolving compliance and privacy requirements.
** The Global SRE team is responsible for owning and supporting deployments of global products, and providing first line operational support. We are looking for a Senior Staff Site Reliability Engineer who will set the technical direction for reliability engineering across Live Ramp's global infrastructure. This is a senior individual contributor role with organization-wide scope—you will define and own the SRE strategy, influence product and platform architecture decisions, and raise the engineering bar across multiple teams and regions.
** You Will:
*** Define and own the SRE strategy across the organization—SLOs/SLAs, error budgets, and operational excellence frameworks
* Oversee automation of critical areas to mitigate risk and align with engineering priorities
* Develop and own some of the most complex software infrastructure spanning multiple products and services
* Drive engineering-wide system design, automation, and performance optimization standards
* Lead distributed systems architecture reviews and kickoffs across engineering teams
* Drive high-quality API and interface designs across multiple teams
* Drive overall architecture improvements across multiple products and services
* Shape product and service design vision inside engineering, anticipating the unexpressed needs of internal teams
* Understand global industry and market trends and apply them to deliver superior infrastructure solutions
* Maintain a complete view of Live Ramp products and how SRE OKRs support the product roadmap
* Contribute technical due diligence to M&A evaluations of potential acquisitions and partnerships
* Serve as the escalation point of last resort for high-impact production incidents globally, leading postmortems with org-wide action items
* Establish and enforce production readiness standards across engineering
* Champion Fin Ops strategy across Kubernetes, cloud resources, and database infrastructure
* Mentor Staff Engineers and provide technical feedback and guidance
* Hold peers accountable for on-time, quality delivery
* Represent Live Ramp's best interests in the broader technology ecosystem
*
* About you:
*** B.S./M.S. in Computer Science, Software Engineering, or equivalent
* 10+ years in SRE, production engineering, or platform engineering; 3+ years at senior or staff level
* Expert in Infrastructure as Code (Terraform) at scale across multi-environment, multi-team setups
* Proven experience designing and operating highly available, globally distributed systems
* Deep Kubernetes expertise: internals, autoscaling, multi-tenant workload management, and rightsizing
* Advanced experience with real-time and No
SQL databases (Single Store, Scylla
DB, Cassandra, Dynamo
DB)
* Strong proficiency in Python and/or Go; able to build production-grade internal tooling adopted across teams
* Expertise in observability engineering—SLOs, SLI pipelines, and high-signal alerting systems
* Deep Fin Ops expertise: cost attribution, reserved capacity strategy, and cloud cost governance at scale
* Experience maturing CI/CD platforms…
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×