More jobs:
Senior Technology Site Reliability Engineering Manager
Job in
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-02-25
Listing for:
Cooley LLP
Full Time
position Listed on 2026-02-25
Job specializations:
-
IT/Tech
IT Project Manager, Cloud Computing
Job Description & How to Apply Below
San Francisco:
New York:
Santa Monica:
Los Angeles:
Palo Altotime type:
Full time posted on:
Posted Yesterday job requisition 4346
Senior Technology Site Reliability Engineering Manager Cooley is seeking a Senior Site Reliability Engineering Manager to join the Infrastructure & Development Operations team.
*
* Position summary:
** The Senior Technology Site Reliability Engineering (“SRE”) Manager is responsible for leading a team of SRE’s to ensure the reliability, scalability, and performance of the firm’s infrastructure and services. This role works with the Dev Ops, infrastructure, and development teams, applying engineering principles to operations in order to create scalable and resilient systems. In addition to being technically advanced, the SRE Manager will have high degree of emotional intelligence and the ability to work as a team towards complex and layered objectives.
Specific duties and responsibilities include, but are not limited to, the following:
*
* Position responsibilities:
*** Define and execute the SRE strategic roadmap aligned with business goal, providing experienced leadership in developing solutions for highly scalable, highly available, hybrid cloud (IaaS, PaaS, SaaS) infrastructure patterns and platform integrations across physical colocations and hyperscalers (AWS and Azure)
* Build and mentor a high-performing SRE team, fostering a culture of trust, collaboration, and continuous improvement
* Partner with cross-functional leaders in infrastructure, Dev Ops, and application development to scale reliability practices across the enterprise
* Oversee incident response, root cause analysis, and postmortems with a focus on accountability and learning
* Establish and enforce Service-level objectives (SLOs), service-level indicators (SLI’s), and service-level agreements (SLA’s)
* Drive proactive monitoring, alerting, observability, and capacity planning
* Lead automation initiatives for deployment, scaling, failover, and recovery
* Promote observability practices using tools like Prometheus, Grafana, Data Dog, or Splunk
* Collaborate with development teams to build self-healing, fault-tolerant systems
* Champion reliability-first thinking across engineering and operations
* Encourage blameless postmortems and a learning-oriented incident culture
* Ensure compliance with security, risk, and regulatory requirements
* Serve as direct supervisor and mentor to direct reports
* Provide day-to-day supervision of direct reports, ensure compliance with assigned work hours and monitor for compliance with all firm and department policies. Manage staffing coverage, review and process time logs/time off requests
* Support business professional development and continued educational opportunities
* In collaboration with immediate supervisor and CN HR, participate in hiring, performance appraisals, counseling, termination and other employee lifecycle events
* All other duties as assigned or required
** Skills and experience****:
*
* Required:
* After orientation at Cooley LLP, exhibit proficiency in the Microsoft Office suite, iManage and other firm applications
* Ability to work extended and/or weekend hours, as required
* Ability to travel, as required
* 7+ years’ direct applicable experience (e.g., Dev Ops or Site Reliability Engineering) with 2+ years of exempt/management experience in relevant roles
* Experience managing cross-functional projects and SRE planning and programing
* Proficiency in Terraform and programming languages such as Python, Go, or Java
* Deep expertise in cloud platforms, particularly AWS, and container orchestration
* Strong background in distributed systems, performance tuning, and automation
* Hands-on experience with configuration management tools such as Puppet, Chef, or Salt Preferred:
* Bachelor's Degree in Computer Science, Information Technology, Engineering, or associated discipline
* Experience working with advanced ETL data workflows including technologies such as AWS EMR, Azure Synapse, Azure Data Factory, or Apache Hive/Spark/Airflow
* Experience…
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×