Senior Technology Site Reliability Engineering Manager Job, San Francisco area, California, USA, IT/Tech

## Senior Technology Site Reliability Engineering Manager
Locations: San Francisco, New York, Santa Monica, Los Angeles, Palo Alto
Time type: Full time
Posted on: Posted Yesterday
Job requisition: 4346

Cooley is seeking a Senior Site Reliability Engineering Manager to join the Infrastructure & Development Operations team.

Position summary:

The Senior Technology Site Reliability Engineering (“SRE”) Manager is responsible for leading a team of SREs to ensure the reliability, scalability, and performance of the firm’s infrastructure and services. This role works with the DevOps, infrastructure, and development teams, applying engineering principles to operations in order to create scalable and resilient systems. In addition to being technically advanced, the SRE Manager will have a high degree of emotional intelligence and the ability to work as part of a team towards complex and layered objectives.

Specific duties and responsibilities include, but are not limited to, the following:
Position responsibilities:

* Define and execute the SRE strategic roadmap aligned with business goals, providing experienced leadership in developing solutions for highly scalable, highly available, hybrid cloud (IaaS, PaaS, SaaS) infrastructure patterns and platform integrations across physical colocations and hyperscalers (AWS and Azure)
* Build and mentor a high-performing SRE team, fostering a culture of trust, collaboration, and continuous improvement
* Partner with cross-functional leaders in infrastructure, DevOps, and application development to scale reliability practices across the enterprise
* Oversee incident response, root cause analysis, and postmortems with a focus on accountability and learning
* Establish and enforce service-level objectives (SLOs), service-level indicators (SLIs), and service-level agreements (SLAs)
* Drive proactive monitoring, alerting, observability, and capacity planning
* Lead automation initiatives for deployment, scaling, failover, and recovery
* Promote observability practices using tools like Prometheus, Grafana, Datadog, or Splunk
* Collaborate with development teams to build self-healing, fault-tolerant systems
* Champion reliability-first thinking across engineering and operations
* Encourage blameless postmortems and a learning-oriented incident culture
* Ensure compliance with security, risk, and regulatory requirements
* Serve as direct supervisor and mentor to direct reports
* Provide day-to-day supervision of direct reports, ensure compliance with assigned work hours, and monitor for compliance with all firm and department policies. Manage staffing coverage, and review and process time logs and time-off requests
* Support business professional development and continued educational opportunities
* In collaboration with immediate supervisor and CN HR, participate in hiring, performance appraisals, counseling, termination and other employee lifecycle events
* All other duties as assigned or required
Skills and experience:

Required:

* After orientation at Cooley LLP, exhibit proficiency in the Microsoft Office suite, iManage and other firm applications
* Ability to work extended and/or weekend hours, as required
* Ability to travel, as required
* 7+ years’ direct applicable experience (e.g., DevOps or Site Reliability Engineering) with 2+ years of exempt/management experience in relevant roles
* Experience managing cross-functional projects and SRE planning and programming
* Proficiency in Terraform and programming languages such as Python, Go, or Java
* Deep expertise in cloud platforms, particularly AWS, and container orchestration
* Strong background in distributed systems, performance tuning, and automation
* Hands-on experience with configuration management tools such as Puppet, Chef, or Salt

Preferred:

* Bachelor's Degree in Computer Science, Information Technology, Engineering, or associated discipline
* Experience working with advanced ETL data workflows including technologies such as AWS EMR, Azure Synapse, Azure Data Factory, or Apache Hive/Spark/Airflow
* Experience…

