Director of Site Reliability Engineering
Listed on 2025-12-21
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability, IT Project Manager
Base pay range: $200,000 - $260,000 per year.
We are seeking a dynamic and innovative Director of Site Reliability Engineering to join our growing team. This role is pivotal in maintaining the stability and efficiency of our cutting‑edge technology services, ensuring that our systems are always online and performant. The successful candidate will be responsible for leading a talented team of engineers, developing and implementing site reliability best practices, and driving continuous improvement initiatives.
Responsibilities
- Lead, mentor, and manage a high‑performing team of Site Reliability Engineers.
- Develop and implement best practices for system reliability, scalability, operability, and performance.
- Collaborate with engineering teams to define service level objectives, ensure we are exceeding them, and implement strategies to improve upon them.
- Drive the design and deployment of our multi‑region architectures and on‑prem deployments.
- Use expertise in Kubernetes and CloudFormation to automate and innovate.
- Oversee compliance with frameworks such as FedRAMP and SOC 2.
- Develop a deep understanding of our AI/ML Infrastructure to ensure optimal performance and reliability.
- Work closely with other teams to identify and correct bottlenecks in the delivery process.
- Spearhead incident management, ensuring swift resolution, comprehensive post‑mortem investigations, and effective preventative measures.
Qualifications
- Bachelor’s degree in Computer Science, Engineering, or related field.
- Minimum of 5 years of experience in Site Reliability Engineering leadership and 10+ years of SRE/Infrastructure/DevOps experience.
- Proven leadership experience managing high‑performing engineering teams.
- Extensive experience with Kubernetes, CloudFormation, and multi‑region architectures.
- In‑depth understanding of compliance frameworks such as FedRAMP and SOC 2.
- Prior experience in a startup environment is highly desirable.
- Proficiency in AI/ML Infrastructure and on‑prem deployments.
- Exceptional problem‑solving skills and attention to detail.
- Excellent communication and interpersonal skills.
- Proven ability to thrive in a fast‑paced, dynamic environment.
Benefits
- Competitive Base Salary + Stock options.
- Company‑paid health plan for employees.
- Flexible hours.
- Very generous PTO.
- Dental & Vision, FSA, HSA.
- Small team, autonomy.
- Many more great perks!
Jobot is an Equal Opportunity Employer. We provide an inclusive work environment that celebrates diversity, and all qualified candidates receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, age (40 and over), disability, military status, genetic information, or any other basis protected by applicable federal, state, or local laws. Jobot also prohibits harassment of applicants or employees based on any of these protected categories.
It is Jobot’s policy to comply with all applicable federal, state and local laws respecting consideration of unemployment status in making hiring decisions.
Sometimes Jobot is required to perform background checks with your authorization. Jobot will consider qualified candidates with criminal histories in a manner consistent with any applicable federal, state, or local law regarding criminal backgrounds, including but not limited to the Los Angeles Fair Chance Initiative for Hiring and the San Francisco Fair Chance Ordinance.
By applying for this job, you agree to receive calls, AI‑generated calls, text messages, or emails from Jobot, and/or its agents and contracted partners. You can reply STOP to cancel and HELP for help as needed.