Manager, Engineering - Dev Ops/SRE; Hybrid
Listed on 2026-06-14
-
IT/Tech
SRE/Site Reliability, Cloud Computing: Infrastructure & Operations
As a global leader in cybersecurity, Crowd Strike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn't changed - we are here to stop breaches, and we have redefined modern security with the world's most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day and this traffic is growing daily.
Our customers span all industries, and they count on Crowd Strike to keep their businesses running, their communities safe and their lives moving forward. We are also a mission-driven company. We cultivate a culture that gives every Crowd Striker both the flexibility and autonomy to own their careers. We are always looking to add talented Crowd Strikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other.
Ready to join a mission that matters? The future of cybersecurity starts with you.
The Role
At Crowd Strike, Site Reliability Engineering (SRE) is at the forefront of ensuring the reliability and scalability of our cloud-native security platform. In this role, you will have the opportunity to manage a team of talented engineers, providing technical leadership on key projects and empowering them to excel in their roles. Our culture of intellectual curiosity and problem‑solving is central to our success bring together individuals with varied backgrounds and perspectives, fostering collaboration and innovation in a blame‑free environment.
WhatYou’ll Do
As an SRE Engineering Manager, you will lead a team responsible for managing the complex challenges of scale unique to Crowd Strike, leveraging your expertise in software engineering, systems design, and automation. You will play a critical role in ensuring that our services maintain the highest levels of reliability, uptime, and performance, meeting the needs of our customers while continuously improving our systems.
You’ll have the opportunity to work on meaningful projects while providing support and mentorship to your team, enabling them to learn, grow, and make a lasting impact in the cybersecurity landscape. The ideal candidate will have hands‑on experience in cloud solutions development, strong leadership skills, and a collaborative approach to working with cross‑functional teams. Given our remote‑first culture, exceptional verbal and written communication skills are essential for effective collaboration with engineering teams and colleagues worldwide.
Prior experience in the security industry is not required for this role.
- Proven track record of building, growing, and retaining high‑performing SRE/Platform engineering teams in a fast‑paced, high‑growth environment.
- 10+ years of software engineering experience with significant focus on reliability engineering, platform infrastructure, and production operations at scale.
- 3+ years of hands‑on management experience overseeing SRE/Platform engineering teams, including incident command and reliability ownership.
- Deep understanding of SRE principles including SLOs, SLAs, SLIs, error budgeting strategies applied to large‑scale distributed systems.
- Driving system reliability by blending software engineering principles with AI‑driven automation, moving from reactive firefighting to proactive, automated operations.
- Proficiency in at least one cloud environment (AWS, Azure, GCP) with emphasis on multi‑region architecture, cloud‑native reliability patterns, and security‑first cloud design.
- Proven experience owning reliability for high‑throughput distributed systems processing millions of events per second, including capacity planning, traffic management, and load shedding strategies.
- Strong incident management background including leading major incident response, facilitating blameless post‑mortems, and driving systemic reliability improvements.
- Demonstrated ability to build, operationalize, and maintain highly scalable, security‑critical systems with zero tolerance for data loss or downtime.
- Bachelor’s degree in Computer Science or related field, or equivalent work experience.
- Ability to…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).