About Off Sec
Founded in 2006 by the creators of Kali Linux, Off Sec (formerly known as Offensive Security) is the leading provider of continuous professional and workforce development, training, and education for cybersecurity practitioners. Off Sec’s distinct pedagogy and practical, hands‑on learning help organizations fill the infosec talent gap by training their teams on today’s most critical skills.
Become a part of our global presenceWith team members in over 40 countries, we believe in inspiring people of all backgrounds and communities. The Off Sec team is composed of diverse, internationally published authors, conference speakers, and seasoned information technology professionals from both the private sector and governments worldwide.
Excited about our mission and what we do? Apply and join us!
About the JobOff Sec is seeking an experienced Senior SRE to join our team and lead the design and implementation of complex, scalable lab environments that power our industry‑leading cybersecurity training and certification programs. This senior‑level position will work closely with Security Researchers and Platform Architects to architect sophisticated labs and vulnerable machine environments across hybrid cloud and on‑premises infrastructure, enabling hands‑on learning experiences for cybersecurity professionals worldwide.
The ideal candidate will bring deep expertise in Open Stack and modern SRE practices, with proven experience in large‑scale infrastructure migrations and cost optimization. You’ll design resilient, scalable, and secure infrastructure that deploys lab environments supporting thousands of concurrent users while maintaining the realistic attack scenarios our students depend on.
Duties and Responsibilities- Design and architect complex global data centers for labs supporting vulnerable machines and realistic attack scenarios using Open Stack
- Develop scalable infrastructure solutions across hybrid cloud and on‑premises environments
- Design secure hosting networks and network topologies that can be used to support realistic offensive cyber activities.
- Establish infrastructure standards, patterns, and best practices for lab environment deployment
- Create architectural solutions that reduce infrastructure costs while improving capabilities and performance
- Implement network isolation for thousands of concurrent user lab instances
- Optimize lab deployment speed and resource utilization for peak performance
- Create infrastructure supporting the deployment of concurrent vulnerable machine instances at scale
- Design workspace‑based deployment models enabling team collaboration and private lab sessions
- Partner closely with Lead Platform and Content Engineers to proactively identify and solve infrastructure requirements
- Provide strategic technical guidance and mentorship to development and operations teams
- Lead architectural reviews and challenge requirements to propose optimal technical solutions
- Drive adoption of infrastructure‑as‑code and automated deployment practices
- Identify process improvements and optimization opportunities before being asked
- Develop infrastructure automation using known Infrastructure as Code frameworks
- Create self‑service capabilities for Content Engineers to deploy and manage lab resources efficiently
- Implement comprehensive monitoring, logging, and observability solutions for lab environments
- Establish disaster recovery and business continuity procedures with minimal downtime requirements
- Automate repetitive tasks to help reduce toil
- Optimize application and infrastructure performance through automation and tuning
- Write runbooks to automate repetitive tasks using Ansible and Terraform
- Serve as a knowledge resource for the rest of the team on Ansible and Terraform
- Evaluate new and emerging products, technologies and make recommendations concerning the introduction of new technologies
- Conduct ongoing research into relevant technology stacks and architectural patterns, assessing their potential impact and value for internal use
- Assist in monitoring performance to address errors and bottlenecks
- Respond to and resolve…
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: