Technical Senior Manager - Site Reliability Engineering
Listed on 2026-02-17
IT/Tech
IT Project Manager, Cloud Computing, Systems Engineer
Coalfire is an EEO employer. We celebrate diversity and are committed to respecting one another, embracing individual differences, and creating an inclusive environment for all employees.
Coalfire is on a mission to make the world a safer place by solving our clients' hardest cybersecurity challenges. We work at the cutting edge of technology to advise, assess, automate, and ultimately help companies navigate the ever‑changing cybersecurity landscape. We are headquartered in Chicago, Illinois with offices across the U.S. and U.K., and we support clients around the world. We are thought leaders, consultants, and cybersecurity experts, but above all else, we are a team of passionate problem‑solvers who are hungry to learn, grow, and make a difference.
Position Summary: We’re looking for a Technical Senior Manager of SRE to play a central role in the implementation and maintenance of scalable, secure, and high‑performing systems, ensuring our clients' mission‑critical infrastructures remain stable and resilient. If you’re driven by a desire to innovate, excel at operational excellence, and thrive in a collaborative environment, come be part of a team committed to making the world a safer place.
What You'll Do
- Allocate approximately 70% of time to hands‑on engineering tasks, such as developing new deployments, tooling, and automation scripts to address client needs
- Dedicate around 30% of time to leadership duties, including mentoring engineers, ensuring quality deliverables, and managing escalations
- Act as the primary escalation contact for complex technical issues, resolving them promptly to maintain high levels of client satisfaction
- Monitor and uphold quality standards for engineering work, confirming alignment with internal protocols, compliance regulations, and project milestones
- Identify and mitigate risks in partnership with consulting and solutions architecture teams, ensuring regulatory requirements and client expectations are fully addressed
- Coordinate day‑to‑day engineering activities using Agile practices, tracking progress and adjusting resources to meet project goals on schedule
- Help create and implement solutions that improve the practice
- 9+ years in Systems Engineering and Architecture: involving requirements definition, architecture development, systems integration, and testing
- 9+ years in Cloud Computing: designing, implementing, operating, and automating environments within AWS, Azure, or GCP
- 9+ years with Infrastructure‑as‑Code: hands‑on proficiency in Terraform and Ansible for orchestration and automation
- SLA and Issue Management: proven track record of meeting SLAs, particularly regarding availability, response times, and service posture through effective collaboration and escalation processes
- Operational Excellence: demonstrated success driving continuous improvement via KPIs and best practices for operational support
- Governance and Compliance: experience guiding the creation of Infrastructure‑as‑Code solutions, governance models, and alignment with standards such as FedRAMP or other security frameworks
- Team Leadership: proven track record of managing teams (6–8 contributors), focusing on career development, goal setting, project oversight, and daily guidance
- Regulatory Audit Prep: prepared and coached teams for client‑facing compliance audits with third‑party auditors
- Project Definition and Documentation: led efforts to define, plan, and document key Managed Services projects and initiatives; tracked outcomes against established goals
- Managed Services Expertise: familiarity with ticket management systems and meeting SLA requirements in a managed services environment
- Cloud & Automation: extensive experience with AWS, Azure, or GCP; deep knowledge of Terraform, Ansible, GitLab, and CI/CD technologies
- Technical Collaboration: proven ability to collaborate with Site Reliability Engineers and cross‑functional teams, facilitating team problem‑solving and performance improvements
- Soft Skills: strong interpersonal, organizational, and problem‑solving skills; effective at building client trust
- Documentation…