Junior Site Reliability Engineer | Remote
Chicago, Cook County, Illinois, 60290, USA
Listed on 2025-12-20
-
IT/Tech
Cloud Computing, Cybersecurity, Systems Engineer, IT Support
Coalfire is on a mission to make the world a safer place by solving our clients’ toughest cybersecurity challenges. We work at the cutting edge of technology to advise, assess, automate, and ultimately help companies navigate the ever-changing cybersecurity landscape. We are headquartered in Denver, Colorado with offices across the U.S. and U.K., and we support clients around the world.
But that’s not who we are – that’s just what we do.
We are thought leaders, consultants, and cybersecurity experts, but above all else, we are a team of passionate problem-solvers who are hungry to learn, grow, and make a difference.
And we’re growing fast.
We’re looking for a Site Reliability Engineer I to support our Managed Services team.
Position SummaryAs a Junior Site Reliability Engineer at Coalfire within our Managed Services (CMS) group, you will be a self-starter, passionate about cloud technology, and thrive on problem solving. You will work within major public clouds, utilizing automation and your technical abilities to operate the most cutting-edge offerings from Cloud Service Providers (CSPs). This role directly supports leading cloud software companies to provide seamless reliability and scalability of their SaaS product to the largest enterprises and government agencies around the world.
This can be a remote position (must be located in the United States).
What You’ll Do- Become a member of a highly collaborative engineering team offering a unique blend of Cloud Infrastructure Administration, Site Reliability Engineering, Security Operations, and Vulnerability Management across multiple clients.
- Coordinate with client product teams, engineering team members, and other stakeholders to monitor and maintain a secure and resilient cloud-hosted infrastructure to established SLAs in both production and non-production environments.
- Innovate and implement using automated orchestration and configuration management techniques. Understand the design, deployment, and management of secure and compliant enterprise servers, network infrastructure, boundary protection, and cloud architectures using Infrastructure-as-Code.
- Create, maintain, and peer review automated orchestration and configuration management codebases, as well as Infrastructure-as-Code codebases. Maintain IaC tooling and versioning within Client environments.
- Implement and upgrade client environments with CI/CD infrastructure code and provide internal feedback to development teams for environment requirements and necessary alterations.
- Work across AWS, Azure and GCP, understanding and utilizing their unique native services in client environments.
- Configure, tune, and troubleshoot cloud-based tools, manage cost, security, and compliance for the Client’s environments.
- Monitor and resolve site stability and performance issues related to functionality and availability.
- Work closely with client Dev Ops and product teams to provide 24x7x365 support to environments through Client ticketing systems.
- Support definition, testing, and validation of incident response and disaster recovery documentation and exercises.
- Participate in on-call rotations as needed to support Client critical events, and operational needs that may lay outside of business hours.
- Support testing and data reviews to collect and report on the effectiveness of current security and operational measures, in addition to remediating deviations from current security and operational measures.
- Maintain detailed diagrams representative of the Client’s cloud architecture.
- Maintain, optimize, and peer review standard operating procedures, operational runbooks, technical documents, and troubleshooting guidelines
- BS or above in related Information Technology field or equivalent combination of education and experience
- 2+ years experience in 24x7x365 production operations
- Fundamental understanding of networking and networking troubleshooting
- 2+ years experience installing, managing, and troubleshooting Linux and/or Windows Server operating systems in a production environment
- 2+ years experience supporting cloud operations and automation in AWS, Azure or GCP (and aligned certifications)
- 2+ years…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).