Site Reliability Engineering Manager
Chicago, Cook County, Illinois, 60290, USA
Listed on 2026-01-08
-
IT/Tech
IT Support, Cloud Computing
Overview
Are you ready to make a difference in the world of consumer finance? At Attain Finance, we bring over 50 years of expertise in providing credit solutions across the U.S. and Canada. Our deep roots in the financial industry have empowered us to develop convenient, easily accessible financial services that meet our customers' growing needs.
Join a leading consumer credit lender that thrives on innovation and collaboration, where your contributions are truly valued. Our portfolio includes distinguished brands like Cash Money®, Lend Direct®, Heights Finance, Southern Finance, Covington Credit, Quick Credit, and First Heritage Credit. Each brand is constantly evolving to better serve our customers.
Be part of a dynamic team that is shaping the future of consumer finance. Apply today and take the next step in your career with Attain Finance!
We are seeking a strategic and people-first SRE Manager to lead our Site Reliability Engineering team in building scalable, resilient, and high-performing infrastructure. This role blends technical rigor with empathetic leadership, ensuring our systems are reliable while our engineers are empowered. You will architect operational excellence, drive incident response maturity, and foster a culture of trust, clarity, and continuous improvement.
ResponsibilitiesWhat you will be doing:
- Lead and mentor a team of SREs, fostering growth, accountability, and cross-functional collaboration.
- Drive incident management, postmortem analysis, and root cause remediation.
- Architect scalable monitoring, alerting, and automation frameworks using tools like Grafana, Thousand Eyes, and Go Alert.
- Configure and maintain storage alarms, ensuring proactive capacity management and system health.
- Own the reliability roadmap: SLAs, SLOs, error budgets, and performance metrics.
- Partner with Engineering, Infrastructure, IT Support, and Desktop Services to ensure seamless service delivery.
- Manage and optimize JAMS batch workflows for change tracking, approvals, and operational transparency.
- Champion infrastructure-as-code, CI/CD pipelines, and cloud-native reliability practices
- Oversee the Change Advisory Board (CAB) meetings, ensuring changes are reviewed, documented, and aligned with reliability goals.
Qualifications
- 5+ years of experience in Site Reliability Engineering, Dev Ops, or Infrastructure roles
- 2+ years of experience managing technical teams with a focus on mentorship and collaboration.
- Strong proficiency in observability tools (Grafana, Thousand Eyes), and alerting systems (Go Alert)
- Experience configuring storage alarms in Grafana and Cloud Watch. Familiarity with Azure Monitor.
- Experienced in JAMS workload automation, including failure recovery and job management
- Deep understanding of distributed systems, service-level objectives, and incident response frameworks
- Demonstrated familiarity with Python, SQL, and Power Shell to support oversight of critical automation and tooling maintained by the SRE team.
- Familiarity with ITSM platforms (Ivanti, Service Now) and device management tools (Intune, JAMF)
- Demonstrated experience contributing to or leading CAB processes, with a focus on change control, risk mitigation, and stakeholder communication.
- Excellent communication skills—able to translate technical insights into executive-ready narratives.
- Proven ability to drive cross-functional initiatives and foster a culture of reliability and ownership.
Bonus Skills & Experience
- Proven track record in cost optimization and strategic standardization
- Experience managing hybrid cloud environments and global endpoint fleets
- Familiarity with endpoint telemetry and predictive failure forecasting
- Passion for building inclusive, high-performing teams and mentoring future leaders
Additional Information
- Work Environment:
This position is fully remote. - Candidates must have a reliable internet connection and a suitable home office setup to perform their duties effectively.
- Work Schedule:
This role operates in Eastern or Central Time.
Base Salary: $120,000 - $130,000 USD
The base salary range represents the low and high end of the anticipated salary range for this position based on the U.S. average. The actual…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).