Solutions Architect, Technology Operations & Service Delivery
Listed on 2026-06-22
-
IT/Tech
Cybersecurity, Cloud Computing: Infrastructure & Operations, IT Support, Systems Engineer
Antares Capital is seeking a Vice President, Solutions Architect - Technology Operations & Service Delivery to lead and evolve our Production Support Organization, spanning Level 1 and Level 2 operations. This is a strategic, hands‑on leadership role responsible for ensuring the reliability, stability, and performance of our technology platforms while building a world‑class operational structure driven by automation, data, and continuous improvement.
You will own the end‑to‑end health of our production ecosystem — partnering with Engineering, Infrastructure, Cybersecurity, and Business stakeholders to ensure our systems are resilient, observable, and scalable. You will establish strong KPI frameworks, drive automation‑first thinking, manage third‑party vendor relationships, and act as the command center during incidents and outages. This role is ideal for a leader who excels at translating operational discipline into strategic execution plans and seeing them through to closure.
- L1 & L2 Production Support Leadership:
Build, lead, and mature both L1 and L2 support teams, establishing clear escalation paths, ownership accountability, and a high‑performance service culture. - Own end‑to‑end incident monitoring, triage, and resolution across all production environments.
- KPI Framework & Operational Governance:
Design and operationalize a comprehensive KPI and SLA/SLO framework covering incident volume, MTTR, MTTD, first‑call resolution, and system availability. - Present regular metrics and trending analysis to Technology leadership.
- Outage Command & Recurring Issue Resolution:
Serve as the executive incident commander during major outages — rapidly mobilizing cross‑functional teams, managing communications, and driving root cause analysis. - Develop and execute structured remediation plans that address recurring patterns and systemic dependencies to prevent recurrence.
- Automation & Modern Tooling:
Champion an automation‑first culture by identifying and implementing AI‑assisted and scripted solutions to reduce manual toil, accelerate resolution, and improve observability. - Maintain awareness of emerging Web and AI technologies and assess their applicability to production operations.
- Third‑Party Vendor Management:
Own relationships with key technology vendors and managed service providers. Integrate external support processes seamlessly with Antares’ internal workflows, hold vendors accountable to contractual SLAs, and lead escalations to vendor executive teams when needed. - Cross‑Functional Relationship Building:
Build trusted partnerships with Development, Infrastructure, and Cybersecurity teams to align on release readiness, change management, security controls, and platform health. - Act as the connective tissue between operational teams and strategic technology initiatives.
- Strategic Execution Planning:
Translate operational gaps and technology opportunities into actionable roadmaps and execution plans. - Own program milestones end‑to‑end, driving accountability across teams and delivering measurable outcomes on schedule.
- Resiliency & Recovery:
Collaborate with Engineering teams to ensure fault‑tolerant system design meeting RTO/RPO targets. - Oversee replay and recovery capabilities for critical business processes, and maintain runbook documentation to reduce dependency on tribal knowledge.
- Performance, Scalability & Observability:
Partner with Dev and Infrastructure teams to ensure systems scale for business growth, handle peak volumes, and employ autoscaling for cost efficiency. - Drive adoption of Datadog or equivalent platforms for end‑to‑end tracing, proactive alerting, and fast root‑cause identification.
- ITSM & Workflow Tooling:
Configure and manage Service Now for incident, change, and problem management. - Leverage Control‑M or equivalent schedulers to maintain job chain integrity and proactively identify scheduling risks.
- 10+ years of technology operations experience, including at least 5 years leading L1 and/or L2 production support teams in a complex, multi‑platform financial services or enterprise environment.
- Demonstrated ability to design and operate KPI/SLA frameworks that drive…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).