Platform Engineer Lead - Disaster Recovery and Resiliency
Listed on 2026-03-02
-
IT/Tech
Disaster Recovery IT
I can be myself at work.
You are more than a job title. We want you to feel comfortable doing great work and bringing your best, authentic self to everything you do. We value your talents, traditions, and uniqueness—and we're committed to fostering a strong sense of belonging in a respectful workplace.
We intentionally seek diverse perspectives, experiences, and backgrounds, investing in a culture designed to celebrate differences. We believe that belonging leads to better outcomes and a stronger community of associates united by our mission. At Capital, we live our core values every day:
Integrity, Client Focus, Diverse Perspectives, Long-Term Thinking, and Community.
You want to feel recognized r performance will be reviewed annually, and your compensation will be designed to motivate and reward the value that you provide. You'll receive a competitive salary, bonuses and benefits. Your company-funded retirement contribution will factor in salary and variable pay, including bonuses.
I can lead a full life.You bring unique goals and interests to your job and your life. Whether you're raising a family, you're passionate about where you volunteer, or you want to explore different career paths, we'll give you the resources that can set you up for success.
- Enjoy generous time‑away and health benefits from day one, with the opportunity for flexible work options
- Receive 2‑for‑1 matching gifts for your charitable contributions and the opportunity to secure annual grants for the organizations you love
- Access on‑demand professional development resources that allow you to hone existing skills and learn new ones
- As a Platform Engineer Lead - Disaster Recovery and Resiliency, you will be responsible for the operational side of disaster recovery and resilience. As a platform engineering lead, you are developing, implementing, and maintaining resiliency framework and capabilities that application teams can consume via automated product offerings or repeatable patterns to attest to validity and viability of their disaster recovery plans in line with business outcomes.
- You will partner with infrastructure and application teams to design and implement scripts, templates, and workflows that automate their product's disaster recovery. This includes automation for all relevant resiliency elements including disaster recovery provisioning and scaling, configuration management, monitoring and observability, resyncing and reconciliation, and testing.
- You will work and partner closely with the project managers, technical leads, and business stakeholders to identify testing scenarios for potential threats, assess impacts, and design testing solutions to ensure business continuity and minimize risks.
- You will perform detailed evaluations of platform and application resiliency readiness to identify areas of concern.
- You will conduct regular testing, monitoring, and reporting of the resiliency and disaster recovery plans and activities. You will develop the capability to capture the book of record for all disaster recovery related data. You will identify gaps and continuous improvement opportunities.
- You can design and implement data collecting scripts, implement and maintain monitoring tools, and develop front‑end dashboards to monitor the health, performance, and utilization of Capital's recovery environment to enable prompt response when signs dictate.
- You will support Global Risk and their requirements to report to regulators on our disaster recovery effort.
- 7+ years of hands‑on experience in resiliency, disaster recovery, or business continuity for midsize to large enterprises, with proven technical leadership delivering enterprise‑scale DR and resiliency solutions, preferably in regulated or financial services environments. You have a bachelor's degree in computer science, information systems, engineering, or a related field.
- Strong AWS platform engineering expertise, including hands‑on experience with AWS Resiliency Hub and AWS Fault Injector Service, and the…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).