Distinguished Engineer - Business Continuity, Governance, and Platform Resilience
Listed on 2026-02-14
IT/Tech
Systems Engineer, Cloud Computing
At GEICO, we offer a rewarding career where your ambitions are met with endless possibilities.
Every day we honor our iconic brand by offering quality coverage to millions of customers and being there when they need us most. We thrive through relentless innovation to exceed our customers’ expectations while making a real impact for our company through our shared purpose.
When you join our company, we want you to feel valued, supported and proud to work here. That’s why we offer The GEICO Pledge:
Great Company, Great Culture, Great Rewards and Great Careers.
GEICO is seeking an experienced Distinguished Engineer with a passion for building high-performance, low-maintenance, zero-downtime platforms and applications. You will help drive our enterprise transformation by establishing engineering excellence as a core mission, with a specific focus on organizational resilience, strategic risk management, and rigorous technical governance. This role demands mastery of reliability, availability, software engineering, and best practices in Business Continuity and Disaster Recovery (BCDR).
Position Description
Our Distinguished Engineer works with Principal and Senior Engineers to innovate and build new systems, dramatically improve and enhance existing systems, and identify new opportunities to apply deep knowledge to solve critical enterprise problems. You will lead the technical strategy and execution of a roadmap that increases product delivery velocity while ensuring absolute platform resilience. The ideal candidate combines a deep understanding of technology, risk management, Site Reliability Engineering (SRE) principles, and strategic planning to design and implement resilient systems that safeguard our business from potential threats, enforce organizational compliance, and ensure predictable operation.
Position Responsibilities
Enterprise Resilience and BCDR Strategy
This domain focuses on establishing the core requirements for enterprise survival and recovery from major disruptions. The Distinguished Engineer is responsible for driving the technical BCDR strategy, ensuring it aligns with critical business and regulatory goals. This involves conducting comprehensive risk assessments, leading the architecture of highly resilient systems (embedding BCDR early in the design phase), and defining organization-wide Recovery Time Objective (RTO) and Recovery Point Objective (RPO) metrics.
A key accountability is validating these recovery targets by overseeing regular BCDR simulations and Chaos Engineering programs.
The role is centered on institutionalizing technical excellence across the organization. The Distinguished Engineer serves as a key leader within the Architecture Review Board, setting and rigorously enforcing architectural standards, policies, and blueprints. Responsibilities include ensuring that all major technology investments are strategically aligned with business objectives and compliance requirements, enforcing domain consistency across architecture layers, and driving strategic modernization efforts to maximize scalability and coherence.
Operational Excellence and Case Management
This function turns strategic resilience into operational reality by applying Site Reliability Engineering (SRE) principles. The Distinguished Engineer leads the SRE strategy by establishing and monitoring Service Level Objectives (SLOs) and error budgets to balance feature velocity with mandatory stability. Key duties include developing and maintaining comprehensive incident response plans, runbooks, and playbooks; driving automation to achieve low Mean Time To Resolution (MTTR); and analyzing post-incident results to eliminate the architectural flaws that reduce Mean Time Between Failures (MTBF).
Leadership and Strategic Influence
As the senior technical individual contributor, the Distinguished Engineer is expected to exert deep organizational and financial influence. The role requires acting as a trusted advisor to executive stakeholders on resilience and governance matters, while also serving as a role model and mentor who coaches senior and principal engineering talent. Finally, the Distinguished Engineer analyzes cost and forecast data, playing a critical role in strategic financial stewardship, particularly in Cloud Spend Optimization related to stateful services and data persistence.
Qualifications
- Fluency and specialization in software development and best practices using modern programming languages.
- Deep knowledge of SRE practices, methodologies, and principles, along with a solid understanding of cloud-based compute, network, and storage technologies.
- Strong background in incident management (a core function of Case Management in platform operations), including the ability to create incident response playbooks, runbooks, and perform rigorous post-incident analysis to drive continuous improvement in reliability and availability.
- Expertise in distributed systems architecture, replication topologies, and…