Principal Site Reliability Engineer
Listed on 2026-06-17
IT/Tech
SRE/Site Reliability, Cloud Computing: Infrastructure & Operations, Systems Engineer
This range is provided by Priority. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.
Base pay range: $/yr - $/yr
Job title: Principal Site Reliability Engineer
Reports to: Director, Site Reliability Engineering
Location: Remote
Grade: 21
About Priority
Priority Technology Holdings, Inc. is a leading financial technology company on a mission to deliver a personalized, easy‑to‑adopt financial toolset that accelerates cash flow and optimizes working capital for businesses. Our vision is to eliminate the barriers to unlocking revenue, empowering businesses to grow faster and operate smarter.
We achieve this through the Priority Commerce Engine, an innovative platform that combines payables, acquiring, and banking and treasury solutions. This unified approach allows businesses to streamline financial operations, reduce unnecessary costs, and uncover new revenue opportunities.
At Priority, we’re driven by results. We expect our people to be known for results — bringing expertise, momentum, and relentless focus to every challenge, helping our clients and each other thrive.
About the Role
As a Principal Site Reliability Engineer, you will be a senior technical leader ensuring the reliability, scalability, and operational excellence of Priority’s mission‑critical financial technology platform. This role blends hands‑on engineering with leadership, mentorship, and cross‑functional influence.
You will partner with product and infrastructure teams to ensure services are observable and resilient. You will resolve incidents, automate operational workflows, and set standards that raise the bar for reliability across the organization.
This is an ideal role for an engineer who thrives at the intersection of software, systems, and operations, and who wants to shape the reliability culture at scale.
Responsibilities
- Define and drive the SRE strategy, aligning reliability practices with Priority’s long‑term business and technology goals.
- Lead incident response and retrospectives, driving systemic reliability improvements across multiple product work streams.
- Own cross‑cutting platform concerns such as observability, monitoring, alerting, performance, scalability, and resiliency.
- Partner with engineering leadership and product teams to embed reliability best practices into design, planning, and delivery.
- Automate detection, resolution, and recovery for recurring production issues, reducing toil and increasing delivery velocity.
- Evaluate and introduce new technologies, frameworks, and practices to improve reliability, cost efficiency, and performance.
- Mentor and coach engineers across levels, multiplying SRE skills and mindset across the organization.
- Delivery & Execution: Services are deployed frequently and safely; incidents are resolved quickly with minimal customer impact; operational load is reduced through automation and reliable processes.
- Product Quality & Reliability: Systems consistently meet availability, latency, error‑rate, and performance SLOs; incidents are rare, well‑mitigated, and quickly remediated; redundancy and fault tolerance are built into all layers of the stack.
- Collaboration & Knowledge Sharing: Teams design with reliability in mind; incidents are resolved faster with shared playbooks; engineers across disciplines feel confident in operational practices; blameless postmortems drive organizational learning.
- Business & Product Impact: Reliability improvements directly enhance customer experience, reduce churn, and support adoption of new products and features without sacrificing stability.
- Professional Growth & Team Contribution: You are recognized as a mentor and thought leader, elevating engineering maturity across the org; teams continuously improve reliability practices, automation, and resilience.
Required:
- 10+ years of professional software engineering / systems engineering experience, including 5+ years in SRE or reliability‑focused roles.
- Proven leadership experience influencing reliability practices across multiple teams or domains.
- Strong background in distributed systems, cloud infrastructure (AWS preferred),…