Software Engineer - SRE, Backend; Reliability Engineering
Listed on 2025-12-27
-
Software Development
Software Engineer
Staff Software Engineer - SRE, Backend (Reliability Engineering)
Join to apply for the Staff Software Engineer - SRE, Backend (Reliability Engineering) role at Affirm
.
Affirm is reinventing credit to make it more honest and friendly, giving consumers the flexibility to buy now and pay later without any hidden fees or compounding interest.
Responsibilities- Providing data and visibility to teams and leadership on application performance
- Guiding the development of SLOs
- Driving the Incident Management and Analysis process
- Steering the implementation of Change Management and Deployment practices
- Engaging in service and architectural conversations
- Recommending observability and alerting configurations
- Set technical strategy vision for your team on a multi‑year time scale and help the team tie it together with critical, business‑impacting projects.
- Collaborate across teams in the product development lifecycle with infrastructure, product management, developer experience, and analytics to ensure technical sustainability, risks, and trade‑offs are well understood and managed.
- Act as a force‑multiplier for the team through definition and advocacy of technical solutions and operational processes.
- Take ownership of the team’s operations and availability by ensuring monitoring, triage rotations, playbooks, policies, testing, and alerting are in place to support "keep the lights on" and on‑call efforts.
- Foster a culture of quality and ownership on the team by setting code review and design standards and advocating for them beyond the team through writing and tech talks.
- Help develop talent on the team by providing feedback, guidance, and leading by example.
- 8+ years of experience designing, developing, and launching backend systems at scale using scripting and development languages such as Bash, Python, or Kotlin.
- Extensive track record developing highly available distributed systems using technologies like AWS, MySQL, Spark, and Kubernetes.
- Experience managing, driving, and improving the Incident Lifecycle process from live incident management through retrospectives and post‑incident analysis to provide actionable insights.
- 7+ years experience in Site Reliability or Production Engineering teams.
- Demonstrated curiosity, empathy, and strong opinions held loosely.
- Experience delivering major features, system components, or deprecating existing functionality in a system through definition of a technical and execution plan. Write high quality code that is easily understood and used by others.
- Thriv e in ambiguity and comfortable moving from low‑level language idioms to large‑system architecture.
- Strong verbal and written communication skills supporting effective collaboration with our global engineering team and key stakeholders.
- Either equivalent practical experience or a Bachelor’s degree in a related field.
P
Equity Grade13
LocationRemote - US
Benefits- Health care coverage – affirm covers all premiums for all levels of coverage for you and your dependents.
- Flexible Spending Wallets – generous stipends for spending on technology, food, various lifestyle needs, and family forming expenses.
- Time off – competitive vacation and holiday schedules allowing you to take time off to rest and recharge.
- ESPP – an employee stock purchase plan enabling you to buy shares of affirm at a discount.
We believe It’s On Us to provide an inclusive interview experience for all, including people with disabilities. We are happy to provide reasonable accommodations to candidates in need of individualized support during the hiring process.
By clicking "Submit Application," you acknowledge that you have read affirm's global candidate privacy notice and hereby freely and unambiguously give informed consent to the collection, processing, use, and storage of your personal information as described therein.
Seniority levelMid‑Senior level
Employment typeFull‑time
Job functionEngineering and Information Technology
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).