
Senior Software Engineer, Site Reliability Tooling

Remote / Online - Candidates ideally in
San Francisco, San Francisco County, California, 94199, USA
Listing for: Recruiting From Scratch
Remote/Work from Home position
Listed on 2026-01-07
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Cloud Computing, Systems Engineer, IT Support
Salary/Wage Range or Industry Benchmark: 163,600 - 226,400 USD yearly
Job Description & How to Apply Below

Location: United States (Remote, with quarterly onsite sessions in San Mateo, Columbus, or Austin)

Company Stage of Funding: Public / Late-Stage

Office Type: Digital-First (Remote with Quarterly Onsites)

Salary: $163,600 – $226,400 + Bonus + Equity

Company Description

Our client is a leading AI-driven lending marketplace transforming how banks and credit unions evaluate and approve borrowers. Their platform delivers higher approval rates, lower loss rates, and a seamless digital-first experience—enabling more than 80% of applicants to be automatically approved without document uploads.

They operate as a digital-first company with hubs across the U.S., and employees join because they’re motivated by the mission: increasing access to fair, effortless credit by leveraging modern AI and real‑time data.

What You Will Do

As a Senior Software Engineer focused on Site Reliability Tooling, you will play a key role in the reliability, resilience, and observability of large-scale production systems. You’ll design and build tools that empower engineering teams to maintain uptime, deploy safely, and understand system performance across complex microservice architectures.

  • Champion SRE principles across engineering and promote a strong culture of service ownership and reliability.
  • Build internal tooling from scratch to improve observability, monitoring, alerting, and operational workflows.
  • Implement standards to monitor microservices, web apps, mobile apps, machine learning systems, databases, and Kubernetes clusters.
  • Improve incident response processes, including on‑call workflows, retrospectives, and reliability reporting.
  • Automate toil through infrastructure tooling, scripts, and scalable platform services.
  • Help define the long‑term strategy for reliability, disaster preparedness, and operational risk mitigation.
  • Collaborate across multiple engineering groups to deliver enterprise‑wide reliability initiatives.
Minimum Requirements
  • 6+ years combined experience in Software Engineering, Site Reliability Engineering, and/or DevOps.
  • Strong proficiency in Python, Go, and/or JavaScript/TypeScript.
  • Hands‑on experience with Infrastructure-as-Code (Terraform, CDK, CloudFormation).
  • Proven background building internal tooling and applying strong software engineering fundamentals (architecture, testing, TDD).
  • Strong grounding in data structures and algorithms.
  • Experience with on‑call, incident response, and incident management workflows.
  • Experience with modern observability tools such as Datadog, Prometheus, Grafana, and CloudWatch.
  • Experience supporting high‑scale SaaS systems in microservice cloud environments.
  • Ability to work cross‑functionally to drive large engineering initiatives.
  • Data‑driven mindset focused on metrics, reliability, and continuous improvement.
Preferred Qualifications
  • Experience with service mesh technologies.
  • Full‑stack engineering capabilities.
  • Background building tooling for observability or monitoring platforms.
  • Experience leveraging LLMs / GenAI to improve SRE workflows (chatops, auto‑remediation, alert summarization, etc.).
Compensation and Benefits
  • Base Salary: $163,600 – $226,400
  • Bonus: Target bonus included
  • Equity: Included
  • Comprehensive medical, dental, and vision coverage with HSA contributions
  • 401(k) with 100% match up to $4,500 (immediate vesting)
  • Employee Stock Purchase Plan
  • Life and disability insurance
  • Flexible vacation, holidays, sick leave, and safety leave
  • Parental, family care, and military leave
  • Annual wellness, technology, and ergonomic reimbursements
  • Team events, ERGs, volunteer groups
  • When onsite: catered lunches, snacks, and drinks
  • Quarterly team onsite sessions (travel covered)

Salary Range: $142,000-$196,000 base.

Position Requirements
10+ Years work experience