×
Register Here to Apply for Jobs or Post Jobs. X

Senior Software Engineer, Site Reliability Tooling

Remote / Online - Candidates ideally in
Columbus, Franklin County, Ohio, 43224, USA
Listing for: Upstart
Remote/Work from Home position
Listed on 2025-12-13
Job specializations:
  • IT/Tech
    Cloud Computing, SRE/Site Reliability, IT Support, Systems Engineer
Salary/Wage Range or Industry Benchmark: 163600 - 226400 USD Yearly USD 163600.00 226400.00 YEAR
Job Description & How to Apply Below

About Upstart

Upstart is the leading AI lending marketplace partnering with banks and credit unions to expand access to affordable credit. By leveraging Upstart's AI marketplace, Upstart-powered banks and credit unions can have higher approval rates and lower loss rates across races, ages, and genders, while simultaneously delivering the exceptional digital-first lending experience their customers demand. More than 80% of borrowers are approved instantly, with zero documentation to upload.

Upstart is a digital-first company, which means that most Upstarters live and work anywhere in the United States. However, we also have offices in San Mateo, California;
Columbus, Ohio;
Austin, Texas; and New York City, NY (opening Summer 2026).

Most Upstarters join us because they connect with our mission of enabling access to effortless credit based on true risk. If you are energized by the impact you can make at Upstart, we’d love to hear from you!

The Team

Upstart’s Site Reliability Engineering (SRE) team owns the reliability, resiliency, and observability of Upstart’s production systems. The SRE team builds tooling and automation to monitor the health of our infrastructure and create a fast, reliable, and productive environment for other engineers and a world-class experience for our customers. SRE defines Upstart’s strategy for technology operations risk mitigation, which includes disaster planning and on-call procedures.

We use data-driven approaches to drive our decisions, and provide reports and insights to the business to improve visibility into the system and customer experience.

As a Senior Software Engineer focused on Site Reliability Tooling
, your work will directly impact the success of the SRE team and all of Upstart. Your expertise will inform the team’s direction, and your work with other SREs and Upstart engineers will make Upstart’s systems as effective as possible for our customers. SRE at Upstart is ever-changing, and you will be a primary contributor in shaping our future path.

How you’ll make an impact:
  • Embody and share SRE principles at Upstart
  • Exercise state-of-the-art SRE practices throughout the company
  • Uphold a culture of visibility, ownership, and responsibility around service reliability
  • Implement standards for monitoring microservices, web apps, mobile apps, databases, Kubernetes clusters, and machine learning platforms, in a fast-paced environment
  • Improve incident response practices, both within SRE and throughout the company
  • Automate away toil that make sense to be automated
What we’re looking for:

Minimum requirements:
  • Minimum of 6 years combined experience between Software Engineering, Site Reliability, and/or Dev Ops Engineering including CI/CD, TDD, internal tooling, observability, and other agile development practices
  • Proficiency coding Python, Go, JavaScript/Type Script
  • Proficiency with Infrastructure as Code (Terraform, CDK, Cloud formation, etc.)
  • Software engineering background with experience building internal tooling from scratch, and other agile development techniques
  • Strong software design & architecture skills
  • Fundamentally sound with data structures & algorithms
  • Experience with on-call and incident management environments
  • Experience with observability, monitoring, and reporting tools (e.g., Datadog, Prometheus, etc.)
  • Experience supporting SaaS software in a microservice-oriented cloud environment
  • Ability to work with multiple teams for enterprise-wide deliverables
  • Data/metrics-driven mindset
Preferred qualifications:
  • Experience with service mesh
  • Full Stack development skills
  • Experience building tooling for an observability platform
  • Experience leveraging LLM/GenAI to improve SRE efficiency and processes
Position Location

This role is available in the following locations:
Remote, San Mateo, Columbus, Austin

Time Zone Requirements

This team operates across all U.S. time zones.

Travel Requirements

This team has regular on-site collaboration sessions. These occur 3 days per quarter at an Upstart office. If you need to travel to make these meetups, Upstart will cover all travel related expenses.

What you'll love:
  • Competitive Compensation (base + bonus & equity)
  • Comprehensive medical, dental, and…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary