×
Register Here to Apply for Jobs or Post Jobs. X

Senior Software Engineer - Site Reliability Engineering; Remote

Remote / Online - Candidates ideally in
Warner Robins, Houston County, Georgia, 31088, USA
Listing for: Home Depot
Remote/Work from Home position
Listed on 2026-05-30
Job specializations:
  • Software Development
    DevOps, Cloud Engineer - Software, Software Engineer
Salary/Wage Range or Industry Benchmark: 125000 - 150000 USD Yearly USD 125000.00 150000.00 YEAR
Job Description & How to Apply Below
Position: Senior Software Engineer - Site Reliability Engineering (Remote)

Position

Purpose:

The Senior Software Engineer for Site Reliability Engineering (Store Systems Enablement) builds and operates the internal platforms that keep Home Depot's store systems observable, reliable, and automated. This is a platform engineering role: you will design, develop, and maintain the tools that hundreds of development and reliability teams depend on, not just use them.

The team owns and operates a portfolio of reliability platforms, including a custom-built synthetic testing system that runs inside physical Home Depot stores, operational automation infrastructure serving dozens of teams, and the full observability stack (logging, tracing, and profiling) for Store Systems. You will write code, deploy infrastructure, tune distributed systems, and reduce operational toil through automation, including AI‑assisted workflows.

Key Focus Areas:
  • Platform Development: Build and extend internal reliability tools using Kubernetes, Terraform, and modern infrastructure-as-code patterns on Google Cloud Platform.

  • Observability Operations: Deploy, configure, and maintain production logging, tracing, and profiling systems. Own the SLO/CUJ platform that enables multi-window, multi-burn-rate alerting and automated tracking dashboards for RE teams across Store Systems.

  • Toil Reduction & Automation: Identify repetitive operational work and engineer it away. Build self-service capabilities, Copilot skills, and automation pipelines so teams can operate independently.

  • SLO & CUJ Enablement: Maintain and extend the platform that powers SLO and Critical User Journey definition across the organization. Educate RE teams on what good SLOs and CUJs look like, assist with onboarding, and build automation and documentation so teams can self‑serve. You will have strong opinions on the right way to measure reliability and the tooling to back them up.

  • Synthetic Monitoring: Extend our in-store synthetic testing platform: onboard teams, enable them to write and deploy their own tests, and evolve the platform's orchestration, alerting, and self‑service capabilities.

  • Incident Response & Resilience: Participate in on-call rotation for observability infrastructure. Lead and contribute to blameless post‑mortems. Design and execute destructive tests to validate platform resilience.

You will work on a small, high‑impact team where the work is varied: some weeks you’re writing Terraform and Helm charts, others you’re debugging Loki query performance or building a Copilot skill to automate a support workflow. You will be expected to own problems end‑to‑end, from investigation through implementation to production deployment.

Key Responsibilities:
  • 50% Delivery and Execution – Develops, tests, deploys, and maintains software, with a clear understanding of the value the software is to provide;
    Takes on new opportunities and tough challenges with a sense of urgency, high energy and enthusiasm;
    Consistently achieves results, even under tough circumstances;
    Develops test suites (functional, destructive, etc) to enable success, rapid deployment of code to production;
    Takes a broad view when approaching issues; using a global lens.

  • 20% Learns and Grows – Learns through successful and failed experiment when tackling new problems;
    Actively seeks ways to grow and be challenged using both formal and informal development channels.

  • 20% Plans and Aligns – Collaborates with other team members in agile processes;
    Creates new and better ways for the organization to be successful;
    Works the Product Team to ensure user stories are valuable, developer ready, easy to understand and testable;
    Delivers multi‑mode communications that convey a clear understanding of the unique needs of different audiences;
    Adapts approach and demeanor in real time to match the shifting demands of different situations;
    Relates openly and comfortably with diverse groups of people.

  • 10% Supports and Enables – Helps grow junior engineers by providing guidance on modern software development frameworks, and leading technical discussions.

Direct Manager / Direct Reports:
  • This position typically reports to Software Engineer Manager or Sr. Manager.

  • This position has 0 Direct Reports.

Travel…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary