Senior Engineering Manager - Observability
Listed on 2026-04-23
IT/Tech
Systems Engineer, SRE/Site Reliability, Cloud Computing
The pay range is $ - $. Pay is based on several factors which vary based on position. These include labor markets and in some instances may include education, work experience and certifications. In addition to your pay, Target cares about and invests in you as a team member, so that you can take care of yourself and your family. Target offers eligible team members and their dependents comprehensive health benefits and programs, which may include medical, vision, dental, life insurance and more, to help you and your family take care of your whole selves.
Other benefits for eligible team members include 401(k), employee discount, short-term disability, long-term disability, paid sick leave, paid national holidays, and paid vacation. Find competitive benefits from financial and education to well-being and beyond at
Working at Target means shaping experiences for hundreds of millions of guests while operating one of the largest retail technology platforms in the world.
We are a Fortune 50 company with more than 450,000 team members globally. Technology is one of Target’s top four enterprise priorities. It plays an outsized role in powering growth, speed, reliability, and innovation across the company. From merchandising and supply chain to stores and digital, our technology platforms directly enable the most critical outcomes for 2026 and beyond. Target Tech builds and operates the intelligence-powered architecture that fuels these outcomes.
Our teams design scalable platforms, modern data ecosystems, and resilient infrastructure that allow thousands of engineers to deliver secure, high-performing systems at enterprise scale.
Reliability is foundational to delivering on Target’s purpose. Every register scan, Drive Up order, search result, inventory update, and fulfillment workflow depends on systems that are resilient, measurable, and continuously improving. The Observability team builds and operates the enterprise platform that enables engineering teams to understand, measure, and improve the reliability and performance of their products. We provide standardized telemetry, logging, tracing, metrics, alerting, system maps, and operational insights across more than 20,000 services.
Our Platform Enables
- Enterprise-scale metrics, logs, and traces built on OpenTelemetry standards
- Real-time system maps and dependency graphs across complex service ecosystems
- SLO management, error budget tracking, and reliability showback
- Integrated alerting, on-call alignment, and actionable operational insights
- AI-driven root cause analysis and automated remediation capabilities
We operate at massive scale in 24/7 production environments. Technologies commonly used across the team include Golang, Kubernetes, Kafka, ClickHouse, InfluxDB, Grafana, and React, which support high-volume telemetry pipelines.
We are moving beyond traditional monitoring. Our ambition is to build an intelligent, agent-enabled observability platform that proactively detects degradation, explains system behavior, and recommends or executes recovery actions before guests are impacted.
The Role
As a Senior Engineering Manager, you will lead one of three engineering teams delivering enterprise observability capabilities. You will partner closely with peer managers, product, and a Principal Engineer to shape the technical roadmap and deliver a unified, intelligent reliability platform for Target Tech.
You Will Be Accountable For
- Building and leading a high-performing team of platform and full-stack engineers
- Delivering mission-critical, highly available observability systems
- Scaling telemetry pipelines and analytics across thousands of applications
- Embedding reliability and operational excellence directly into developer workflows
- Representing your team in cross-domain architecture and enterprise planning forums
At Target scale, you will lead engineers solving distributed systems and reliability challenges that are not documented in standard playbooks. Success in this role requires strong technical judgment, clear communication, operational discipline, and the ability to drive clarity in ambiguous environments.
Why This Role…