×
Register Here to Apply for Jobs or Post Jobs. X

Principal Software Development Engineer - Observability

Job in San Jose, Santa Clara County, California, 95199, USA
Listing for: Expedia, Inc.
Full Time position
Listed on 2026-01-08
Job specializations:
  • Software Development
    Software Engineer, DevOps
Salary/Wage Range or Industry Benchmark: 242000 - 338500 USD Yearly USD 242000.00 338500.00 YEAR
Job Description & How to Apply Below

Expedia Group brands power global travel for everyone, everywhere. We design cutting‑edge tech to make travel smoother and more memorable, and we create groundbreaking solutions for our partners. Our diverse, vibrant, and welcoming community is essential in driving our success.

Why Join Us?

To shape the future of travel, people must come first. Guided by our Values and Leadership Agreements, we foster an open culture where everyone belongs, differences are celebrated and know that when one of us wins, we all win.

We provide a full benefits package, including exciting travel perks, generous time‑off, parental leave, a flexible work model (with some pretty cool offices), and career development resources, all to fuel our employees' passion for travel and ensure a rewarding career journey. We’re building a more open world. Join us.

Principal Software Engineer, Observability

Our Technology Team partners with teams across Expedia Group to create innovative products, services, and tools to deliver high‑quality experiences for travelers, partners, and our employees. A singular technology platform powered by data and machine learning provides secure, differentiated, and personalized experiences that drive loyalty and traveler satisfaction.

As a Principal Engineer, you will be part of an agile development team with deep expertise in cloud, distributed systems, and observability. You will play a pivotal role in crafting the strategic technical goals for our group. The main effort will involve leading the architecture, design, and implementation of a centralized, scalable, and cost‑effective observability platform used by all engineering teams across Expedia.

You will provide technical leadership for a dynamic engineering organization and work alongside talented product managers and other technical leaders to deliver best‑in‑class capabilities to our developer community.

In this role, you will
  • Architect and Build Core Telemetry Pipelines: Lead design and implementation of highly scalable and resilient telemetry pipelines for logs, metrics, and traces. Evolve platform to handle 10x increase in data volume while maintaining performance and cost‑effectiveness.
  • Drive Open Telemetry Adoption: Spearhead strategy, rollout, and support for the Open Telemetry collector across thousands of services. Develop best practices and automated configurations to ensure seamless and consistent data collection.
  • Implement Platform Governance and Optimization: Design and build capabilities for data governance, cost allocation, and resource management within the observability platform. Define and implement SLOs for the platform itself and create tools to help teams manage their observability costs.
  • Elevate the Practice of Observability: Act as a thought leader, driving adoption of observability best practices across the engineering organization. Improve developer experience by unifying tooling (e.g., Grafana, Datadog, Splunk), documentation, and service lifecycle management within internal developer portal.
  • Automate Infrastructure Lifecycle: Author and maintain production‑grade Infrastructure as Code (IaC) using tools like Terraform and/or Crossplane. Eliminate manual toil by automating cluster provisioning, dependency upgrades, and incident remediation workflows.
  • Technical Leadership and Mentorship: Act as a force multiplier. Mentor senior engineers on the team, lead architecture review sessions, and author RFCs to build consensus on significant technical decisions. Your influence will extend beyond the team to application developers and SREs.
  • Production Debugging: Serve as final escalation point for complex, cross‑cutting production incidents related to observability platform, from telemetry agent bugs to data correlation failures in distributed systems.
  • Collaborate and Innovate: Explore and utilize a wide variety of technologies and tools, such as (but not limited to) Go, Java, Python, AWS, Kubernetes, Open Telemetry, Prometheus, Grafana, Datadog, Splunk, Clickhouse.
Minimum Qualifications
  • Bachelor’s or Master’s degree in Computer Science or related technical field, or equivalent practical experience.
  • 10+ years of experience in software…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary