
Senior Software Engineer (Observability Insights)

Job in New York, New York County, New York, 10261, USA
Listing for: CoreWeave
Full Time position
Listed on 2026-06-02
Job specializations:
  • Software Development
    Software Engineer, DevOps
Salary/Wage Range or Industry Benchmark: 125000 - 150000 USD Yearly
Position: Senior Software Engineer (Observability Insights)
Location: New York

Requirements

  • 6+ years of experience in software or infrastructure engineering building production-grade backend systems and distributed APIs.
  • Strong focus on developer-facing infrastructure, with a customer-obsessed approach to SDKs, CLIs, and APIs.
  • Proficient in reliability engineering, including fault-tolerant design, SLOs, error budgets, and multi-tenant system resilience.
  • Familiar with observability systems such as ClickHouse, Loki, VictoriaMetrics, Prometheus, and Grafana.
  • Experienced in agentic applications or LLM-based features, including grounding, tool calling, and operational safety.
  • Comfortable writing production code primarily in Go, with the ability to integrate Python components when needed.
  • Collaborative experience in agile teams delivering end-to-end telemetry-to-insights pipelines.
  • (Desirable) Experience operating Kubernetes clusters at scale, especially for AI workloads.
  • (Desirable) Hands‑on experience with logging, tracing, and metrics platforms in production, with deep knowledge of cardinality, indexing, and query optimization.
  • (Desirable) Experienced in running distributed systems or API services at cloud scale, including event streaming and data pipeline management.
  • (Desirable) Familiarity with LLM frameworks, MCP, and agentic tooling (e.g., LangChain, AgentCore).
Wondering if you’re a good fit? We believe in investing in our people, and value candidates who can bring their own diverse experiences to our teams, even if you aren’t a 100% skill or experience match. Here are a few qualities we’ve found compatible with our team. If some of this describes you, we’d love to talk.
  • You love transforming complex telemetry into actionable insights.
  • You’re curious about agentic interfaces and the future of AI observability.
  • You’re an expert in building scalable, reliable systems that empower developers and customers alike.
What the job involves
  • We are seeking senior engineers to lead our Observability Insights effort, building the product experiences and agentic interfaces that sit on top of our foundational telemetry layer.
  • You will play a pivotal role in enabling CoreWeave and its customers to understand, troubleshoot, and optimize complex AI systems by delivering core building blocks like multi‑tenant APIs, managed Grafana experiences, and MCP‑based tool servers.
  • You’ll collaborate closely with PMs and engineering leadership to shape the end‑to‑end observability experience, providing an outsize opportunity to influence how the world interacts with the forefront of Artificial Intelligence.
  • Design and execute the development of highly available, multi‑tenant APIs that expose telemetry and derived insights in a developer‑obsessed way.
  • Modernize how users interact with data by building agentic experiences, including MCP servers, agentic tools and API gateways that safely expose foundational telemetry.
  • Build agentic observability capabilities that enable workflows for guided debugging, workload optimization, and incident summarization, empowering CoreWeavers and customers alike.
  • Develop and enforce best practices regarding the health of telemetry data pipelines, specifically focused on correlation primitives and aggregation services for RCA and performance detection.
  • Improve the performance, security, reliability, and scalability of insights services, including SLO ownership and latency optimization, while participating in the team’s on‑call rotation.
  • Collaborate closely with internal engineering teams, applying a platform‑as‑a‑product mindset to understand their needs and embed observability best practices and custom tooling into their systems.
  • Contribute to the overall observability strategy, influencing the direction of our platform.
Position Requirements
10+ Years work experience