Senior Software Engineer, Compute Architecture
Listed on 2026-06-18
-
IT/Tech
Systems Engineer, IT Infrastructure, SRE/Site Reliability
About the Role
As a Senior Software Engineer within our Compute Architecture organization, you will help build the software control plane for hardware lifecycle management across large-scale GPU data centers. The METALDEV team builds Go-based distributed services that bring infrastructure online, monitor production hardware health, automate safe operational workflows, and give operators the observability and control needed to manage GPU servers and rack-scale systems with reliability and confidence.
This is a software-first role at the intersection of distributed systems, production reliability, and hardware-aware automation, ideal for engineers who want their code to operate real-world infrastructure at massive scale.
- Design, build, and operate Go-based services that manage the lifecycle of large-scale GPU data center infrastructure.
- Build automation for data center bring-up, hardware discovery, health monitoring, remediation, and production operations.
- Develop reliable APIs, services, and workflows for managing BMCs, firmware state, server health, and rack-level infrastructure.
- Improve observability, alerting, and operational tooling so production issues can be detected, understood, and resolved quickly.
- Translate incidents and hardware failure modes into software improvements that make the platform more resilient.
- Partner with hardware-adjacent, infrastructure, operations, and software teams to design systems that work safely at fleet scale.
- 5+ years of experience building and operating infrastructure or backend systems.
- Bachelor’s or Master’s degree in Computer Science or a related field, or equivalent practical experience.
- Strong proficiency in Go for building production services and tools.
- Experience designing and building gRPC and REST APIs.
- Experience with Kubernetes and containerized workloads in production environments.
- Familiarity with observability tooling such as Prometheus and Grafana.
- Experience working with GPU-based systems.
- Experience with low-level hardware management such as BMCs or Redfish.
- Experience operating large-scale distributed systems or high-throughput infrastructure.
- Experience collaborating with or contributing to open-source projects (for example, Go, Redfish).
We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren t a 100% skill or experience match. Here are a few qualities we’ve found compatible with our team. If some of this describes you, we’d love to talk.
- You enjoy working close to the hardware and are curious about how GPUs, servers, and data centers fit together.
- You thrive in infrastructure environments where reliability, performance, and automation matter as much as features.
- You like collaborating across hardware, platform, and product teams to solve complex, ambiguous problems.
At Core Weave, we work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. We’re not afraid of a little chaos, and we’re constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values:
- Be Curious at Your Core
- Act Like an Owner
- Empower Employees
- Deliver Best-in-Class Client Experiences
- Achieve More Together
We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and enables the development of innovative solutions to complex problems. As we get set for takeoff, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too.
Come join us!
The base salary range for this role is $182,000 to $242,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).