Senior DevOps Engineer
Listed on 2026-04-17
-
IT/Tech
SRE/Site Reliability, Systems Engineer, Cloud Computing
About the role
D-Wave is seeking a Senior Dev Ops Engineer to join our Dev Ops team in New Haven, reporting to the Dev Ops Engineering Manager. In this role, you will design, build, and operate hybrid infrastructure platforms spanning on‑premises environments, Kubernetes clusters, cloud services, and CI/CD pipelines.
Your primary focus will be on developing and operating on‑prem Kubernetes platforms while supporting cloud environments such as AWS. You will also play a key role in advancing our observability capabilities, improving system visibility through logging, metrics, dashboards, and alerting.
Working closely with hardware, software, and Dev Ops teams, you will own systems end‑to‑end and drive improvements in how infrastructure is provisioned, automated, and maintained. This role is ideal for an engineer who enjoys working across the stack, solving complex operational challenges, and building reliable, scalable platforms.
What you’ll do- Design, build, and operate Kubernetes platforms, primarily on‑prem, including cluster architecture, networking, storage, and lifecycle management
- Develop and maintain hybrid infrastructure across on‑prem and cloud environments with a focus on reliability, scalability, and maintainability
- Design and optimize CI/CD pipelines (e.g., Git Hub Actions) to enable automated, low‑touch build and release processes
- Manage AWS infrastructure, including multi‑account environments, IAM, networking, and shared services
- Implement and maintain infrastructure‑as‑code and automation solutions (e.g., Terraform, Ansible)
- Design and operate virtualized platforms and VM‑based workloads
- Define and enforce standards for containerization, including image management, versioning, and security practices
- Build and support containerized applications running on Kubernetes (on‑prem and cloud‑based)
- Develop and mature observability platforms, including metrics, logging, dashboards, and alerting
- Design actionable monitoring and alerting systems to improve reliability and incident response
- Support infrastructure for hardware‑integrated systems, including OS deployment and lifecycle management
- Improve operational efficiency by automating manual processes and enhancing system reliability
- Collaborate cross‑functionally to ensure consistent practices across hybrid environments
- Participate in incident response and root‑cause analysis, driving continuous improvement
- Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience
- 4+ years of experience in Dev Ops, infrastructure, or systems engineering
- Deep, hands‑on experience architecting and operating bare‑metal Kubernetes platforms from the ground up in production environments, including cluster design, networking, scalability, and reliability
- Strong Linux system administration and troubleshooting skills in production environments
- Experience working with AWS, including multi‑account environments
- Experience designing and operating CI/CD pipelines
- Proficiency with infrastructure‑as‑code and automation tools (e.g., Terraform, Ansible)
- Experience with Docker and containerized workloads in production
- Experience with virtualization platforms (e.g., VMware, KVM, Proxmox, Hyper‑V)
- Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, ELK, Zabbix)
- Solid understanding of networking fundamentals (e.g., VLANs, routing, firewalling)
- Ability to own systems end‑to‑end in complex, distributed environments
The base pay range for this role is 124,360 – 186,540 USD per year (New Haven).
Equal Employment Opportunity StatementIt is D‑Wave Systems Inc. policy to provide equal employment opportunity (EEO) to all persons regardless of race, color, religion, sex, national origin, age, sexual orientation, gender identity, genetic information, physical or mental disability, protected veteran status, or any other characteristic protected by federal, state/provincial, local law.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).