Senior Systems Engineer, OS Automation

Job in New York, New York County, New York, 10261, USA
Listing for: CoreWeave
Full Time position
Listed on 2026-04-28
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing: Infrastructure & Operations, AI Engineer (Applied/Software), Data Engineering
Salary/Wage Range or Industry Benchmark: 153,000 - 242,000 USD yearly
Job Description & How to Apply Below
Location: New York

Overview

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025.

Learn more at

About the Role

SysEng HAVOCK (Hardware - Acceleration - Virtualization - Operating Systems - Containerization - Kernel)

CoreWeave is looking for a Senior Systems Engineer who is ready to evolve beyond traditional DevOps. You will start by stabilizing and scaling our Linux OS and kernel build pipelines. Once the foundation is set, you will lead the transition to AI-native infrastructure, building “smart” workflows that don’t just report errors, but understand and fix them.

You are a Systems Engineer at heart, but you are ready to apply LLMs, RAG, and predictive modeling to solve infrastructure challenges at scale.

Our Team’s Stack
  • Languages:
    Python, Go, bash/sh
  • Observability:
    Prometheus, VictoriaMetrics, Grafana
  • OS & Kernel:
    Linux Kernel (custom build), Ubuntu
  • Hardware:
    Intel/AMD/ARM CPUs, NVIDIA GPUs, DPUs, InfiniBand and Ethernet NICs
  • Containerization:
    Docker, Kubernetes (k8s), KubeVirt, containerd, kubelet
Responsibilities
  • Pipeline Architecture:
    Design, maintain, and automate reproducible OS image build pipelines for our massive fleet of GPU-accelerated servers.
  • Kernel Distribution:
    Collaborate with kernel engineers to package, validate, and distribute custom Linux builds across Intel, AMD, and ARM architectures.
  • Dependency Management:
    Build tooling to manage dependencies, versioning, and release workflows, ensuring hermetic builds.
  • Telemetry & Metrics:
    Standardize the collection of build metrics to create a baseline for future AI modeling.
  • "Smart" CI/CD & Auto-Remediation:
    Architect AI agents that ingest and analyze build logs in real time. Develop systems that auto-triage errors, categorize failure patterns, and generate context-aware fix suggestions for engineering teams.
  • Predictive Regression Modeling:
    Design ML workflows that utilize historical performance data to detect kernel and OS regressions (latency, throughput, stability) in staging environments before they impact production.
  • Dynamic Kernel Tuning:
    Implement closed-loop feedback systems that analyze real-time system metrics and automatically suggest or apply sysctl parameter optimizations for specific customer workloads.
  • Next-Gen ChatOps:
    Engineer LLM-driven interfaces for Slack/internal tools, enabling stakeholders to query build statuses, request log summaries, or provision resources using natural language commands.
Requirements
  • 4+ years of professional experience in Linux Systems Engineering, Release Engineering, or DevOps.
  • Deep knowledge of Linux internals (boot process, kernel modules, networking stack).
  • Experience with package management (Debian/Ubuntu) and build systems.
  • Strong proficiency in Python (essential for the AI integration aspects of this role).
  • Demonstrable experience integrating API-based AI models (OpenAI, Anthropic, or local open-source models) into software workflows.
  • Understanding of RAG (Retrieval-Augmented Generation) architectures for querying technical documentation or logs.
  • Experience building event-driven automation (e.g., using webhooks to trigger analysis agents).
  • Familiarity with data structures required for vector search or time-series analysis.
Nice-to-haves
  • Experience with Kubeflow or MLflow.
  • Background in High-Performance Computing (HPC).
  • Experience fine-tuning small language models (SLMs) for code or log analysis tasks.
Salary & Benefits

The base salary range for this role is $153,000 to $242,000. The starting salary will be determined based on job-related knowledge, skills, experience, and market location. We strive for both market alignment and internal equity when determining compensation. In addition to base salary, our total rewards package includes a discretionary bonus, equity awards, and a…

Position Requirements
10+ Years work experience