Software Engineer, Kubernetes/Cloud
Listed on 2026-03-11
-
Software Development
Software Engineer, DevOps
About Infrinia.ai, powered by Soft Bank
Soft Bank is making significant investments in infrastructure for AI. Through its wholly owned US subsidiary, Soft Bank Corp. has established Infrinia team in Silicon Valley, focused on infrastructure software for AI and AI foundations for mobile networks. Our goals are to challenge the norms and create products making use of our SOTA infrastructure (like Nvidia GB200, MGX and DGX Grace & Hopper platforms) and cloud-native software.
These products are geared towards centralized AI data centers as well as distributed AI Radio Access Network (AI RAN) data centers. We are looking for experienced practitioners who are inspired to bring innovation and build transformative products.
- Bachelor's degree in Computer Science, Electrical Engineering, or related field.
- 3+ years of software development experience with a focus on distributed systems or cloud infrastructure.
- Strong proficiency in Go (Golang) or C/C++.
- Deep understanding of data structures, algorithms, and software design patterns.
- Experience building and extending Kubernetes (e.g., custom controllers, Operators, CRDs, or API aggregation).
- Master's or PhD in a relevant field.
- Experience contributing to the Kubernetes open-source ecosystem (k8s upstream, CNCF projects).
- Deep knowledge of Kubernetes internals (scheduler, kubelet, etcd, networking).
- Experience building software for bare-metal provisioning or cloud orchestration (Cluster API, Tinkerbell, etc.).
- Familiarity with OCI standards, container runtimes (containerd, CRI-O), and Linux kernel features (eBPF, cgroups etc is a plus).
Be a key member of the software engineering team responsible for building the next-generation cloud-native platform for AI. You will write code to extend and customize Kubernetes, enabling it to orchestrate massive AI workloads on our SOTA GPU infrastructure. You will move beyond simply using Kubernetes APIs to designing and implementing the software logic (Operators, Controllers) that automates the lifecycle of our compute, network, and storage resources.
Responsibilities- Design and implement high-performance, production-quality code in Go to build custom Kubernetes Operators and Controllers.
- Develop software APIs and microservices that abstract complex infrastructure primitives for AI users.
- Architect and build features for multi-cluster management, scheduling optimization, and resource isolation.
- Collaborate with kernel/system engineers to expose hardware capabilities (GPU, NICs) up to the orchestration layer.
- Write comprehensive unit and integration tests to ensure software reliability and stability.
- Contribute to Product Definition (PRD) and program execution (sprint) planning.
- Role model and foster a culture of humility and innovation for product delivery.
The base salary for this position ranges from ($120,000-$180,000), with additional attractive biannual bonus and benefits.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).