Infrastructure Engineer; OpenStack Neutron Specialist
Listed on 2026-04-28
IT/Tech
Systems Engineer, Network Engineer
About Nscale
Nscale is the GPU cloud engineered for AI. We provide cost‑effective, high‑performance infrastructure for AI start‑ups and large enterprise customers. Nscale enables AI‑focused companies to achieve superior results by reducing the complexity of AI development. Our GPU cloud bolsters technical capabilities and directly supports strategic business outcomes, including cost management, rapid innovation, and environmental responsibility.
About the Role (Job Purpose)
The Infrastructure Engineer (Neutron Specialist) sits within the Infrastructure Engineering team. The team is responsible for the design, implementation, operation and continuous improvement of the infrastructure stack that underpins all internal and customer‑facing services. This specialist role focuses on the OpenStack networking stack, with particular emphasis on Neutron and its associated technologies, including OVN, Open vSwitch, routing, DHCP, metadata services, tenant isolation and network automation.
The role ensures the availability, scalability, performance and security of the networking layer across our cloud platforms and serves as a key link into the upstream OpenStack community.
- Designing, implementing and operating scalable, resilient and secure OpenStack networking platforms with a focus on Neutron and OVN/OVS.
- Owning the architecture and day‑to‑day operation of virtual networking services, including L2/L3 networking, DHCP, metadata, floating IPs, NAT, security groups and tenant segmentation.
- Troubleshooting complex control‑plane and data‑plane issues across OpenStack networking components and the underlying Linux networking stack.
- Driving continuous improvement in network automation, provisioning, validation, monitoring and recovery using infrastructure‑as‑code and configuration‑management tools.
- Working closely with compute, storage and platform engineering teams to ensure seamless integration between Neutron and the wider OpenStack ecosystem.
- Leading performance tuning, scalability planning and resilience improvements for network‑heavy and latency‑sensitive cloud workloads.
- Acting as a third/fourth‑line escalation point for advanced networking incidents, conducting root‑cause analysis and driving permanent fixes.
- Supporting upgrades, lifecycle management and change execution across OpenStack networking services with a focus on service continuity and operational excellence.
- Contributing specialist input to infrastructure roadmap planning, platform standards and solution design for customer and internal environments.
- Supporting pre‑sales and solution design activities by providing expert guidance on cloud networking capabilities, constraints and best practices.
- Contributing to upstream OpenStack networking communities, particularly Neutron and related projects such as OVN, through bug reports, code contributions, design discussions, testing and reviews where appropriate.
- Tracking upstream roadmaps, release changes and community direction to help shape Nscale's networking strategy, upgrade planning and platform standards.
- Representing Nscale's operational requirements and real‑world use cases in upstream discussions to drive improvements that benefit both the business and the broader community.
- Ensuring OpenStack networking platforms adhere to security, compliance and operational standards.
- Participating in on‑call rotations and incident response activities for critical infrastructure services.
- Strong Linux systems administration and troubleshooting experience.
- Deep hands‑on experience deploying, operating, upgrading and troubleshooting large‑scale OpenStack environments.
- Strong specialist knowledge of Neutron, including ML2, OVN, Open vSwitch, routing, DHCP, metadata, provider networks, tenant networks, VLAN/VXLAN/Geneve and security groups.
- Strong understanding of Linux networking concepts, including routing, bridging, namespaces, iptables/nftables, bonding, MTU and packet flow analysis.
- Strong experience investigating complex network behaviour using diagnostic and observability tools such as tcpdump, iproute2, OVS/OVN tooling, logs and metrics.
- Strong experience designing and building automation for…