Network Engineer, Operations & Reliability
Listed on 2025-12-30
IT/Tech
Network Engineer, Systems Engineer, IT Support
The Role
Fluidstack is seeking a Network Operations Engineer to serve as a Regional Site Lead for one of our datacenter campuses. This hybrid role combines hands‑on Tier 2/3 network operations with site leadership responsibilities. You’ll be the boots‑on‑the‑ground expert for your assigned datacenter or campus, ensuring network reliability through incident response, break‑fix coordination, and operational excellence. You’ll work remotely when workload allows but be on‑site as needed for deployments, complex troubleshooting, and critical incidents.
This role is ideal for experienced network operators who want ownership of a datacenter campus while being part of a broader operations organization. You’ll partner closely with the Operations & Reliability pillar lead, centralized NOC for Tier 1 escalations, and cross‑functional teams including Deployment, Hardware, and DC Operations. Success means maintaining high availability for your region, building strong relationships with onsite teams, and growing into regional operations leadership as the team scales.
Focus
- Regional Operations Ownership: Serve as the primary network operations contact for your assigned datacenter campus. Own network health, respond to incidents escalated from NOC, and ensure fabrics run reliably. Build deep knowledge of your region’s network topology, common failure modes, and operational characteristics.
- Tier 2+ Incident Response: Handle network incidents escalated from Tier 1 NOC during your coverage window. Troubleshoot complex issues across physical and logical layers, coordinate with other engineers for follow‑the‑sun coverage, and drive incidents to resolution. Lead incident response when you’re the subject matter expert on the ground.
- Break‑Fix Coordination: Coordinate hardware break‑fix activities with onsite DC Operations technicians. Manage linecard swaps, optic replacements, device troubleshooting, and RMA processes. Ensure physical infrastructure issues are resolved quickly and don’t impact production workloads.
- Deployment Support: Provide operational support during new datacenter deployments and expansions in your region. Partner with Deployment teams on turn‑up activities, validate production readiness, and ensure smooth handovers from deployment to operations. Be the person who ensures new pods integrate seamlessly into operational workflows.
- Runbook Execution & Improvement: Execute operational runbooks for common failure scenarios and maintenance procedures. Identify gaps in runbooks, document lessons learned, and provide feedback to the Operations pillar lead on runbook improvements. Build the operational knowledge base for your region.
- Cross‑Team Collaboration: Build strong relationships with onsite DC Operations teams, structured cabling vendors, and hardware logistics partners. Serve as the network engineering liaison for your datacenter campus. Communicate clearly about network status, planned maintenance, and operational issues.
- Regional Mentorship: As the regional team scales, mentor junior operations engineers assigned to your datacenter. Share operational knowledge, provide guidance during incidents, and help build regional operations capacity.
- Strong Operations Background: 5‑8 years in network engineering with significant hands‑on operational experience. You’ve run production networks, responded to incidents at all hours, and debugged complex failures under pressure. You understand the difference between “working” and “production‑ready.”
- Datacenter Fabric Expertise: Deep experience operating modern datacenter networks including EVPN/VXLAN, BGP, CLOS topologies, and high‑radix switching. You’re comfortable troubleshooting Layer 2/3 issues, BGP routing problems, fabric misconfigurations, and physical layer failures.
- Incident Response Excellence: Proven ability to lead incident response, perform systematic troubleshooting, and drive issues to resolution. You remain calm during outages, communicate clearly with stakeholders, and know when to elevate versus dig deeper.…