Support Engineer
Job in
Vineland, Cumberland County, New Jersey, 08361, USA
Listed on 2026-07-01
Listing for:
Nebius
Full Time
position Listed on 2026-07-01
Job specializations:
-
IT/Tech
Unix/Linux, Network Engineer, IT Infrastructure, Systems Engineer
Job Description & How to Apply Below
Nebius is seeking a Data Center Support Engineer to support large scale bare metal GPU infrastructure powering AI workloads. This role is responsible for operating, troubleshooting, and maintaining production infrastructure across servers, Linux systems, networking, firmware, drivers, and physical data center components.
You will serve as a senior technical escalation point between data center technicians, infrastructure engineering, network engineering, systems engineering, and customer-facing teams. The ideal candidate is hands‑on, comfortable in Linux, strong in hardware diagnostics, and able to troubleshoot complex issues across multiple layers of the infrastructure stack.
What You’ll Do- Support and troubleshoot production bare-metal GPU infrastructure across Nebius data center environments.
- Diagnose high-priority issues across Linux, hardware, firmware, BIOS, drivers, networking, storage, optics, cabling, and physical infrastructure.
- Perform hardware diagnostics, component replacement, firmware updates, BIOS configuration, break/fix, and infrastructure maintenance.
- Use Linux command-line tools, system logs, BMC data, and hardware health signals to identify root cause.
- Partner with data center technicians and engineering teams to coordinate remote troubleshooting, incident response, RCA, and operational recovery.
- Troubleshoot network connectivity issues involving TCP/IP, VLANs, DNS, DHCP, switch connectivity, optics, cabling, and link-level failures.
- Create troubleshooting guides, runbooks, escalation notes, and knowledge base documentation.
- Use scripting or automation to improve support workflows and reduce manual intervention.
- Participate in on‑call or after‑hours support for production infrastructure as needed.
- 5+ years of experience in data center operations, infrastructure support, systems administration, hardware support, cloud infrastructure, or similar technical environments.
- Hands‑on experience troubleshooting bare‑metal servers, Linux systems, hardware components, and networking issues in production environments.
- Intermediate Linux command-line proficiency, including service validation, log review, process inspection, network configuration checks, and system diagnostics.
- Experience with server hardware diagnostics, component replacement, firmware updates, BIOS configuration, driver troubleshooting, or BMC tools such as iDRAC, iLO, IPMI, Redfish, or similar.
- Strong networking fundamentals, including TCP/IP, VLANs, DNS, DHCP, routing/switching concepts, optics, cabling, and link‑level troubleshooting.
- Experience participating in incident response, escalation handling, root cause analysis, or operational recovery in high‑availability environments.
- Strong communication and documentation skills, with the ability to work cross‑functionally with data center, infrastructure, network, systems, and engineering teams.
- Experience with NVIDIA GPUs, GPU servers, HPC, AI infrastructure, or high-density compute platforms.
- Experience with Supermicro, Dell, HPE, Lenovo, Cisco UCS, Arista, Cisco, Juniper, or similar infrastructure platforms.
- Familiarity with Infini Band, RoCE, MPO/MTP fiber, high‑speed Ethernet, or clustered compute environments.
- Experience with Python, Bash, Ansible, Terraform, Power Shell, or similar automation tools.
- Familiarity with Jira, Confluence, Service Now, Grafana, Prometheus, Kubernetes, Docker, or similar operational tools.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×