More jobs:
Infrastructure Engineer
Job in
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-02-13
Listing for:
DPP Tech
Full Time
position Listed on 2026-02-13
Job specializations:
-
IT/Tech
Systems Engineer, AI Engineer
Job Description & How to Apply Below
DPP Tech is a technology consulting and staffing services company based in the San Francisco Bay Area. We offer benefits, including 401K, sick leave, and medical insurance, and provide direct payroll deposits via Paychex.
About the RoleWe are hiring an AI Infrastructure Engineer to design, deploy, and operate AI racks and GPU
-based computer platforms supporting large-scale machine learning and generative AI workloads. This is a Remote long contract opportunity.
- Design, deploy, and support AI racks and GPU compute platforms across on-prem, colocation, and cloud environments
- Manage and optimize GPU clusters for AI/ML training and inference workloads
- Support hybrid AI infrastructure spanning data centers and cloud providers
- Configure and maintain Linux-based AI systems, GPU drivers, and firmware
- Support high-performance networking (Infini Band / high-speed Ethernet) for distributed AI workloads
- Work with vendors on AI hardware procurement, installation, and lifecycle management
- Enable containerized and scheduled AI workloads using Kubernetes or similar systems
- Partner with ML engineers and platform teams to ensure efficient model training and deployment
- Monitor infrastructure health, performance, utilization, power, and cooling
- Troubleshoot hardware, OS, networking, and performance issues end-to-end
- Implement security, reliability, and operational best practices for AI infrastructure
- 5+ years of experience in infrastructure, systems, or platform engineering
- Hands-on experience with AI racks, GPU servers, or HPC environments
- Strong knowledge of GPU hardware and accelerators (NVIDIA preferred)
- Experience administering Linux systems in production
- Experience with container platforms or job schedulers (Kubernetes, Slurm, etc.)
- Understanding of distributed systems and high-performance networking
- Ability to operate across hardware, OS, and platform layers
Design, deploy, and support AI racks and GPU compute platforms
Please send your resume and contact information to discuss further.
#J-18808-LjbffrTo View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×