Openshift AI Ops Consultant
Job in
Trenton, Mercer County, New Jersey, 08629, USA
Listed on 2026-06-11
Listing for:
TEKsystems
Part Time
position Listed on 2026-06-11
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing, AI Engineer (Applied/Software), SRE/Site Reliability
Job Description & How to Apply Below
Pennington, NJ (Onsite 3 days/week - non-negotiable)
⏳ Contract (6 months, strong extension potential)
Overview
We're seeking a senior AI Platform SRE / MLOps engineer to support and stabilize a production Generative AI platform , running on Red Hat Open Shift.
This is a hands-on, high-impact role focused on operational excellence, reliability engineering, and performance tuning of GPU-accelerated AI workloads in a regulated enterprise environment.
You will act as a key technical resource within Dell's delivery team, helping bring structure, stability, and scalability to an evolving GenAI platform.
What You'll Do
+ Own day-to-day operations of a production GenAI platform running on Open Shift/Kubernetes
+ Diagnose and resolve performance, stability, and scaling issues across AI workloads
+ Optimize GPU-based inference pipelines using tools like:
+ NVIDIA Triton Inference Server
+ Tensor
RT / CUDA
+ Implement SRE best practices:
+ Monitoring & observability (Prometheus, Grafana, etc.)
+ Incident response & root cause analysis
+ Automation & runbook creation
+ Improve cluster performance, resource utilization, and reliability
+ Collaborate with stakeholders while operating with high autonomy and limited guidance
+ Ensure platform adheres to enterprise governance, security, and compliance standards
✅ Required Qualifications (Must-Have)
+ 8+ years in SRE / Dev Ops / Platform Engineering roles
+ Deep experience with Red Hat Open Shift or Kubernetes in production environments
+ Cluster administration, scaling, upgrades, troubleshooting
+ Hands-on experience supporting AI/ML workloads in production
+ Proven experience with GPU-accelerated environments, including:
+ NVIDIA stack (CUDA, Triton, Tensor
RT, etc.)
+ Strong SRE mindset:
+ Incident management, monitoring, uptime, reliability engineering
+ Scripting/automation experience (Python, Bash, etc.)
+ Ability to operate independently in ambiguous, high-pressure environments
➕ Nice to Have
+ Experience in financial services or regulated environments
+ Familiarity with MLOps tooling (Kubeflow, MLflow, ArgoCD)
+ Knowledge of model optimization techniques (quantization, pruning)
+
Certifications:
+ Red Hat (RHCE)
+ CKA / CKS
+ Prior consulting or residency-style engagements
⚠️ Important Notes
+ Onsite requirement: 3 days/week in Pennington, NJ
+ (No remote exceptions; travel not reimbursed)
+ This role is operations-focused, not model development
+ Initial contract is ~6 months with strong potential for extension
+ You will be expected to lead, not follow-high ownership and accountability
Why This Role?
+ Work on a live, enterprise-scale Generative AI platform
+ Solve real production challenges, not experimental projects
+ High visibility with Dell + Bank of America leadership
+ Opportunity to stabilize and shape the future of AI infrastructure in a regulated environment
Ideal Candidate
A senior AI platform engineer / SRE who thrives at the intersection of:
+ Kubernetes/Open Shift infrastructure
+ GPU-accelerated ML systems
+ Production reliability and performance
You're someone who can step into a complex environment, quickly identify gaps, and drive meaningful improvements from day one.
Job Type & Location
This is a Contract position based out of Trenton, NJ.
Pay and Benefits
The pay range for this position is $80.00 - $90.00/hr.
Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. If eligible, the benefits available for this temporary role may include the following: - Medical, dental & vision - Critical Illness, Accident, and Hospital - 401(k) Retirement Plan - Pre-tax and Roth post-tax contributions available - Life Insurance (Voluntary Life & AD&D for the employee and dependents) - Short and long-term disability - Health Spending Account (HSA) - Transportation benefits - Employee Assistance Program - Time Off/Leave (PTO, Vacation or Sick Leave)
Workplace Type
This is a hybrid position in Trenton,NJ.
Final date to receive applications
This position is anticipated to close on Jun 22, 2026.
About…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×