×
Register Here to Apply for Jobs or Post Jobs. X

Senior Software Engineer – Inference Platform Infrastructure

Job in Coos Bay, Coos County, Oregon, 97458, USA
Listing for: NVIDIA
Full Time position
Listed on 2026-02-24
Job specializations:
  • Manufacturing / Production
    Systems Engineer
Salary/Wage Range or Industry Benchmark: 287500 USD Yearly USD 287500.00 YEAR
Job Description & How to Apply Below

Employer Industry: Technology - AI and GPU Infrastructure

Why consider this job opportunity:
  • Salary range up to $287,500 for Level 4 positions
  • Eligible for equity and comprehensive benefits
  • Opportunity to work with cutting‑edge technology in AI and distributed systems
  • Chance to make significant contributions to platform reliability and performance
  • Collaborative environment with forward‑thinking professionals
What to Expect (Job Responsibilities):
  • Build automation for provisioning, configuration, upgrades, and routine maintenance of inference services
  • Create and evolve deployment patterns for inference workloads on Kubernetes
  • Own platform reliability outcomes, including defining and improving SLIs/SLOs and automated remediation
  • Manage a large fleet of NVIDIA GPU and datacenter hardware from pre‑release to production
  • Collaborate with cross‑functional teams to drive improvements and ensure operational excellence
What is Required (Qualifications):
  • Strong software engineering skills with a focus on building reliable platforms and systems
  • Minimum of 5 years of experience in building and operating production distributed systems
  • Proven expertise in cloud‑native platforms, including Kubernetes and CI/CD
  • Deep experience with infrastructure‑as‑code and automation‑first operations
  • BS/MS in Computer Science, Computer Engineering, or related field, or equivalent experience
How to Stand Out (Preferred Qualifications):
  • Direct experience in operating inference serving at scale (e.g., Triton, Tensor

    RT‑LLM)
  • Experience building scheduling or quota systems for Kubernetes
  • Familiarity with fleet health systems and automation for failure triage

We prioritize candidate privacy and champion equal‑opportunity employment. Central to our mission is our partnership with companies that share this commitment. We aim to foster a fair, transparent, and secure hiring environment for all. If you encounter any employer not adhering to these principles, please bring it to our attention immediately.

#J-18808-Ljbffr
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary