×
Register Here to Apply for Jobs or Post Jobs. X

Software Engineer, Inference - Optimization

Job in Los Angeles, Los Angeles County, California, 90079, USA
Listing for: OpenAI
Full Time position
Listed on 2026-06-18
Job specializations:
  • Software Development
    Software Engineer, Backend Developer, AI Reliability/ Performance Engineer, AI Engineer (Applied/Software)
Salary/Wage Range or Industry Benchmark: 295000 USD Yearly USD 295000.00 YEAR
Job Description & How to Apply Below
Position: Software Engineer, Inference - Performance Optimization

Software Engineer, Inference - Performance Optimization

Inference – San Francisco

About the Team

Our team analyzes inference stack performance across the application, model, and fleet layers to identify bottlenecks and drive faster, cheaper inference. We combine systems profiling, benchmarking, and analysis to understand where time and cost are spent, then turn that understanding into performance optimizations and models that project performance and capacity needs for future launches.

About the Role

In this role, you will model inference performance across application, model, and fleet layers with higher fidelity. You will build cost-to-serve estimates from microbenchmarks and create tools that help cross‑functional teams reason about latency, capacity, utilization, and cost tradeoffs.

Responsibilities
  • Build and refine performance models that translate microbenchmark results into cost-to-serve estimates.
  • Analyze inference workloads end to end across applications, models, and fleet infrastructure.
  • Enhance tooling to identify bottlenecks across layers for latency and throughput.
  • Partner with other teams to turn performance insights into concrete improvements and project how future changes affect inference.
Qualifications
  • Enjoy reasoning from first principles about distributed systems, model inference, and hardware efficiency.
  • Are comfortable working across abstraction layers, from application behavior to kernels, accelerators, networking, and fleet scheduling.
  • Have deep expertise with performance profiling, benchmarking, analysis, and optimization.
  • Enjoy collaborating with engineering and research teams to improve real production systems.
Compensation

$295K – $555K + Offers Equity

OpenAI is an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary