More jobs:
Backend Engineer — LLM Infrastructure
Job in
Sunnyvale, Santa Clara County, California, 94087, USA
Listed on 2026-05-24
Listing for:
CloudAct Inc.
Full Time
position Listed on 2026-05-24
Job specializations:
-
Software Development
Backend Developer
Job Description & How to Apply Below
You will work on the always-in-path FastAPI proxy that sits between every customer request and every provider. That means guardrails, prompt management, A/B testing, cost attribution, rate limits, and streaming. You will own the parts of the request lifecycle that have to stay correct under real production load.
Responsibilities- Extend the Nemo Backend FastAPI proxy with new guardrails and features
- Own streaming, retries, and provider failover correctness
- Build and maintain cost attribution from x-nemo-request-cost
- Profile and tune hot paths — every millisecond is in the user-facing latency budget
- Harden multi-tenancy isolation at the request layer
- 5+ years of backend Python in production
- Deep experience with asyncio and high-concurrency services
- Comfortable with Postgres, connection pooling, and query optimization
- Production experience with streaming APIs or proxies
- Prior work on LLM APIs, model gateways, or SSE streaming
- Experience with LLM routing engines or model gateways
- Performance profiling and flame graph literacy
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×