×
Register Here to Apply for Jobs or Post Jobs. X

Backend Engineer, AI Systems

Job in Palo Alto, Santa Clara County, California, 94306, USA
Listing for: A1
Full Time position
Listed on 2026-06-18
Job specializations:
  • Software Development
    Backend Developer, AI Engineer (Applied/Software), Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 125000 - 150000 USD Yearly USD 125000.00 150000.00 YEAR
Job Description & How to Apply Below

About the Company

A1 is building a proactive AI chat app for everyday users to bring intelligence to conversations, errands, organising and workflows. Unlike traditional chat-based applications, our product focuses on achieving high reliability for long-running workflows, persistent context, and real-world task completion. The system must handle multi-step reasoning, interact with external tools, and remain reliable despite non-deterministic model behavior.

Role Overview

As a Backend Engineer, AI, you own the inference and orchestration layer that powers every AI interaction in the product. Your work sits between models and users, where latency, correctness, reliability, and cost directly impact real-world experience. Build and operate production systems that turn model capability into fast, stable, observable APIs used across mobile and desktop clients.

Focus
  • Build and operate backend systems that serve AI-powered features in production.
  • Design inference pipelines and orchestration layers that handle multi-step workflows, tool calls, and retries.
  • Manage the full lifecycle of AI requests: routing, caching, batching, streaming, and state management.
  • Optimize latency, throughput, and cost across model inference and downstream systems.
  • Design systems that remain reliable despite non-deterministic model behavior and external dependencies.
  • Implement observability for AI systems, including logging, tracing, and debugging of model outputs and failures.
  • Collaborate with ML and product teams to translate model capabilities into stable, production-grade APIs.
Ideal Experiences
  • Strong backend engineering fundamentals in production environments.
  • Experience running high-throughput, low-latency services.
  • Familiarity with AI inference patterns (LLMs, embeddings, multimodal).
  • Comfortable debugging distributed systems under load.
  • Bias toward shipping and learning from production behavior.
Outcomes
  • Backend systems run reliably at scale, handling production AI traffic with low latency and high throughput.
  • Multi-step AI workflows complete successfully across tools and services, with robust handling of failures and retries.
  • APIs are stable, clear, and support seamless integration with frontend and ML systems.
  • Production incidents are quickly detected, diagnosed, and resolved, minimizing user impact.
  • Iterative improvements based on real usage continuously increase system performance and reliability.
  • System design evolves to support increasing scale, complexity, and new AI capabilities without major rewrites.
Tech Stack
  • Python
  • Node Js
  • Pytorch
  • OpenAI / Anthropic / open-source LLMs
  • SQL & No

    SQL
  • Kubernetes
  • Docker
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary