More jobs:
Member of Technical Staff, Forward Deployed AI Engineer
Job in
San Mateo, San Mateo County, California, 94409, USA
Listed on 2026-06-01
Listing for:
Inception
Full Time
position Listed on 2026-06-01
Job specializations:
-
IT/Tech
AI Engineer (Applied/Software), Systems Engineer, Machine Learning/ ML Engineer, Cloud Computing
Job Description & How to Apply Below
Inception is hiring Forward Deployed AI Engineers to help enterprise customers deliver the highest quality AI experiences using our diffusion-based language models.
This role sits at the intersection of product engineering, customer implementation, evals, data collection, model optimization, and enterprise deployment ownership. You will work directly with enterprise customers to identify high-value AI workflows, collect and structure customer data, build LLM-as-judge evaluation systems, tune model and product behavior for customer-specific goals, and turn fast proof-of-concepts into production deployments.
This is not a traditional solutions engineering role, a pure research role, or a long-cycle consulting implementation role. We are looking for full-stack engineers who can operate close to customers, build real systems, communicate clearly, and move fast - including running fast POC cycles that take weeks to produce customer impact rather than exploratory research projects that take months.
As an early member of the team responsible for turning Mercury models into high-value enterprise deployments and building the customer data flywheel that improves our models, products, and go-to-market motion. You will work closely with platform, serving, post-training, product engineering, and GTM teams to translate customer deployment learnings into model, product, and infrastructure improvements.
Key Responsibilities
- Enterprise customer deployments: Work directly with strategic enterprise customers to identify high-value AI workflows and turn them into production deployments.
- Rapid prototyping: Build and run fast proof-of-concepts, iterating on customer requirements and technical constraints on 2-week cycles.
- Production AI applications: Build full-stack AI applications, agentic workflows, integrations, internal tools, and customer-facing systems that bring Inception models into real enterprise environments.
- Data collection & feedback loops: Collect, structure, and operationalize customer data to improve model and product performance on customer use cases.
- Measurement and Evaluation: Define success metrics for customer deployments and design LLM-as-judge workflows, evaluation harnesses, and feedback loops for customer-specific use cases.
- Model and product optimization: Tune and customize Mercury models, prompts, workflows, and system architecture to meet customer-specific performance goals.
- Agentic workflows: Build and optimize agentic workflows including subagents involving classification, routing, context compaction, search, coding agents, voice, and other latency-sensitive applications.
- Build, prove, and generalize: Turn customer-specific deployments into repeatable product patterns, eval frameworks, implementation playbooks, and platform capabilities that improve Inception's core product.
- Strong engineering skills in Python and modern full-stack development, including APIs, backend systems, and ideally Type Script/JavaScript.
- Experience building, deploying, or integrating AI/LLM products with real users or customers.
- Familiarity with LLM evaluation, LLM-as-judge workflows, data pipelines, model tuning, prompt optimization, or agentic workflows.
- Customer-facing experience with enterprise, strategic, or high-value accounts.
- Experience deploying software or AI systems in enterprise environments with security, privacy, reliability, compliance, or integration constraints.
- Strong communication and discovery skills, with the ability to translate ambiguous customer needs into concrete technical solutions.
- Ability to operate across engineering, product, sales, and customer success without requiring heavy process or hand holding.
- Willingness to work directly with customers in person when needed, including occasional travel for strategic deployments, workshops, and executive technical sessions.
- Experience with RAG, search, voice AI, coding agents, or agentic workflow systems.
- Experience deploying AI systems for Fortune 500 or large enterprise customers.
- Track record owning technical pre-sales, post-sales, implementation, or customer expansion for million-dollar enterprise accounts.
- Familiarity with LLM serving, latency optimization, model evaluation, or production ML systems.
- Experience with data engineering, synthetic data generation, or feedback loops for model improvement.
- Background in product engineering, ML product engineering, applied AI, or forward deployed engineering.
- Experience working with customer-specific evals, benchmarks, and performance targets.
- Familiarity with latency-sensitive applications, especially voice systems where response speed is critical.
This role is for builders who want to be close to customers and close to the product.
We are not looking for traditional solutions engineers who only configure demos, nor researchers who primarily want to work on open-ended model experiments. The strongest candidates are full-stack engineers with enough ML fluency to work across LLM…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×