Senior AI Engineer
Listed on 2026-06-02
-
Software Development
AI Engineer, Machine Learning/ ML Engineer
We are searching for an AI Product expert who can take ownership of a SaaS product and work with frontend and backend engineers to release world class AI product offerings. This is a hands‑on leader with coding experience and with deep experience in AI technologies, GPUs, Multi-agent frameworks, AI models and knows a lot about inference, fine tuning, agents, optimization quantization, etc.
Salary: Depends On Experience And Current Verifiable (paychecks) Compensation.
Location: Northern Virginia
Summary OfThe Role
We're looking for a Senior AI Product Manager / Engineer to design and build end-to-end AI systems - from model deployment and optimization to autonomous agent orchestration. This is not a research-only role. You'll operate at the intersection of:
- LLMs & multimodal models
- agent frameworks and workflow orchestration
- production-grade infrastructure
You will help define how intelligent systems are built, deployed, and scaled in real‑world environments.
What You'll Work On- Design and implement autonomous AI agents capable of multi‑step reasoning, tool use, and workflow execution
- Build agent orchestration systems (memory, planning, tool calling, state management)
- Deploy and serve models (LLMs, vision, multimodal) in production environments
- Optimize models via fine‑tuning (LoRA, full fine‑tune), quantization (INT8, 4‑bit, GGUF, etc.), distillation and performance tuning
- Develop multi-model pipelines (generation + retrieval + tools + agents)
- Integrate external tools/APIs into agent workflows
- Build evaluation systems for reasoning quality, hallucination detection, task success rates
- Architect and implement end-to-end AI pipelines
- Work with open‑source and proprietary models (LLMs, diffusion, etc.)
- Implement RAG systems, embeddings, and vector search
- Design prompting + system instruction strategies
- Improve latency, throughput, and cost efficiency
- Deploy models using modern stacks (containers, GPUs, serverless where applicable)
- Build scalable inference systems
- Manage model versioning, monitoring, and rollback strategies
- Work with distributed systems and async processing pipelines
- Build custom agent frameworks or extend existing ones
- Implement planning/reasoning loops, tool usage, memory (short‑term + long‑term)
- Design reusable workflows for real‑world use cases
- Write clean, maintainable, production‑grade code (Python primarily)
- Design APIs and services for internal and external use
- Collaborate with product and design to ship user‑facing features
- Write clear technical requirements (PRDs / tech specs)
- Produce and maintain technical documentation
- Conduct code reviews and enforce engineering standards
- Define evaluation metrics and testing strategies for AI systems
- Participate in architecture discussions and system design
- 3-5+ years in software engineering, with a strong focus on AI/ML systems
- Hands‑on experience with LLMs and/or multimodal models
- Experience building or working with AI agents or multi‑step workflows
- Strong Python skills and familiarity with ML frameworks (PyTorch, etc.)
- Experience with model deployment (Docker, cloud, GPU infra), fine‑tuning, and/or quantization
- Solid understanding of prompt engineering, RAG architectures, embeddings + vector databases
- Experience with frameworks like Lang Graph, Lang Chain, Llama Index or custom agent systems
- Familiarity with model serving tools (vLLM, Tensor
RT, ONNX, etc.) - Experience with distributed systems and high‑scale APIs
- Background in performance optimization/systems engineering
- Contributions to open‑source AI projects
- Builders who ship, not just experiment
- Strong systems thinking (not just model‑level thinking)
- Ability to move between research ideas and production systems
- Clear communication and documentation habits
- Ownership mindset and product intuition
- Work on cutting‑edge agent systems, not just wrappers
- High ownership and ability to shape architecture
- Build a full‑stack AI platform, not a narrow feature
- Fast‑moving environment with real‑world impact
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).