Senior AI Systems Engineer; Agentic & LLM Production
Listed on 2026-05-30
Software Development
AI Engineer, Machine Learning/ ML Engineer
NTT DATA strives to hire exceptional, innovative, and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. NTT DATA's Client is currently seeking an Application Management Specialist to join their team in New York/Dallas/Remote, New York (US-NY), United States (US).
Job Duties
- Build agentic AI systems: Design and implement tool‑calling agents that combine retrieval, structured reasoning, and secure action execution (function calling, change orchestration, policy enforcement) following the Model Context Protocol (MCP). Engineer robust guardrails for safety, compliance, and least‑privilege access.
- Productionize LLMs: Build an evaluation framework for open‑source and foundational LLMs; implement retrieval pipelines, prompt synthesis, response validation, and self‑correction loops tailored to production operations.
- Integrate with runtime ecosystems: Connect agents to observability, incident management, and deployment systems to enable automated diagnostics, runbook execution, remediation, and post‑incident summarization with full traceability.
- Collaborate directly with users: Partner with production engineers and application teams to translate production pain points into agentic AI roadmaps; define objective functions linked to reliability, risk reduction, and cost; and deliver auditable, business‑aligned outcomes.
- Safety, reliability, and governance: Build validator models, adversarial prompts, and policy checks into the stack; enforce deterministic fallbacks, circuit breakers, and rollback strategies; instrument continuous evaluations for usefulness, correctness, and risk.
- Scale and performance: Optimize cost and latency via prompt engineering, context management, caching, model routing, and distillation; leverage batching, streaming, and parallel tool‑calls to meet stringent SLOs under real‑world load.
- Build a RAG pipeline: Curate domain knowledge; build a data‑quality validation framework; establish feedback loops and a milestone framework to maintain knowledge freshness.
- Raise the bar: Drive design reviews, experiment rigor, and high‑quality engineering practices; mentor peers on agent architectures, evaluation methodologies, and safe deployment patterns.
- 5 years of software development in one or more languages (Python, C/C++, Go, Java); strong hands‑on experience building and maintaining large‑scale Python applications preferred.
- 3 years designing, architecting, testing, and launching production ML systems, including model deployment/serving, evaluation and monitoring, data processing pipelines, and model fine‑tuning workflows.
- Practical experience with Large Language Models (LLMs): API integration, prompt engineering, fine‑tuning/adaptation, and building applications using RAG and tool‑using agents (vector retrieval, function calling, secure tool execution).
- Understanding of different LLMs, both commercial and open source, and their capabilities (e.g., OpenAI, Gemini, Llama, Qwen, Claude).
- Solid grasp of applied statistics, core ML concepts, algorithms, and data structures to deliver efficient and reliable solutions.
- Strong analytical problem‑solving, ownership, and urgency; ability to communicate complex ideas simply and collaborate effectively across global teams with a focus on measurable business impact.
- Preferred: Proficiency building and operating on cloud infrastructure (ideally AWS), including containerized services (ECS/EKS), serverless (Lambda), data services (S3, DynamoDB, Redshift), orchestration (Step Functions), model serving (SageMaker), and infra‑as‑code (Terraform/CloudFormation).
Where required by law, NTT DATA provides a reasonable range of compensation for specific roles. The starting hourly range for this remote role is (HOURLY RATE MIN TO MAX). This range reflects the minimum and maximum target compensation for the position across all US locations. Actual compensation will depend on several factors, including the candidate's actual work location, relevant experience, technical skills, and other qualifications.
This position may also be eligible for incentive compensation based on individual…