Data Scientist; Azure AI Engineer
Location: Dubai
Experience: 8+ years (Data Science / AI Engineering / Applied ML)
Job Type: Contract
Job Summary
We are looking for a highly capable Full Stack Data Scientist / Azure AI Engineer who can build end-to-end AI products: data + ML/DL/CV models + agentic workflows + APIs + UI + scalable deployment on Kubernetes (AKS). The role requires deep expertise in the Azure AI ecosystem (Azure Machine Learning, Azure AI Foundry, Azure AI Search) and strong hands‑on experience building AI agents using LangChain, LangGraph, and/or Microsoft Agent Framework, with Langfuse for tracing, evaluation, and observability.
The ideal candidate has shipped production systems with measurable business impact and can operate them reliably through strong MLOps/LLMOps practices.
Key Responsibilities
1) End-to-End AI Product Delivery
- Translate business needs into robust AI solutions with clear KPIs, timelines, and measurable outcomes.
- Build AI applications that are secure, scalable, maintainable, and production-ready.
2) AI Agents & Agentic Workflows (Must-Have)
- Design, implement, and orchestrate AI agents capable of planning, tool use, function calling, retrieval, and multi-step execution.
- Build agent systems using:
  - LangChain for tool/function orchestration, retrieval, and integrations
  - LangGraph for stateful, multi-step, resilient agent workflows
  - Microsoft Agent Framework for enterprise‑grade agent patterns and integrations
- Implement agent patterns: routing, task decomposition, multi‑agent collaboration, memory, verification, retries/fallbacks, and human‑in‑the‑loop approvals.
- Apply security & safety: prompt‑injection defenses, tool permissioning, grounding/citations, policy checks, and audit logs.
3) LLMOps / Observability / Evaluation (Langfuse)
- Implement Langfuse (or equivalent) for:
  - Prompt and trace logging, latency/cost monitoring
  - Dataset‑based evaluation, regression testing, and quality gates
  - Feedback loops and continuous improvement of prompts/agents
- Establish evaluation frameworks for RAG/agents: retrieval metrics, answer quality, hallucination checks, and guardrail effectiveness.
- Training jobs, compute, environments, pipelines, MLflow tracking.
- Model registry and promotion, managed online endpoints.
- Implement CI/CD for model + application releases and MLOps practices: versioning, reproducibility, automated testing, and retraining triggers.
- Build GenAI solutions using Azure AI Foundry (prompt flows/orchestration, deployment integration, evaluation workflows).
- Ingestion/indexing of structured & unstructured data.
- Vector + hybrid search, semantic ranking (where applicable), filtering, and relevance tuning.
- Citations, metadata‑based access control, and indexing automation.
6) ML/DL & Computer Vision (Strong Requirement)
- Develop and deploy strong ML/DL solutions including Computer Vision.
- Conduct experimentation, tuning, and optimization (performance, robustness, cost).
- Productionize CV pipelines with monitoring and continuous improvement.
- Build production APIs for models and agents using FastAPI (Python) (async, OpenAPI/Swagger, auth, middleware, validation).
- Build service orchestration and integrations using Node.js where appropriate.
- Implement secure API patterns: authentication/authorization (Azure AD/RBAC patterns), rate‑limiting, caching, and error handling.
8) Frontend Engineering (React)
- Build modern UIs in React for AI applications (agent chat UI, dashboards, workflow screens).
- Support streaming responses, citations, session memory, feedback capture, and user analytics.
- Containerize services using Docker and deploy on Kubernetes (AKS preferred).
- Implement scaling, rollouts, secrets/config management, ingress, and reliability patterns.
- Set up monitoring/telemetry using Azure Monitor/App Insights (or equivalent), alerts, and runbooks.
Skills and Qualifications
Mandatory Certifications (Must)
Core Technical Skills
- Agents/Frameworks: Strong hands‑on experience with LangChain, LangGraph, and Microsoft Agent Framework.
- LLMOps: Strong experience with Langfuse for tracing/evaluation/monitoring (or equivalent tooling, with Langfuse preferred).
- Programming: Strong Python; API development with FastAPI; Node.js for…