AI Architect Job Jamestown area,Town of Poland New York USA,Software Development

Location: Town of Poland

AI ARCHITECT - Role Overview

We are seeking an experienced AI Architect to lead the design, development, and production deployment of autonomous multi-agent systems. You will move beyond simple chatbots to build stateful, goal-oriented agentic workflows that can reliably execute complex business logic.

This role can be remote in Greece or Poland, or hybrid in our Dublin office.

Key Responsibilities

Architecture & System Design

Design multi-agent architectures (e.g., Supervisor-Worker, Hierarchical Teams) capable of breaking down complex user queries into sub-tasks.
Define the state management strategy to ensure agents retain context, memory, and user intent across long-running workflows.
Architect robust Retrieval-Augmented Generation (RAG) pipelines that allow agents to query proprietary data with high precision.
Select and integrate appropriate LLM orchestration frameworks (e.g., Lang Graph, Auto Gen, CrewAI) based on use-case requirements.

Engineering & Development

Implement tool-use capabilities (function calling), enabling agents to interact with internal APIs, databases, and third-party SaaS platforms safely.
Develop guardrails and steering mechanisms (e.g., NeMo Guardrails, LMQL) to ensure agents stay "on-rails" and avoid hallucinations or unsafe actions.
Optimize prompt engineering strategies (Chain-of-Thought, ReAct, Tree of Thoughts) for maximum reliability and minimum latency.
Oversee the transition from prototype to production, ensuring code is modular, testable, and scalable.

Production Operations (LLMOps)

Implement evaluation frameworks (e.g., Ragas, Tru Lens, Deep Eval) to quantitatively measure agent performance, accuracy, and hallucination rates before deployment.
Design observability dashboards (using tools like Lang Smith, Arize, or Datadog) to trace agent reasoning steps, token usage, and latency in real-time.
Manage cost and performance trade-offs, implementing caching strategies and selecting the right model mix (e.g., routing simpler tasks to smaller/cheaper models like GPT-4o-mini or Llama
3).

Technical Qualifications Core Tech Stack

Languages:

Expert proficiency in Python; familiarity with Type Script is a plus.
LLM Frameworks:
Deep experience with Lang Chain and specifically agentic libraries like Lang Graph, Auto Gen, or Semantic Kernel.
Vector Databases:
Experience deploying and managing vector stores like Pinecone, Weaviate, Qdrant, or pgvector.
Model APIs:
Hands-on experience integrating OpenAI (GPT-4), Anthropic (Claude), and open-source models (via Ollama or vLLM).

Infrastructure & Dev Ops

Experience containerizing AI applications (Docker, Kubernetes) for cloud deployment (AWS/Azure/GCP).
Familiarity with serverless architectures for handling asynchronous agent tasks.
Knowledge of API security standards (OAuth, API Keys) for securing agent tool access.

Nice-to-Haves (The "Edge")

Experience fine-tuning small language models (SLMs) for specific domain tasks to reduce costs and improve latency.
Background in Graph RAG (using Knowledge Graphs alongside Vector DBs) for better reasoning capabilities.
Experience dealing with structured outputs (using Pydantic/Instructor) to force LLMs to return valid JSON/Schematic data.

#J-18808-Ljbffr


Increase/decrease your Search Radius (miles)



Job Posting Language