More jobs:
AI Architect
Job in
Town of Poland, Jamestown, Chautauqua County, New York, 14701, USA
Listed on 2025-12-18
Listing for:
EBM Software
Full Time
position Listed on 2025-12-18
Job specializations:
-
Software Development
AI Engineer, Data Scientist
Job Description & How to Apply Below
AI ARCHITECT - Role Overview
We are seeking an experienced AI Architect to lead the design, development, and production deployment of autonomous multi-agent systems. You will move beyond simple chatbots to build stateful, goal-oriented agentic workflows that can reliably execute complex business logic.
This role can be remote in Greece or Poland, or hybrid in our Dublin office.
Key Responsibilities- Design multi-agent architectures (e.g., Supervisor-Worker, Hierarchical Teams) capable of breaking down complex user queries into sub-tasks.
- Define the state management strategy to ensure agents retain context, memory, and user intent across long-running workflows.
- Architect robust Retrieval-Augmented Generation (RAG) pipelines that allow agents to query proprietary data with high precision.
- Select and integrate appropriate LLM orchestration frameworks (e.g., Lang Graph, Auto Gen, CrewAI) based on use-case requirements.
- Implement tool-use capabilities (function calling), enabling agents to interact with internal APIs, databases, and third-party SaaS platforms safely.
- Develop guardrails and steering mechanisms (e.g., NeMo Guardrails, LMQL) to ensure agents stay "on-rails" and avoid hallucinations or unsafe actions.
- Optimize prompt engineering strategies (Chain-of-Thought, ReAct, Tree of Thoughts) for maximum reliability and minimum latency.
- Oversee the transition from prototype to production, ensuring code is modular, testable, and scalable.
- Implement evaluation frameworks (e.g., Ragas, Tru Lens, Deep Eval) to quantitatively measure agent performance, accuracy, and hallucination rates before deployment.
- Design observability dashboards (using tools like Lang Smith, Arize, or Datadog) to trace agent reasoning steps, token usage, and latency in real-time.
- Manage cost and performance trade-offs, implementing caching strategies and selecting the right model mix (e.g., routing simpler tasks to smaller/cheaper models like GPT-4o-mini or Llama
3).
- Languages:
Expert proficiency in Python; familiarity with Type Script is a plus. - LLM Frameworks:
Deep experience with Lang Chain and specifically agentic libraries like Lang Graph, Auto Gen, or Semantic Kernel. - Vector Databases:
Experience deploying and managing vector stores like Pinecone, Weaviate, Qdrant, or pgvector. - Model APIs:
Hands-on experience integrating OpenAI (GPT-4), Anthropic (Claude), and open-source models (via Ollama or vLLM).
- Experience containerizing AI applications (Docker, Kubernetes) for cloud deployment (AWS/Azure/GCP).
- Familiarity with serverless architectures for handling asynchronous agent tasks.
- Knowledge of API security standards (OAuth, API Keys) for securing agent tool access.
- Experience fine-tuning small language models (SLMs) for specific domain tasks to reduce costs and improve latency.
- Background in Graph RAG (using Knowledge Graphs alongside Vector DBs) for better reasoning capabilities.
- Experience dealing with structured outputs (using Pydantic/Instructor) to force LLMs to return valid JSON/Schematic data.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×