Artificial Intelligence; AI Engineer - Backend Focus Job Rockville area,Maryland USA,Software Development

Position: Artificial Intelligence (AI) Engineer - Backend Focus

Job Title: Artificial Intelligence (AI) Engineer
- Backend Focus

Client: Federal Government

Location: Remote with occasional travel to the client site in Baltimore. Candidates must currently live within a commutable distance of the office.

Employment Type: W2 on Expedite Info Tech 's payroll. This position requires a Permanent resident or a U.S. citizen. The selected candidate will go through a Public Trust Clearance process.

About Expedite Info Tech : Expedite Info Tech is a trusted federal contractor focused on leveraging emerging technologies to modernize systems, enhance security, and drive operational efficiency across government agencies. We work with clients across AI/ML, RPA, Cloud, Enterprise Architecture, Cybersecurity, Health IT infrastructure, and Federal Financial systems—all delivered through a hands-on, collaborative, results-driven approach. Our core values are collaborative innovation, quality service, and exceeding expectations.

)

Position Summary: A backend-focused AI engineer responsible for developing secure, scalable, and production-grade AI applications, with deep experience in LLM integration, retrieval-augmented generation (RAG) pipelines, including Graph-RAG, Agentic AI, and cloud-based LLM Ops workflows. The role emphasizes Amazon Sage Maker Studio, ECS, ECR, lambdas, Agentic Core, APIs, Open Search Vector DB, and Dynamo DB for operationalizing GenAI-powered Digital Products within FedRAMP-compliant AWS environments.

Responsibilities:

AI Solution Development:

Expert hands-on building of RAG and Graph-RAG architectures to handle multiple complex data formats (PDF, images, tables, Word documents, Excel, acronyms, attachments, etc.) to create cleansed standardized data for hydration into a vector database.
Expert hands-on knowledge on text embeddings, image embeddings, chunking logic, metadata creation, and embedding vectors indexing.
Expert hands-on knowledge in creating a highly accurate RAG retrieval system with knowledge on reranking, semantic search, similarity search, hybrid search, etc., to search by text or images.
Implement secure, scalable, highly accurate RAG, Agentic AI pipelines using Lang Chain, Strands, MCP, A2A frameworks, or AWS-native services like Bedrock, Agentic Core, Open Search Vector Database, and Knowledgebase.
Create backend infrastructure for chatbot applications with long-term and short-term memory capabilities to improve user experience.
Hands-on knowledge of creating APIs, Graph-RAG, develop agentic AI workflows with MCPs, A2A, and Skills.

AI/ML Skills:

Experience operationalizing AI/ML pipelines in Sage Maker Studio with model governance
Experience with Amazon
- Bedrock, Agentic Core, Open Search Vector Database, knowledgebase, lambda, API Gateway, FASTAPIs or Flask, SQS, SNS, Step functions, DynamoDB, RDS/Postgres SQL, ECS, ECR, IAM, Cloud Watch, and EKS or Fargate.
Frameworks:
Lang Chain, Lang Fuse, Llama Index, Strands, RAGAS, CrewAI, MCP, and A2A.
Prompt engineering, LLM evaluation methodologies, bias detection, and hallucination detection.

LLM Integration & LLM Ops:

Integrate multiple LLMs via APIs (AWS Bedrock: Anthropic
- Claude, Titan, Llama, Stability Diffusion models)
Implement structured prompt engineering frameworks, response evaluation tools, and feedback loops
Build model optimization layers, including prompt selectors, model switchers, and cache layers

Cloud Infrastructure & Deployment:

Deploy AI services using Sage Maker, ECS, Lambdas, Agentic Core, and Elastic Load Balancers
Containerize backend systems with Docker and deploy to scalable environments using ECS/EKS
Implement CI/CD pipelines via Git Hub Actions integrated with AWS Systems Manager and Code Pipeline
Architect solutions for VPC isolation, IAM hardening, and FedRAMP High compliance

System Integration & Maintenance:

Integrate AI workflows with enterprise databases, legacy platforms, and identity providers
Monitor service performance, GPU utilization, and system health via Cloud Watch and custom logging
Build automated testing pipelines for model accuracy, bias detection, and system robustness
Maintain technical documentation and developer runbooks for long-term system support

Work Environment:

Remote-first with collaborative engagements and occasional client travel
Mission-focused development aligned with executive priorities
Continuous learning and rapid prototyping of cutting-edge AI technologies
Agile delivery culture with strong cross-functional collaboration

Required:

Required / Minimum Qualifications

10+ years of IT experience.
3+ years of experience as an AI Engineer
3+ years of experience in AWS
AWS Services: Graph RAG, Bedrock Agentic Core, Agentic AI, EC2 (GPU-enabled), Sage Maker (Studio, Pipelines, Endpoints, Model Registry), Bedrock, Open Search Vector DB, Systems Manager, Load Balancers, Amazon
- Bedrock, Open Search Vector Database, knowledgebase, lambda, API Gateway, FASTAPIs or Flask, SQS, SNS, Step functions, DynamoDB, RDS/Postgres SQL, and EKS or Fargate
Proficient in coding: Python (async, FastAPI,…