AI Engineer
Job in
San Jose, Santa Clara County, California, 95199, USA
Listed on 2026-06-03
Listing for:
Exaways Corporation
Full Time
position Listed on 2026-06-03
Job specializations:
-
Software Development
AI Engineer, Software Engineer, Machine Learning/ ML Engineer, Cloud Engineer - Software
Job Description & How to Apply Below
Location:
San Jose, CA / Dallas, TX (Hybrid)
Mode:
Contract
Client: iOPEX / Cisco
Mandatory
Skills:
AI, Python, GenAI, SQL / PL-SQL, LLM, RAG, Lang Chain, Lang Graph
Job Description:
Software Engineering & System Design
- Write, test, and deploy production-quality code with a strong focus on scalability and performance.
- Design and implement end-to-end systems including API, database, and application layers.
- Consider memory usage, performance optimization, and modular architecture in code design.
- Follow modern Dev Ops practices for deployment and environment management.
- Write clean, modular Python code with proper environment management (virtualenv, poetry, conda).
- Build and deploy RESTful APIs using frameworks like FastAPI or Flask.
- Connect applications to relational and No
SQL databases. - Containerize applications using Docker and deploy in Kubernetes / cloud environments.
- Write and optimize SQL and PL/SQL queries for data retrieval and transformation.
- Integrate database logic effectively into application workflows.
- Understand how Large Language Models (LLMs) work - architecture, limitations, and tuning methods.
- Design and implement RAG (Retrieval-Augmented Generation) systems:
- Build pipelines connecting vector databases, LLMs, and application layers.
- Optimize retrieval and response accuracy through prompt engineering and context management.
- Apply best practices and advanced RAG techniques (chunking strategies, caching, query re-ranking).
- Develop and manage prompt templates, parameter configurations, and evaluation metrics.
- Understand AI agents, their lifecycle, and how they integrate with tools and APIs.
- Implement tool calling and human-in-the-loop workflows.
- Design multi-agent architectures using frameworks such as Lang Chain or Lang Graph.
- Apply agent orchestration best practices, ensuring reliability, observability, and control.
- Strong understanding of software design principles and ability to think like a developer (language agnostic).
- Proficiency in Python with good exposure to building APIs, database integration, and container-based deployment.
- Working knowledge of SQL / PL-SQL for querying and joining data.
- Experience in developing or integrating GenAI or RAG-based systems using OpenAI, Anthropic, or similar models.
- Familiarity with Lang Chain, Lang Graph, or similar frameworks for AI workflow orchestration.
- Strong debugging, optimization, and analytical skills.
- Excellent communication and collaboration abilities in a fast-paced environment.
- Exposure to cloud platforms (AWS, GCP, Azure) for AI workload deployment.
- Experience with vector databases (FAISS, Pinecone, Weaviate, or Chroma).
- Knowledge of monitoring and observability tools for AI systems.
- Familiarity with CI/CD pipelines, Git Ops, and infrastructure automation.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×