More jobs:
Principal GenAI Technical Architect
Job in
San Jose, Santa Clara County, California, 95199, USA
Listed on 2026-05-24
Listing for:
Jansoft Global
Full Time
position Listed on 2026-05-24
Job specializations:
-
Software Development
AI Engineer, Cloud Engineer - Software
Job Description & How to Apply Below
San Jose, United States | Posted on 05/20/2026
We are seeking an experienced Principal GenAI Technical Architect to lead the design and development of scalable enterprise AI platforms and Generative AI applications. The ideal candidate will have deep expertise in Python-based backend development, Large Language Models (LLMs), AI workflow orchestration, vector databases, and cloud-native deployment architectures.
This role requires strong technical leadership, hands‑on architecture experience, and close collaboration with customer stakeholders and engineering teams to deliver secure, scalable, and high‑performing AI solutions.
Key Responsibilities- Own end-to-end solution architecture and technical design for enterprise AI applications.
- Design and develop scalable backend APIs using Python and FastAPI.
- Build and optimize Generative AI applications leveraging Large Language Models (LLMs).
- Develop and manage AI agent workflows using Lang Chain and Lang Graph.
- Implement Retrieval‑Augmented Generation (RAG) architectures using FAISS and vector databases.
- Design asynchronous and event‑driven systems for real‑time AI interactions and chat platforms.
- Implement secure authentication and authorization mechanisms using OAuth
2.0 and JWT. - Lead code reviews, mentor developers, and establish engineering best practices.
- Improve platform resiliency through retries, backoff strategies, failure recovery, monitoring, and observability.
- Collaborate directly with customer leadership and cross‑functional engineering teams.
- Deploy and manage containerized applications on Open Shift/Kubernetes environments.
- Support CI/CD automation and cloud‑native deployment strategies.
- Optimize application scalability, performance, and fault tolerance.
- Strong experience with Python and FastAPI.
- Expertise in Large Language Models (LLMs) and Generative AI platforms.
- Hands‑on experience with Lang Chain and Lang Graph.
- Experience with FAISS and Vector Databases.
- Strong knowledge of REST APIs and Web Socket communication.
- Experience with Redis (cache, streams, pub/sub).
- Knowledge of asynchronous programming and event‑driven architectures.
- Experience with OAuth
2.0 and JWT security implementation. - Experience with Mongo
DB, SOLR, Teradata, or other relational/No
SQL databases. - Experience with Open Shift, Kubernetes, containers, and CI/CD pipelines.
- Strong understanding of scalable distributed systems and cloud‑native architectures.
- Excellent leadership and stakeholder communication skills.
- Experience building enterprise AI copilots or agentic AI systems.
- Familiarity with observability and monitoring tools for AI workflows.
- Prior experience working with large enterprise customers.
- Experience leading distributed engineering teams.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×