Senior AI Engineer
Listed on 2026-05-16
Software Development
AI Engineer, Machine Learning/ML Engineer
Overview
Location:
Cleveland, OH (On-site, Remote)
Type:
Full-time
Posted: 8 hours ago
Job Description
Senior AI Engineer
Further is seeking a Senior AI Engineer to lead the development of our commercial AI products, which are built on Google Cloud. You will bridge the gap between experimental data science prototypes and production-grade software, architecting the robust systems and LLMOps workflows needed to turn AI models into reliable, enterprise-ready applications. By designing stable backend architectures and seamless integration layers, you will ensure our AI solutions are not merely functional but ethical, efficient, high-value products that meet the rigorous demands of our commercial clients.
Experience Required
- 6+ years of software engineering experience with at least 3 years dedicated to AI/ML application development.
- Expert proficiency in Python AI application development and modern API architecture (REST, GraphQL, gRPC) using enterprise standards like static type checking and data validation.
- Deep experience building production applications with LLM frameworks such as LangChain, LangGraph, or LlamaIndex.
- Hands-on expertise with vector databases (Pinecone, Weaviate, PostgreSQL) and search algorithms.
- Strong understanding of LLMOps principles, including model registry, versioning, and serving infrastructure, specifically in Google Cloud.
- Experience in TypeScript development for prototyping and integrations.
- Proficiency with Git workflows and understanding of standard application development processes.
- Knowledge of advanced prompt engineering and fine-tuning techniques (LoRA, PEFT).
- Experience optimizing inference costs and latency for large-scale deployments.
- Previous experience in a client-facing consulting role, managing diverse stakeholders and navigating complex organizational structures.
Responsibilities
- Define the end-to-end architecture for AI products on Google Cloud Platform (GCP), ensuring high availability, security, and cost-effectiveness.
- Architect and develop high-performance backend services and APIs using Python (FastAPI) to serve large language models at scale.
- Design advanced Retrieval-Augmented Generation (RAG) systems, selecting and managing vector databases and optimizing embedding strategies for accuracy and speed.
- Build robust integration layers that connect AI agents securely to external enterprise systems, CRMs, and legacy databases.
- Conduct code reviews, provide technical guidance, and foster a culture of continuous learning and innovation within the engineering team.
- Collaborate with infrastructure teams to define deployment strategies, ensuring solutions scale dynamically under load.
- Lead the implementation of rigorous evaluation frameworks to monitor model performance, drift, and cost in real time.
- Develop reusable internal libraries, architectural patterns, and standards to accelerate the delivery of AI solutions across multiple client engagements.
- Mentor engineers on best practices for building deterministic software around probabilistic AI models.
Our total rewards program includes a net-zero-cost medical option, company contributions to your HSA, fertility support, fully paid parental leave, a monthly stipend for your lifestyle spending account, and much more.