Senior AI Engineer Job Cleveland area,Ohio USA,Software Development

Overview

Location:

Cleveland, OH (On-site, Remote)

Type:
Full-time

Posted: 8 hours ago

Job Description

Senior AI Engineer

Further is seeking a Senior AI Engineer to lead the development of our cloud-based (Google Cloud) commercial AI products. You will bridge the gap between experimental data science prototypes and production-grade software. You will architect the robust systems and LLMOps workflows necessary to transform AI models into reliable, enterprise-ready applications. By designing stable backend architectures and seamless integration layers, you will ensure our AI solutions are not just functional, but ethical, efficient, and high-value products that meet the rigorous demands of our commercial clients.

Experience

Required

6+ years of software engineering experience with at least 3 years dedicated to AI/ML application development.
Expert proficiency in Python AI application development and modern API architecture (REST, Graph

QL, gRPC) using enterprise standards like static type checking and data validation.
Deep experience building production applications with LLM frameworks such as Lang Chain, Lang Graph or Llama Index.
Hands-on expertise with vector databases (Pinecone, Weaviate, Postgre

SQL) and search algorithms.
Strong understanding of LLMOps principles, including model registry, versioning, and serving infrastructure specifically in Google Cloud.
Experience in Typescript development for prototyping and integrations.
Proficiency with git workflows and understanding of standard application development processes.

Preferred Qualifications

Knowledge of advanced prompt engineering and fine-tuning techniques (LoRA, PEFT).
Experience optimizing inference costs and latency for large-scale deployments.
Previous experience in a client-facing consulting role, managing diverse stakeholders and navigating complex organizational structures.

Responsibilities

Define the end-to-end architecture for AI products on Google Cloud Platform (GCP), ensuring high availability, security, and cost-effectiveness.
Architect and develop high-performance backend services and APIs using Python (FastAPI) to serve large language models at scale.
Design advanced Retrieval-Augmented Generation (RAG) systems, selecting and managing vector databases and optimizing embedding strategies for accuracy and speed.
Build robust integration layers that connect AI agents securely to external enterprise systems, CRMs, and legacy databases.
Conduct code reviews, provide technical guidance, and foster a culture of continuous learning and innovation within the engineering team.
Collaborate with infrastructure teams to define deployment strategies, ensuring solutions scale dynamically under load.
Lead the implementation of rigorous evaluation frameworks to monitor model performance, drift, and cost in real-time.

First Year Goals

Develop reusable internal libraries and architectural patterns and standards to accelerate the delivery of AI solutions across multiple client engagements.
Mentor engineers on best practices for building deterministic software around probabilistic AI models.

Benefits

Total rewards program includes net-zero cost medical option, company contributions to your HSA, fertility support, fully-paid parental leave, a monthly stipend for your lifestyle spending account, and much more.

#J-18808-Ljbffr