Lead Data Scientist, Gen AI
Job in
Chicago, Cook County, Illinois, 60290, USA
Listed on 2026-02-21
Listing for:
Caterpillar Brazil
Full Time position
Job specializations:
- IT/Tech: AI Engineer, Machine Learning/ML Engineer, Data Scientist, Data Analyst
Job Description & How to Apply Below
**Technology, Digital and Data**
**Job Description:**
**Your Work Shapes the World at Caterpillar Inc.**
When you join Caterpillar, you're joining a global team who cares not just about the work we do – but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here – we make it happen, with our customers, where we work and live.
Together, we are building a better world, so we can all enjoy living in it.
Cat Digital is the digital and technology arm of Caterpillar Inc., leveraging the latest technologies to build industry-leading digital solutions for our customers and dealers. With over 1.5 million connected assets worldwide, our teams use data, technology, advanced analytics, telematics, and AI capabilities to help our customers build a better, more sustainable world.
**Job Summary:**
Join the Gen AI Delivery team of Cat Digital and be responsible for bringing world-class GenAI capabilities to our products and services.
**What You Will Do:**
* **Model Development:** Fine-tuning, implementing, developing, and optimizing models used in generative AI solutions.
* **Algorithm Research:** Staying updated with the latest advancements in AI and machine learning algorithms and integrating them into the team's projects.
* **Experimentation:** Conducting experiments to test the performance of different models and approaches, and iterating based on the results.
* **Research and Implementation:** Researching agentic GenAI solutions and determining the best approach to implement them.
* **Training and Mentorship:** Providing guidance and mentorship to junior data scientists and other team members.
**What You Will Have:**
* **Business Statistics:** Knowledge of statistical tools and processes to describe results; ability to use statistical tools, including experiment design for LLM evaluation, to assist in making data-driven business decisions.
* **Analytical Thinking:** Knowledge of techniques to promote effective analysis; ability to determine the root cause of organizational problems and architect agentic workflows that decompose complex tasks into autonomous sub-tasks.
* **Machine Learning:** Knowledge of algorithms and principles; ability to develop and deliver Generative AI solutions, including Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and autonomous systems.
* **Programming:** Solid understanding of software engineering and clean code; ability to utilize Python and deep learning frameworks to conduct AI research, build rapid prototypes, and iterate on diverse Generative AI applications, including but not limited to agentic workflows.
* **Requirements Analysis:** Ability to elicit and record business requirements to ensure the success of AI-driven products, incorporating human-in-the-loop (HITL) reviews and safety guardrails for autonomous system development.
**Considerations for Top Candidates:**
* Proven ability to lead and guide a team of data scientists, providing technical mentorship and fostering a collaborative research environment.
* Graduate degree (Master's or Ph.D.) in Mathematics, Statistics, Computer Science, Engineering, or a related quantitative field.
* At least 3 years of professional experience in Data Science and Machine Learning.
* At least 1 year of professional experience in the Generative AI domain, specifically working with Large Language Models (LLMs).
* Advanced proficiency in Python for carrying out AI research and developing rapid prototypes for diverse AI applications.
* Foundational knowledge of LLMs as well as other AI domains such as computer vision and speech, plus exposure to agentic AI architectures.
* Experience evaluating generated content from Large Language Models using both automated metrics and human-in-the-loop methodologies.
* Solid understanding of AI observability tools like LangSmith, Langfuse, or AgentOps to monitor traces, identify logic bottlenecks, and debug agent reasoning.
* Hands-on experience with major cloud platforms (AWS, Azure, or GCP) for model development, data handling, and AI service…