Lead Data Engineer; Data Engineer Job Bloomington area,Indiana USA,IT/Tech

Position: Lead Data Engineer (Data Engineer)

Job Summary

Performs advanced data management tasks, including complex data modeling, conversion, de‑duplication, migration, and identification and repair of data quality issues.
Designs, develops, and implements complex custom data systems and advanced reconciliation tools, processes, rules, solutions etc. to validate data, match/merge, and upload batch lists.
Creates and tunes highly complex stored procedures and queries for advanced data management and extraction.
May contribute to committees and communities of practice to share and improve data engineering practices across the university; provides a high level of consultation and mentoring to other groups and staff on the use of data engineering tools and software.
Makes recommendations to improve, as well as implements, documentation and security protocols and procedures for data engineering projects and/or activities; fixes complex problems and resolves issues accordingly.
Provides advanced troubleshooting and problem analysis/resolution for data related issues; acts as a point of escalation for junior team members; performs advanced scripting and modifications of application and products for corrective action.
Performs advanced‑level research and stays up‑to‑date with data engineering best practices and approaches; stays abreast of latest security threats and risks to proactively address potential exposures.
May serve as project lead; often provides guidance to junior peers.

Qualifications

Combinations of related education and experience may be considered. Education beyond the minimum required may be substituted for work experience. Work experience beyond the minimum required may be substituted for education.

Education

Bachelor's degree (preferably in computer science, information science, or related field)

Work Experience

5 years data management, engineering, or related experience

Preferred Education / Experience

Experience contributing to communities of practice or cross‑functional innovation teams is a plus.

Skills Required

Proficient written and verbal communication skills
Maintains a high degree of professionalism
Demonstrated time management and priority setting skills
Demonstrates a high commitment to quality
Possesses flexibility to work in a fast paced, dynamic environment
Seeks to acquire knowledge in area of specialty
Highly thorough and dependable
Demonstrates a high level of accuracy, even under pressure
Possesses a high degree of initiative
Ability to influence internal and/or external constituents

Preferred

Familiarity with conventional machine learning models and processes, including classification, clustering, and retrieval.
Demonstrated experience integrating AI/LLM models (e.g., LLaMA via Ollama, OpenAI, Claude, or Amazon Bedrock) into enterprise platforms and services.
Ability to process data and engineer features for downstream model integrations.
Experience with prompt engineering, inference pipelines, and APIs for deploying AI agents at scale.
Experience with fine‑tuning and/or supervising model adaptation to domain‑specific data and use cases.
Experience with agentic frameworks such as Langgraph, Diffy, or related technologies.
Knowledge on tracking data and model lineages including the ability to ensure security and compliance of the deployed models to comply with appropriate governance, to manage role‑based access to services, and to apply data masking where necessary.
Ensuring security and compliance of the deployed models to comply with appropriate governance, managing role‑based access to services, and applying data masking where necessary.
Experience building or consuming data virtualization layers to abstract complex enterprise data systems for AI consumption.
Strong understanding of modern data architectures (data lakes, lake houses, vector stores) and API‑first approaches to data access and transformation.
Familiarity with vector databases (e.g., Marqo, Milvus, Pinecone) and embedding models for semantic search and retrieval‑augmented generation (RAG).
Proficiency in building and maintaining RESTful APIs that interface with internal systems, enabling AI‑enhanced features in existing applications.
Experience with cloud services (Azure, AWS),…