Data Engineer- Python, AI/ML
Job in
Troy, Oakland County, Michigan, 48083, USA
Listed on 2026-05-24
Listing for:
Motion Recruitment
Full Time
position Listed on 2026-05-24
Job specializations:
-
IT/Tech
Data Engineer, Data Analyst, Data Scientist, Data Science Manager
Job Description & How to Apply Below
- Build and maintain Python and SQL pipelines for governance-related ingestion, cleaning, transformation, and validation of structured and semi-structured data.
- Implement and operate data quality checks, schema validation, and integrity rules across pipelines; investigate and resolve quality issues.
- Contribute to master data workflows: standardization, deduplication, and consolidation of data from heterogeneous sources into consistent reference and golden-record datasets.
- Instrument pipelines for data lineage, metadata, and catalog tooling.
- Develop pipelines that feed governance dashboards and reporting in Tableau, Power BI, or Looker.
- Build reproducible, well-documented pipelines for compliance and audit reporting.
- Contribute to AI / ML-assisted governance use cases: embedding-based data classification, anomaly detection on quality metrics, LLM-assisted catalog search, and MCP-based exposure of governed datasets to AI assistants.
- Partner with team leads, data stewards, and stakeholders to translate governance requirements into engineering work.
- Follow team engineering practices:
Git, code review, modular pipeline design, automated testing, CI/CD.
- Bachelor's or Master's degree in Computer Science, Data Science, Engineering, Statistics, or a related field.
- 2+ years building data pipelines in Python (Pandas, Num Py, Sci Py) and SQL.
- Working experience with Apache Spark or PySpark and workflow orchestration (Apache Airflow).
- Schema design across relational (Postgre
SQL, MySQL, SQL Server) and analytical databases, including standardization across heterogeneous sources. - Experience implementing data quality validation, EDA, and integrity enforcement on production datasets.
- Hands-on experience with at least one major cloud platform (AWS, Azure, or GCP).
- Working familiarity with Python ML libraries (Scikit-Learn) for feature engineering and exploratory analysis.
- Experience producing analytics-ready datasets for BI tools (Tableau, Power BI, or Looker).
- Git, code review, and CI/CD practices.
- Clear technical communication and collaborative working style.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×