Principal Research Data Engineer
Listed on 2026-06-19
IT/Tech
Data Engineering, Data Analyst, Machine Learning/ML Engineer, Data Scientist
Oversee the design, development, testing, and implementation of research data pipelines that produce data layers and store research data. Develop and maintain scalable, data-intensive processing pipelines that apply geospatial data to machine learning and deep learning models. Architect, build, and launch new data models to provide intuitive analytics to business users. Develop infrastructure to report on key metrics, recommend changes, and predict future results.
Develop proofs of concept for new pipelines, in collaboration with diverse research partners, for integration into the science data pipeline.
- Design, develop, test, and implement scalable geospatial data integration pipelines.
- Build and maintain data pipelines that process raster and vector geospatial datasets for machine learning model generation and deployment.
- Package and deploy models and data pipelines using CI/CD practices (Python, Conda, Docker, Airflow, Git).
- Use Google Cloud Platform services (Cloud Functions, BigQuery, Dataproc) to process data at scale.
- Work with data formats such as Avro, Parquet, CSV, GeoTIFF, and GeoJSON.
- Write and optimize SQL queries; perform online analytical processing (OLAP) on relational (RDBMS) and NoSQL databases.
- Ingest and process geospatial data using QGIS, ArcGIS, and PostGIS.
- Master’s degree in Information Science, Computer Science, Data Science, Data Analytics, or a closely related field.
- 5+ years of experience designing, developing, testing, and implementing scalable geospatial data integration pipelines.
- Proficiency with Python, Conda, Docker, Airflow, Git, and CI/CD workflows.
- Experience with Google Cloud Platform, including Cloud Functions, BigQuery, and Dataproc.
- Strong knowledge of data formats and geospatial tools (Avro, Parquet, CSV, GeoTIFF, GeoJSON, QGIS, ArcGIS, PostGIS).
- Experience with SQL and query optimization in relational and NoSQL databases.
- Ability to build proofs of concept and collaborate with cross-functional research partners.
- Competitive salary range of $142,000 – $185,000 per year.
- Potential bonus or commission.
- Health care, vision, and dental coverage.
- Retirement plan.
- Paid time off (PTO) and sick leave.
St. Louis, Missouri (telecommuting permitted from anywhere in the United States).
Job Reference
Reference Code: 864075
Equal Opportunity Employer
Bayer is an Equal Opportunity Employer. Bayer is committed to providing access and reasonable accommodations in its application process for individuals with disabilities. Bayer is an E‑Verify Employer.