Data Integration Engineer Job Mission area,Kansas USA,IT/Tech

OUR MISSION

We exist to create a more connected, compassionate, and confident experience for people with cancer and those who care for them. We make it easier to get answers, access high-quality care quickly, and feel supported throughout treatment and beyond.

Today, Thyme Care is a market-leading value-based oncology care enabler, partnering with national and regional health plans, providers, and employers to deliver better outcomes and lower costs for thousands of people across the country. Our model combines high-touch human support with powerful technology and AI to bring together everyone involved in a person’s cancer journey: caregivers, oncologists, health plans, and employers.

As a tech-native organization, we believe technology should strengthen the human connection at the center of care. Through data science, automation, and AI, we simplify complexity, improve collaboration, and help care teams focus on what matters most: supporting people through cancer.

Looking ahead, our vision is bold: to become a household name in cancer care, where every person diagnosed asks for Thyme Care by name. If you’re inspired to make cancer care more human and to help reimagine what’s possible, we’d love to meet you. Together, we can build a future where every person with cancer feels truly cared for, in every moment that matters.

WHAT

YOU’LL DO

As a Data Integration Engineer on our Data Ingestion & Care Enablement (DICE) team, you’ll be part of the horizontal layer that keeps partner and vendor data flowing reliably into Thyme Care. In this position, you will collaborate closely with our Product Manager and other Data teammates focused on data ingestion and analytics engineering. You’ll work across many deals and data sources, supporting Data Scientist deal owners by making ingestion consistent, debugging failures, and raising the reliability bar through better tests, monitoring, and data contracts.

In the course of your work you should also expect to:

Gain a deep understanding of our data platform and contribute to improving our data models and pipelines using SQL, dbt, and python (generally data-focused packages, e.g., pandas, polars)
Support ingestion of a wide range of healthcare-related sources (claims, eligibility, prior auth, ADT, etc.) by
- Configuring net-new ingestions (parsing file specs, validating assumptions, communicating inconsistencies)
- Debugging issues in ongoing ones
- Helping standardize our processes and pipelines
Collaborate with data scientist deal owners and internal stakeholders to turn messy, ambiguous requirements into concrete mapping/validation logic and durable data contracts
Use Dagster and Git Hub Actions to orchestrate and automate the early stages of our data pipelines, improving run reliability and reducing manual intervention
Work hands-on with raw data using Jupyter Notebooks in Databricks to investigate data issues, validate assumptions, and unblock processing
Design and support incremental data loads (append/merge/upsert patterns) and safe reprocessing (idempotent runs, late-arriving data, backfills)
Learn to use Datadog and Pager Duty to monitor pipelines, triage incidents during business hours, communicate impact clearly, and drive root-cause fixes to prevent recurrences
Contribute to a complex, self-hosted dbt monorepo: implement transformations, incremental models, tests, documentation, and conventions that scale across deals

WHAT YOU HAVE

Strong SQL skills
Familiarity with dbt (and an interest in ramping up your expertise), including working in larger/complex projects
Working knowledge of Python for data investigation in notebooks
Experience operating data pipelines: debugging failures, tracing issues across systems, and communicating clearly about root cause and mitigation
Experience with testing and data quality: writing and maintaining tests and using failures/alerts to drive durable fixes
Responsiveness and the ability to stay calm and organized when triaging failing ingestion runs or pipelines
Willingness to learn new domains and tools quickly (new partner file formats, evolving standards, Databricks), and apply feedback without ego
The ability to engage technical and…