Senior Data Ingestion & Streaming Platform Engineer
Listed on 2026-05-30
Software Development
Data Engineer
Datavant is the data collaboration platform trusted for healthcare. Guided by our mission to make the world’s health data secure, accessible and actionable, we provide critical data solutions for organizations across the healthcare ecosystem, including providers, health plans, researchers, and life sciences companies. From fulfilling a single patient’s request for their medical records to powering the AI revolution in healthcare, Datavant’s team is building the future of how data is connected and used to improve health.
By joining Datavant today, you’re stepping onto a driven and highly collaborative team that is passionate about creating transformative change in healthcare.
The Ingestion & Streaming team sits in our Data & Machine Learning Platform organization and owns the movement layer of Datavant’s data platform: batch and streaming pipelines, change data capture, document intake, and the self‑service frameworks product teams use to land new sources into Snowflake, our Iceberg‑backed lakehouse, and Databricks. Most data moves into the platform; some moves back out. Our job is to make both safe, fast, observable, and boring.
What You Will Do
- Design, build, and operate the ingestion frameworks that pull data from operational databases, vendor APIs, document streams, and third‑party feeds into Snowflake, Iceberg, and Databricks
- Own and evolve the ingestion stack (AWS DMS, MWAA / Airflow, Fivetran, and the homegrown tooling on top) and design new patterns for API sources that don’t fit a managed connector
- Build self‑service tooling so product engineers can onboard new sources without becoming experts in our infrastructure
- Write and review the Terraform behind our ingestion infrastructure: AWS networking, IAM, compute, and data services
- Partner with product, data, and analytics teams to pick the right ingestion pattern for each source (CDC, batch, API, streaming) and stand it up end‑to‑end
- Lead production troubleshooting and incident response, and turn each incident into a durable platform fix
- Raise the bar on engineering quality, observability, cost discipline, and security in everything the team ships
- Mentor mid‑career engineers and pull peers along through code review, pairing, and design feedback
What You Need to Succeed
- 6+ years in data engineering, platform engineering, or data‑focused software engineering
- 3+ years of hands‑on AWS with real strength in networking (VPC, subnets, routing, PrivateLink, security groups), IAM (roles, policies, permission boundaries), and the data services this role touches, plus the judgment to know when to reach for what
- 2+ years writing production Terraform or equivalent IaC, with experience owning modules, reasoning about state and blast radius, and shipping infrastructure changes safely
- 1+ years building self‑service tooling, internal platforms, or paved‑path frameworks consumed by other engineers
- Strong SQL skills and the ability to reason about how data physically lives in a warehouse or lake
- Production experience with Snowflake (or an equivalent cloud data warehouse) and a workflow orchestrator (Airflow / MWAA preferred)
- Hands‑on experience with at least one ingestion approach: CDC tooling (e.g., DMS, Debezium), managed connectors (e.g., Fivetran, Airbyte), or rolling your own pipelines for API sources
- Solid CI/CD discipline in GitHub or equivalent: branching, code review, automated checks, repeatable deployment
- AI‑native working style: daily use of Claude Code, Cursor, Copilot, or equivalent, with views on how they make a team faster
- Working knowledge of Python is expected; mastery isn’t the bar
- Clear written and verbal communication, especially in async, remote settings
What Helps You Stand Out
- Direct production experience with Iceberg or another open table format, especially bridging Snowflake and Databricks
- Hands‑on Databricks or Spark
- Kubernetes experience
- Snowflake certification(s)
- Azure experience (we’re primarily AWS, but our customers and acquisitions aren’t always)
- In‑depth experience integrating data systems with managed identity platforms, particularly via SCIM (SailPoint a plus)
- Prior experience in healthcare or another highly regulated industry such as finance
- Prior DBA, SRE, or DRE work operating production data systems under pressure
$150,000 – $190,000 USD
We are committed to building a diverse team of Datavanters who are all responsible for stewarding a high‑performance culture in which all Datavanters belong and thrive. We are proud to be an Equal Employment Opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, disability, veteran status, or other legally protected status.