
AI Data Engineer - Scientific Data Platforms; Remote

Remote / Online - Candidates ideally in
South San Francisco, San Mateo County, California, 94080, USA
Listing for: Astrix Technology
Full Time, Contract, Remote/Work from Home position
Listed on 2026-06-18
Job specializations:
  • Software Development
    AI Engineer (Applied/Software), Data Engineering
Salary/Wage Range or Industry Benchmark: 37 - 45 USD Hourly
Job Description & How to Apply Below
Position: AI Data Engineer - Scientific Data Platforms (Remote)
**AI Data Engineer - Scientific Data Platforms (Remote)**

Science & Research

South San Francisco, CA, US

Added: 15/06/2026

Pay Rate Low: 37 | Pay Rate High: 45

_Our client is a leading global biotechnology and pharmaceutical organization driven by a mission to innovate, continuously advance science, and ensure everyone has access to the healthcare they need._

**Title:** AI Data Engineer - Scientific Data Platforms

**Location:** Remote, must work PST

**Pay rate:** $37-45/hr (depends on experience level)

**Schedule:** Full-time (40 hours/week)

**Duration:** 1-year contract (plus benefits)

**Position Overview**

This role addresses a critical need in scaling our AI models for drug discovery by building largely automated, scalable, agent-driven data ingestion and curation pipelines for genomics data. This includes metadata inference, constructing performant query architectures, and transforming high-dimensional datasets (e.g., single-cell omics, clinical trials) into AI-ready training formats.

**Key Responsibilities**

+ Build an agentic data ingestion pipeline and move beyond bespoke steps toward agents that teams can reliably use as a shared, deployed service.

+ Triage and prioritize incoming requests to ingest specific datasets. Clean and organize data, building the first-pass cleaning and organization steps into the agentic flow.

+ Validate cross-modal linkage. Add automated checks that catch when ingested data does not connect correctly and flag low-quality or mismatched records.

+ Version every dataset, keeping prior versions retained and addressable. Preserve raw data and provenance, and ensure agent workflows log validation and transformation steps so lineage is fully traceable.

+ Partner with AI, software engineering, and computational biology groups to co-define data standards and conventions.

**Qualifications & Requirements**

+ Demonstrated experience building multi-agent or LLM workflows using tools/frameworks such as LangGraph or LlamaIndex, including tool/function calling and asynchronous task execution.

+ Strong Python skills for data manipulation, working with APIs and databases, and handling heterogeneous data formats.

+ Familiarity with dataset versioning approaches (e.g., DVC, lakeFS, or equivalent).

+ Comfortable with, or strongly willing to learn, common omics data formats such as AnnData, H5AD, and TileDB.

+ No deep bioinformatics expertise required; just a basic conceptual understanding of different modalities (e.g., RNA-seq vs. scRNA-seq vs. WES; genomics vs. transcriptomics vs. proteomics vs. metabolomics).

+ Comfortable writing unit and functional tests to ensure data processing workflows are reliable and reproducible.

+ Degree in a technical field or equivalent practical experience.

+ Must be authorized to work in the United States without sponsorship.

**Nice to Have**

+ Experience deploying agent workflows as a shared service (e.g., FastAPI or MCP endpoints).

+ Exposure to cloud platforms (AWS, GCP) and containerization (Docker).

+ Familiarity with scientific workflow managers such as Nextflow or Snakemake.

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law.