Senior AI Curation Data Scientist
Listed on 2026-06-06
-
IT/Tech
AI Engineer, Data Scientist
The xCures platform helps improve clinical care via comprehensive, intelligent access to healthcare data on an AI-assisted platform. Delivered using a Software as a Service (SaaS) model, xCures enables healthcare organizations to quickly find and use clinical insights. Our platform enables true interoperability and provides enhanced capabilities for patient care and clinical operations.
About the role:Reporting to the VP of Data Science, the AI Curation Data Scientist will, using traditional computing and custom AI model training, work on mission-critical projects driven by xCure’s product development needs and will expand xCures’ complex, innovative health data processing, extraction, and analysis capabilities. We’re looking for an individual who values team-building, cooperation, and communications with colleagues to serve the needs of our customers.
You will reach out across the organization for guidance on engineering, clinical, PHI/PII, data policy, and business topics as needed. Projects will include significant data processing challenges, such as C-CDA XML parsing and de-identification of structured and unstructured EHR content. Equally important projects will address data curation and custom AI model training. You will author software and AI models and take the lead on data set curation.
You will serve as the anchor of Data Science data set quality assurance within an innovative, fast-moving team. Team responsibilities are key requirements for this position, which will deliver large and complex data products and data analysis tools.
This position is fully remote, but will coordinate very closely with a small team and thus requires excellent communication and coordination skills. Occasional travel is required.
This job is right for you if you like:- A high-energy start-up working with a brilliant and passionate team
- Working on problems that make a real difference in people’s lives
- Understanding and delivering on reliable and well-characterized products and deliverables within a highly innovative and fast-changing environment: clinical data extraction and aggregation, the relation between data processing and QA framework; LLM tuning and training, pedantic data curation, compute architecture, data exchange.
- Rockstar teammates: you will be working with a strong team with decades of prior work experience in artificial intelligence, software systems, molecular biology, and clinical medicine
- Innovation and problem solving to provide order-of-magnitude improvements in capabilities for data handling and analysis while maintaining traceable data and methods development
- Developing and testing data extraction and integration software for structured EHR content (XML, FHIR) and unstructured text content (attached documents)
- Planning, explaining, and implementing projects to curate data sets used in model training
- Maintaining understanding of current and new generative AI and transformer technologies
- Tuning and training LLMs
- Maintaining a strong understanding of PHI/PII and de-identification policies and strategies at xCures and implementing software solutions compliant with policies and strategies
- Developing and implementing tests of data extraction and aggregation performance to improve efficiency, timeliness, and cost-effectiveness
- Flexibly taking on technical leadership or participation roles per project
- Implementing and maintaining code repositories
- Working closely with manager to explore methods, test hypotheses, and collaboratively implement innovative solutions for data science
- Coordinating as required for a fully remote role
- Working with Engineering and other groups to improve overall company efficiency and effectiveness
- Ph.D., or equivalent experience in Computer Science, Software Engineering, Statistics, Biology, or related field
- Minimum of 10 years of hands-on experience in data science, machine learning, AI, data analysis, software development, and/or predictive analytics
- Expertise in generative AI and transformer models, especially training of LLMs
- Significant prior experience with curating data sets to train LLMs
- Significant hands‑on coding experience…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).