AI Research Scientist
Listed on 2026-02-16
-
IT/Tech
Data Scientist, Artificial Intelligence -
Research/Development
Data Scientist, Artificial Intelligence
Wirestock is one of the leading data platforms for ethically sourced multimodal data. We serve some of the world’s top AI labs, including several foundation models, by providing high-quality, fully licensed training datasets. As the AI data landscape undergoes a major shift, we are scaling rapidly to meet rising demand for curated visual data.
Role DescriptionAt Wirestock, we believe that the next leap in AI performance won't come from larger models, but from better data. We are looking for an AI Research Scientist who is obsessed with the Data half of Algorithms. Your role is to be the technical architect of our data lab, identifying what data the world's leading AI labs need before they even realize they need it.
WhatYou Will Own The Research Frontier
You will be our SOTA scout. You’ll spend your time dissecting the latest models and papers to identify gaps and then design the dataset that fixes it.
Dataset ArchitectureYou will design the ground truth for multi-modal systems. This means architecting how we use real world data to create datasets that don't just look good, but are mathematically superior for training.
Technical DiplomacyYou will be the peer-level contact for researchers at top-tier AI labs, helping them develop the next big models.
Validation & PrototypingYou will run data experiments using synthetic tools and our global creator network, you’ll build small-scale proof of concept datasets to validate your hypotheses before we scale them to millions of assets.
What You Will NOT Do Model TrainingYou are here to build the engine’s fuel, not the engine itself. We leave the model training to our clients so that you can focus 100% of your intellectual energy on data innovation.
Who You Are The Multimodal ExpertYou have a Master’s or Ph.D. in CS/ML or equivalent industry research experience and can talk at length about:
- Dataset bias measurement
- Synthetic vs real data mixing
- Evaluation protocols
You believe that a perfectly curated 10k‑video dataset is worth more than a billion‑scale “scrape” of the internet.
The Creative ScientistYou are as comfortable discussing the physics of light and motion as you are writing PyTorch scripts to validate CLIP‑alignment.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).