Senior Data Scientist – AI Booster Team
in
10115 Berlin, Germany
Posted on 2026-01-08
Company:
idealo internet GmbH
Full-time, part-time
Professional specialization:
IT / Information Technology
Artificial Intelligence Engineer, Data Analyst, Data Scientist, Machine Learning
Job description
At idealo, Generative AI (GenAI) is becoming a multiplier across every team. The AI Booster Team is our internal technical competence center: we pair with product teams, build reusable GenAI building blocks and share best practices company-wide. We validate AI business cases through data and ship evaluation frameworks that turn pilots into production. As a Data Scientist you will translate ideas into evidence: designing experiments, measuring LLM quality, and unlocking the full value of idealo’s data assets to guide today’s and tomorrow’s GenAI initiatives.
This position is available full-time or part-time.
About your new role
- Quantify opportunities & run experiments - perform causal analyses using experiments and observational methods to evaluate the business impact of GenAI features.
- Own model evaluation pipelines - create metrics dashboards and human / AI-assisted reviews that benchmark LLM quality, cost and safety.
- Guide model selection - compare foundation models, fine-tunes and RAG setups, recommending the right balance of performance vs. cost.
- Champion data strategy - surface high-value datasets (product, pricing, behaviour) and advocate their use in current and future AI products.
- Pair & coach - work embedded with engineers and analysts, sharing best practices in experimentation, metrics, and GenAI evaluation.
- Harvest patterns - document reusable evaluation playbooks so every team can measure GenAI success consistently.
Skills and requirements
- 3+ years in data science / analytics, including A/B testing or causal inference at scale.
- Expert SQL and Python (pandas, statsmodels / SciPy, scikit-learn); comfortable with notebooks and BI tools for storytelling.
- Hands-on with LLM assessment - prompt / temperature sweeps, embedding similarity metrics, human-in-the-loop studies, and LLM-as-a-judge tools (e.g. Bedrock model evaluation, OpenAI Evals).
- Familiar with Generative AI stacks (Hugging Face, LangChain/LlamaIndex, vector DBs like Pinecone/Qdrant) and retrieval-augmented generation concepts.
- Proficiency in AWS analytics & MLOps: SageMaker Experiments / Pipelines, Bedrock, Athena, Lambda, Step Functions; able to automate evaluation workflows and cost dashboards.
- Strong communication: can turn complex findings into clear, actionable insights and coach cross-functional teams.
- We’re keen to see evidence of exceptional achievement - perhaps you’ve scaled a personal project to thousands of users, published influential research, ranked highly in competitive arenas (e.g. sports, Kaggle, hackathons) or maintain widely-used open-source libraries. Tell us what makes you stand out!
You don’t tick every single box? No worries! We hire people, not checklists, and value motivation to grow.
Job requirements
10+ years
Professional experience