
Senior Research Data Engineer

Job in Poland, Androscoggin County, Maine, 04274, USA
Listing for: The Rundown AI, Inc.
Full Time position
Listed on 2026-02-16
Job specializations:
  • IT/Tech
    Data Engineer, Data Scientist
Salary/Wage Range or Industry Benchmark: 60000 - 80000 USD Yearly
Job Description & How to Apply Below
Location: Poland

Overview

Meet DeepL

DeepL is a global communications platform powered by Language AI. Since 2017, we’ve been on a mission to break down language barriers. Our human-sounding translations and intelligent writing suggestions are designed with enterprise security in mind. Today, they enable over 100,000 businesses to transform communications, reach new markets, and improve productivity, and they empower millions of individuals worldwide to make sense of the world and express their ideas.

Our goal is to become the global leader in Language AI, building products that drive better communication, foster connections, and make a real-life impact. To achieve this, we need talented individuals like you to join our exciting journey. If you're ready to work with a dynamic team and build your career in the fast-moving AI space, DeepL is your next destination.

What Sets Us Apart

What sets us apart is our blend of modern technology, competitive benefits, and an open, welcoming work culture that enables our people to thrive. When we share what it's like to work at DeepL, the reactions are overwhelmingly positive. This may be because of our products that have helped countless people worldwide or our shared mission to improve communication for individuals and businesses, bringing cultures closer together.

What we know for sure is this: being part of DeepL means joining a team dedicated to innovation and employee well-being. Discover what our teams have to say about life at DeepL on LinkedIn, Instagram and our Blog.

Meet the team behind this journey

DeepL is renowned for its AI products, from language and translation to enterprise agents. At the core of these products are custom-built algorithms and models that are trained using data. The quality and volume of data are key factors in our success.

You will join our Foundation Model Training team. As a cross-functional team of research scientists and data engineers specialising in machine learning, we develop foundation models and manage the pre-training corpora and associated data preparation pipelines. We work with unstructured data on a petabyte scale. This is a fast-paced and highly competitive field where we face challenging problems at the frontier of research and engineering.

Your Responsibilities
  • Work on ambitious frontier research projects as part of a foundation model training research team consisting of research scientists and research data engineers.
  • Architect, design and build scalable data pipelines from the ground up, e.g. for downloading and preparing multimodal unstructured data for training.
  • Build on top of a modern tech stack incl. Kubernetes, Dask, Ray, etc., and make extensive use of actively developed open-source solutions, debugging low-level issues where needed and potentially submitting fixes upstream.
  • Deploy complex Python data solutions to cloud infrastructure, incl. AWS and company data centers (on-prem), where you will own the operation of data processing at massive scale.
  • Go beyond “Big Data” and ETL. You will engineer and operate large-scale data solutions for real-world unstructured data incl. text, code, image and audio modalities.
  • Collaborate with stakeholders, research scientists, other research data engineers and data tooling and platform teams.
  • Raise the standard for excellence and act as owner and champion for the quality and availability of our foundation model training data.
  • Ensure mission-critical reliability of data pipeline jobs, maintain high-quality code with documentation and provide a great data product user experience.
Qualities we look for
  • Degree in a scientific or technical field.
  • Previous work experience as a Data Engineer or in a similar data- and engineering-centric role in a scaled-up tech company with a focus on large-scale unstructured data.
  • Extensive experience with large-scale data engineering, writing high-quality Python code and leveraging the full Python data ecosystem in cloud deployments.
  • Exploratory data analysis, data cleaning, data validation, and ideally ML feature engineering for text and other unstructured data.
  • Developing, testing and deploying data pipelines and infrastructure.
  • End-to-end ownership of data solution…
Position Requirements
10+ Years work experience