×
Register Here to Apply for Jobs or Post Jobs. X

Language Data Scientist

Job in Greenville, Greenville County, South Carolina, 29610, USA
Listing for: Synodex
Full Time position
Listed on 2026-03-10
Job specializations:
  • IT/Tech
    Data Scientist, AI Engineer
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below

Job Title: Language Data Scientist

Location: Fully Remote within the U.S. (excluding California, Washington, Alaska, Colorado, Montana, New York, Puerto Rico, Nevada, Nebraska)

Employment Type: Full-Time (40 hours per week) Fixed-Term

Who We Are

Innodata (NASDAQ: INOD) is a leading data engineering company with over 2,000 customers and operations in 13 cities worldwide. We are an AI technology solutions provider-of-choice for 4 out of 5 of the world’s biggest technology companies, as well as leading firms in financial services, insurance, technology, law, and medicine.

With advanced machine learning and AI technologies, a global workforce of subject matter experts, and a high-security infrastructure, we help usher in the promise of AI. Innodata offers a powerful combination of digital data solutions and easy-to-use, high-quality platforms.

Our global workforce includes over 7,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel, and Germany. We are poised for a period of explosive growth over the next few years.

About

The Role

Innodata is building a team of Language Data Scientists and Gen AI experts to help our customers advance GenAI applications. You will work hands‑on with multi‑modal and multi‑lingual datasets and collaborate with cross‑functional partners. You will use your experience with human and synthetic data workflows to drive innovation and continuous improvement. The ideal candidate must have the right mix of skills in (computational) linguistics and human evaluation tasks, data science, and data engineering.

Key Responsibilities
  • Design/improve workflows to create data for AI/ML training and evaluation, including human annotation, data collection, and synthetic workflows.
  • Dive deep into existing workflows and processes to gather data and insights, make recommendations, and drive improvement through innovation and cross‑functional collaboration with customers.
  • Critically assess annotation tooling and workflows.
  • Quantitatively analyze large datasets, perform statistical analysis, calculate metrics, and make recommendations to improve accuracy and performance.
  • Work closely with client stakeholders on understanding goals, gathering requirements, proposing solutions, and executing them.
Qualifications
  • Familiarity with social media platforms and cultural context with North American (Canada, USA) trending social media content.
  • Familiarity with language use in online spaces, particularly language trends and innovations, and a nuanced interpretation of social media content in its setting.
  • Knowledge of how components of GenAI products or services combine to work.
  • Collaborating with cross‑functional teams to define AI project requirements and objectives, ensuring alignment with overall business goals.
  • MA in (computational) linguistics, data science, computer science (AI/ML/NLU), quantitative social sciences, or a related field;
    PhD strongly preferred.
  • Language and language data expertise:
    Extensive experience working with human language data and designing human evaluation tasks, including multi‑phase and complex workflows.
    • Deep understanding of language and its relationship with culture.
    • Ability to identify ambiguity and subjectivity in language.
    • Ability to work with multi‑lingual and multi‑modal projects.
  • Quantitative analysis skills:
    Advanced knowledge of statistics, metrics (e.g., F1 score, inter‑rater reliability metrics), and data analysis methods such as sampling.
  • Technical skills:
    • Experience with NLP techniques and tools such as Spa Cy, NLTK, or Hugging Face.
    • Proficiency in Python for handling/transforming large datasets (pre-/post‑processing, pandas), performing quantitative analyses, and visualizing data (e.g., matplotlib, seaborn).
  • Data processing:
    • Deep understanding of data pipelines to support ML and NLP workflows.
    • Knowledge of efficient data collection, transformation, and storage.
    • Knowledge of data structures, algorithms, and data engineering principles.
  • Excellent interpersonal skills for effective cross‑functional stakeholder engagement.
  • Excellent problem‑solving skills, with the ability to think critically and creatively to develop innovative AI…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary