LLM Evaluation Specialist – Cultural and Linguistic Alignment - Arabic native speaker

Job in Khobar, Eastern Province, Saudi Arabia
Listing for: Innodata
Full Time position
Listed on 2025-12-17
Job specializations:
  • IT/Tech
    Data Analyst
Salary/Wage Range or Industry Benchmark: 120,000 - 150,000 SAR yearly
Job Description & How to Apply Below
Position: LLM Evaluation Specialist – Cultural and Linguistic Alignment - Arabic native speaker

Overview

We are looking for linguistically and culturally aware professionals to support the evaluation and enhancement of multilingual prompt-response datasets for large language models (LLMs). This role involves rubric design, evaluation of translations and model outputs, prompt creation, and red teaming focused on identifying and surfacing cultural nuances and biases in LLM behaviour.

Responsibilities

Key Responsibilities:

Rubric Definition & Prompt Evaluation
  • Update rubric definitions with Arabic-specific examples to ensure cultural and linguistic relevance.
  • Identify the need for additional rubrics tailored to specific languages or regional contexts.
  • Review prompts translated from English into Arabic and revise where translations appear unnatural or inaccurate.
  • Write thoughtful prompts that test the cultural awareness of LLMs.
  • Rate prompt-response pairs using a standardized, rubric-based evaluation template and provide detailed justifications to support the findings.
  • Document problematic outputs and annotate them with clear explanations of rubric violations or cultural insensitivities.
Required Qualifications
  • Native proficiency in Arabic and deep familiarity with cultural norms in the corresponding region.
  • Experience in LLM evaluation, content moderation, or linguistic QA preferred.
  • Strong attention to detail with the ability to identify subtle issues in language use, tone, and cultural references.
  • Comfortable working in spreadsheets and evaluation templates.
  • Master’s degree in a relevant field.
Preferred Qualifications
  • Prior experience with prompt engineering or LLM testing.
  • Familiarity with tools such as Gemini, ChatGPT or similar LLM platforms.
  • Ability to clearly articulate reasoning behind rubric ratings or prompt edits.