×
Register Here to Apply for Jobs or Post Jobs. X

AI Research Engineer; Pre-training - LLM & Multi-Modal

Job in Dubai, Dubai, UAE/Dubai
Listing for: tether
Apprenticeship/Internship position
Listed on 2026-06-14
Job specializations:
  • Software Development
    AI Engineer (Applied/Software), Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 120000 - 200000 AED Yearly AED 120000.00 200000.00 YEAR
Job Description & How to Apply Below
Position: AI Research Engineer (Pre-training - LLM & Multi-Modal)

About the job

As a member of the AI model team, you will drive innovation in architecture development for cutting‑edge models of various scales, including small, large, and multi‑modal systems. Your work will enhance intelligence, improve efficiency, and introduce new capabilities to advance the field.

You will have a deep expertise in Large Language Model (LLM) and Multi‑Modal architectures, a strong grasp of pre‑training optimization, and a hands‑on, research‑driven approach. Your mission is to explore and implement novel techniques and algorithms that lead to groundbreaking advancements: multi‑modal data curation and alignment, strengthening baselines, and identifying and resolving existing pre‑training bottlenecks to push the limits of cross‑modal AI performance.

Responsibilities
  • Large‑Scale Pre‑Training: Conduct foundational pre‑training for LLMs and Multi‑Modal models (integrating text, vision, audio, or other modalities) on large, distributed servers equipped with multi‑nodes and thousands of NVIDIA GPUs.
  • Architecture & Alignment Innovation: Design, prototype, and scale innovative architectures, tokenizers, and cross‑modal alignment layers to enhance model intelligence and multi‑modal understanding.
  • Data Strategy: Source, filter, and curate massive‑scale textual and multi‑modal datasets, establishing robust data pipelines for efficient pre‑training.
  • Experimental Research: Independently and collaboratively execute experiments, analyze results, and refine training methodologies for optimal performance and token efficiency.
  • Optimization & Debugging: Investigate, debug, and eliminate bottlenecks in model efficiency, computational performance, and multi‑modal alignment stability during long training runs.
  • System Scalability: Contribute to the advancement of distributed training systems to ensure seamless scalability and hardware efficiency on target platforms.
Qualifications
  • A degree in Computer Science or related field. Ideally PhD in NLP, Machine Learning, or a related field, complemented by a solid track record in AI R&D (with good publications in A
    * conferences).
  • Hands‑on experience contributing to large‑scale LLM or Multi‑Modal pre‑training runs on large, distributed servers equipped with thousands of NVIDIA GPUs, ensuring scalability and impactful advancements in model performance.
  • Familiarity and practical experience with large‑scale, distributed training frameworks, libraries and tools.
  • Deep knowledge of state‑of‑the‑art transformer and non‑transformer modifications aimed at enhancing intelligence, efficiency and scalability.
  • Strong expertise in PyTorch and Hugging Face libraries with practical experience in model development, continual pre‑training, and deployment.
  • Excellent English communication skills.
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)
0
200
Filters
Education Level
Experience Level (years)
Posted in last:
Salary