AI/ML/LLM Systems Engineer - Enterprise AI Platform Engineer - Relocate to Saudi Arabia, Perman
Listed on 2026-05-18
-
IT/Tech
AI Engineer, Machine Learning/ ML Engineer
This position requires full relocation to Saudi Arabia. It is a permanent full time Expat position with an attractive relocation package. Please note only qualified candidates will be contacted.
*** 8+ years of experience in Python/SQL, LLM and AI/ML systems is REQUIRED***
We are seeking an AI/ML/LLM Systems Engineer to join our Digital & AI Center of Excellence and contribute to the development of enterprise-scale AI platforms that support advanced machine learning and language model inference across Saudi Aramco’s operations.
The Digital & AI Center of Excellence is responsible for delivering scalable, secure, and high-performance AI/ML/LLM systems that drive innovation and operational efficiency. In this role, you will design and maintain infrastructure for deploying and optimizing large language models (LLMs) and vision models, hosted on NVIDIA Super Pods/ Cloud and containerized environments.
Your primary role is to ensure the efficient and scalable operation of AI models within enterprise platforms. You will be responsible for deploying, monitoring, and optimizing inference workloads, integrating vector and relational databases, and implementing orchestration and Dev Ops pipelines to support continuous model improvement and delivery.
Duties & Responsibilities- Deploy and manage LLMs and vision models on NVIDIA Super Pods, Cloud, ensuring high performance and efficient use of GPU resources.
- Build and maintain scalable inference pipelines using Kubernetes (K8s), Docker, and Open Shift for enterprise AI platforms.
- Optimize inference performance through multiple techniques.
- Benchmark and evaluate LLMs for performance, accuracy, latency, and resource utilization across different hardware and software configurations.
- Implement and support LLMOps frameworks with full observability, including logging, tracing, and model performance tracking.
- Integrate and manage vector databases (Elasticsearch) and relational databases (Postgre
SQL) for efficient data retrieval and user interaction history tracking. - Implement and maintain CI/CD (Continuous Integration/Continuous Delivery) pipelines for model and platform updates using Git, Bitbucket, Jenkins, and ArgoCD.
- Ensure high availability and reliability of AI application workflows using frameworks like Haystack.
- Collaborate with infrastructure teams on GPU provisioning and resource allocation for AI workloads.
- Develop and maintain monitoring, alerting, and dashboarding systems for AI/ML workloads to ensure SLA/SLO compliance.
- Hold a Bachelor’s degree in Computer Science, Software Engineering, or a related field.
- Have 8 years of experience in AI/ML systems or cloud-native infrastructure, including at least 4 years in LLM deployment and optimization.
- Proficiency in Python and SQL is required, with experience in building and optimizing AI/ML applications.
- Ability to work with Kubernetes (K8s), Docker, and Open Shift in production environments.
- Experience deploying and optimizing LLMs and vision models on NVIDIA GPU clusters and high-performance computing (HPC) and Cloud environments.
- Demonstrated proficiency in inference scaling, distributed computing, and SLA/SLO planning for AI workloads.
- Strong knowledge in Elasticsearch, Postgre
SQL, and workflow frameworks like Haystack for AI application development. - Experience implementing CI/CD pipelines using tools such as Git, Bitbucket, Jenkins, and ArgoCD.
- Experience in benchmarking and evaluating LLMs for performance, accuracy, and efficiency.
- Monitoring and dashboarding for AI/ML systems is also necessary.
Work Location:
Within Saudi Arabia – To be specified in Job offer
Work Schedule:
Full Time - To be specified in Job offer
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).