LLM Serving Engineer; Cloud AI Engineering KSA
Listed on 2026-05-18
-
Software Development
Software Engineer, AI Engineer
Company:
Qualcomm Middle East Information Technology Company LLC
Job Area:
Engineering Group >
Systems Engineering
Qualcomm is growing its presence in Riyadh and is hiring Data Centre Engineers to support our expanding infrastructure across the region. As Saudi Arabia accelerates its digital transformation under Vision 2030, Qualcomm is investing in world‑class computing and data centre capabilities to power AI, cloud, and advanced connectivity s is a unique opportunity to work in a fast‑growing technology hub, supporting critical environments and helping shape the future of data centre operations in the Kingdom and beyond.
ThisRole Involves The Following Activities
- Building a scalable LLM inference platform using inference techniques (e.g. disaggregated serving and KV‑Cache management, advanced parallelism, speculative algorithms, model optimization, specialized kernels).
- Contribute to the development of LLM serving packages (e.g. vLLM, SGLang, TGI, Triton‑Inference server, Dynamo, LLM‑d).
- Work closely with customers to drive solutions by collaborating with internal compiler, firmware and platform teams.
- Work at the forefront of GenAI by understanding advanced algorithms (e.g. attention mechanisms, MoEs) and numerics to identify new optimization opportunities.
- Drive efficient serving through smart autoscaling, load balancing and routing.
- Engage with open‑source serving communities to evolve the framework.
- Hands‑on experience in one or more of the following LLM serving/orchestration packages (Triton‑Inference Server, vLLM, SGLang, Ollama, llm‑d, KServe, LMCache, Moon Cake).
- Deep understanding of foundational LLMs, VLMs, SLMs, transformer‑based architectures.
- Strong experience in developing language models using PyTorch.
- Strong computer science fundamentals – algorithms, data structures, parallel and distributed programming.
- Understanding of computer architecture, ML accelerators, in‑memory processing and distributed systems.
- Strong Python development skills for large‑scale projects with passion for software engineering.
- Experience in analyzing, profiling, and optimizing deep learning workloads.
- Proactive learning about the latest inference optimization techniques.
- Excellent communication and problem‑solving skills, with the ability to thrive in a fast‑paced and collaborative environment.
- MS in Computer Science, Machine Learning, Computer Engineering or Electrical Engineering.
- Open‑source contribution to any GenAI package.
- Experience architecting and developing large‑scale distributed systems.
- High‑level kernel design experience (PyTorch, CUDA, Triton).
- Knowledge of torch.compile or torch
Dynamo. - PhD in Computer Science, Computer Engineering or Machine Learning.
- Bachelor's degree in Computer Science, Electrical or Computer Engineering, Information Systems, or related field and 5+ years of hardware, software, or systems engineering experience.
- Master's degree in Computer Science, Electrical or Computer Engineering, Information Systems, or related field and 4+ years of hardware, software, or systems engineering experience.
- PhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of hardware, software, or systems engineering experience.
- Salary including housing & transport allowance
- Stock (RSU's) and performance‑related bonus
- 16 weeks fully paid maternity leave
- 6 weeks fully paid paternity leave
- Employee stock purchase scheme
- Child education allowance
- Relocation and immigration support (if needed)
- Life and medical insurance
- Live+ Well reimbursement for health and recreational membership fees
Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. The contact email provided is for accommodation requests only.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).