Model Accuracy Development and Test Engineer, Senior; Datacentre AI Engineering KSA
Listed on 2026-05-30
-
IT/Tech
Data Engineer, Systems Engineer, Data Scientist, AI Engineer
Company
Qualcomm Middle East Information Technology Company LLC
Company
Qualcomm Middle East Information Technology Company LLC
Job Area
Engineering Group, Engineering Group >
Software Engineering
General Summary
About Us
Qualcomm is growing its presence in Riyadh and is hiring Data Centre Engineers to support our expanding infrastructure across the region. As Saudi Arabia accelerates its digital transformation under Vision 2030, Qualcomm is investing in world‑class computing and data centre capabilities to power AI, cloud, and advanced connectivity s is a unique opportunity to work in a fast‑growing technology hub, supporting critical environments and helping shape the future of data centre operations in the Kingdom and beyond.
About
The Role
We are seeking an Inference Accuracy senior engineer to design, develop, and validate model accuracy of deep learning models deployed role focuses on deep accuracy analysis, debugging, accuracy evaluation, and recoveryduring inference on large data-center hardware platforms. This position requires strong problem-solving ability, excellent Python programming skills, and hands-on expertise with inference pipelines.
Key Responsibilities Include
- Define and implement accuracy KPIs across precision modes
- Develop scalable Python-based accuracy evaluation tools and automated pipelines.
- Implement accuracy-preserving optimizations for inference frameworks (Tensor
RT, ONNX Runtime, AI Template, Triton). - Build and maintain automated pipelines for accuracy evaluation across multiple frameworks (ONNX, Tensor Flow, PyTorch).
- Develop reusable plugins for preprocessing, post-processing, and metric evaluation.
- Execute comprehensive accuracy tests for large-scale models (LLMs, vision, diffusion).
- Validate accuracy under various quantization and precision settings (FP32, FP16, INT8).
- Perform accuracy analysis with deep understanding of model architecture, including layers, attention mechanisms, and parameter configurations.
- Identify architecture-driven accuracy degradation trends and propose optimization strategies.
- Identify issues related to preprocessing drift, tokenization mismatches, operator fallback, and quantization effects.
- Analyze accuracy differences across hardware targets, firmware versions, and runtime backends.
- Perform slice-based accuracy analysis (batch size, concurrency, sequence length, domain shifts).
- Design and run experiments to recover accuracy, including fine-tuning, calibration, and hyperparameter adjustments.
- Debug accuracy failures by tracing root causes across data preprocessing, model layers, quantization steps, and deployment pipelines.
- Compare results across different hardware/software stacks and generate actionable insights.
- Document workflows, maintain dashboards, and publish accuracy results for stakeholders.
- Strong background in AI/ML model evaluation and accuracy metrics.
- Solid understanding of model architectures (transformers, CNNs, RNNs, MoE) and their impact on accuracy.
- Experience with large language models (LLMs) and generative AI accuracy validation.
- Expertise with inference runtimes (Tensor
RT, ONNX Runtime, Triton). - Understanding of quantization (INT8/FP8/INT4), calibration, QAT, and accuracy trade-offs.
- Experience with model graph conversion (PyTorch → ONNX → backend engines).
- Hands-on experience with accuracy pipeline development and automation frameworks.
- Proficiency in Python and familiarity with ML toolkits (ONNX Runtime, Tensor Flow, PyTorch).
- Expertise in accuracy analysis, including statistical methods and visualization tools
- Ability to design experiments for accuracy recovery and debug accuracy failures effectively.
- Knowledge of quantization techniques and mixed-precision workflows.
- Strong problem-solving and analytical skills with the ability to isolate complex accuracy issues.
- Understanding of video generation model accuracy and multi-modal evaluation benchmarking
- Experience with data-center accelerators (NVIDIA A100/H100/B200, AI100 Ultra, Gaudi, TPU).
- Knowledge of LLM accuracy evaluation tools (lm-eval, HELM, synthetic benchmarks) is an advantage
- Familiarity with distributed deployment systems (Kubernetes, cloud inference…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).