Model Accuracy Development and Test Engineer,Senior; Datacentre AI Engineering KSA Job Riyadh area,Riyadh Region Saudi Arabia,IT/Tech

Position: Model Accuracy Development and Test Engineer, Senior (Data centre AI Engineering KSA

Company

Qualcomm Middle East Information Technology Company LLC

Company

Qualcomm Middle East Information Technology Company LLC

Job Area

Engineering Group, Engineering Group >
Software Engineering

General Summary

About Us

Qualcomm is growing its presence in Riyadh and is hiring Data Centre Engineers to support our expanding infrastructure across the region. As Saudi Arabia accelerates its digital transformation under Vision 2030, Qualcomm is investing in world‑class computing and data centre capabilities to power AI, cloud, and advanced connectivity s is a unique opportunity to work in a fast‑growing technology hub, supporting critical environments and helping shape the future of data centre operations in the Kingdom and beyond.

About

The Role

We are seeking an Inference Accuracy senior engineer to design, develop, and validate model accuracy of deep learning models deployed role focuses on deep accuracy analysis, debugging, accuracy evaluation, and recoveryduring inference on large data-center hardware platforms. This position requires strong problem-solving ability, excellent Python programming skills, and hands-on expertise with inference pipelines.

Key Responsibilities Include

Define and implement accuracy KPIs across precision modes
Develop scalable Python-based accuracy evaluation tools and automated pipelines.
Implement accuracy-preserving optimizations for inference frameworks (Tensor

RT, ONNX Runtime, AI Template, Triton).
Build and maintain automated pipelines for accuracy evaluation across multiple frameworks (ONNX, Tensor Flow, PyTorch).
Develop reusable plugins for preprocessing, post-processing, and metric evaluation.
Execute comprehensive accuracy tests for large-scale models (LLMs, vision, diffusion).
Validate accuracy under various quantization and precision settings (FP32, FP16, INT8).
Perform accuracy analysis with deep understanding of model architecture, including layers, attention mechanisms, and parameter configurations.
Identify architecture-driven accuracy degradation trends and propose optimization strategies.
Identify issues related to preprocessing drift, tokenization mismatches, operator fallback, and quantization effects.
Analyze accuracy differences across hardware targets, firmware versions, and runtime backends.
Perform slice-based accuracy analysis (batch size, concurrency, sequence length, domain shifts).
Design and run experiments to recover accuracy, including fine-tuning, calibration, and hyperparameter adjustments.
Debug accuracy failures by tracing root causes across data preprocessing, model layers, quantization steps, and deployment pipelines.
Compare results across different hardware/software stacks and generate actionable insights.
Document workflows, maintain dashboards, and publish accuracy results for stakeholders.

Required Skillset

Strong background in AI/ML model evaluation and accuracy metrics.
Solid understanding of model architectures (transformers, CNNs, RNNs, MoE) and their impact on accuracy.
Experience with large language models (LLMs) and generative AI accuracy validation.
Expertise with inference runtimes (Tensor

RT, ONNX Runtime, Triton).
Understanding of quantization (INT8/FP8/INT4), calibration, QAT, and accuracy trade-offs.
Experience with model graph conversion (PyTorch → ONNX → backend engines).
Hands-on experience with accuracy pipeline development and automation frameworks.
Proficiency in Python and familiarity with ML toolkits (ONNX Runtime, Tensor Flow, PyTorch).
Expertise in accuracy analysis, including statistical methods and visualization tools
Ability to design experiments for accuracy recovery and debug accuracy failures effectively.
Knowledge of quantization techniques and mixed-precision workflows.
Strong problem-solving and analytical skills with the ability to isolate complex accuracy issues.

Preferred Skillset

Understanding of video generation model accuracy and multi-modal evaluation benchmarking
Experience with data-center accelerators (NVIDIA A100/H100/B200, AI100 Ultra, Gaudi, TPU).
Knowledge of LLM accuracy evaluation tools (lm-eval, HELM, synthetic benchmarks) is an advantage
Familiarity with distributed deployment systems (Kubernetes, cloud inference…

Model Accuracy Development and Test Engineer, Senior; Datacentre AI Engineering KSA