Data Center Principal Product and Solutions Architect – Software KSA
Listed on 2026-05-31
-
IT/Tech
Systems Engineer, Cloud Computing
Company:
Qualcomm Middle East Information Technology Company LLC
Job Area:
Engineering Group, Engineering Group >
Software Engineering
About us
As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all. As a Qualcomm Software Engineer, you will design, develop, create, modify, and validate embedded and cloud edge software, applications, and/or specialized utility programs that launch cutting‑edge, world‑class products that meet and exceed customer needs.
Qualcomm Software Engineers collaborate with systems, hardware, architecture, test engineers, and other teams to design system‑level software solutions and obtain information on performance requirements and interfaces.
Qualcomm is building a comprehensive Data Center Software Stack and AI Inference Suite to enable customers to deploy, operate, and scale AI inference at rack‑scale and cluster‑scale. We are seeking a Principal Product & Solutions Architect with deep expertise in systems software, inference platforms, orchestration, runtime integration, and large‑scale distributed systems.
In this role, you will operate at the intersection of platform engineering, AI inference serving, Kubernetes orchestration, rack‑scale operations, and customer enablement. You will lead architecture and design decisions resulting in production‑grade, deployable software stacks, while also shaping roadmap direction based on real customer deployments and workload requirements.
This role requires 15+ years of experience and a proven ability to influence engineering, product, and customer organizations.
What You’ll DoA) Data Center Platform Software (Provisioning → Orchestration → Operations)- Architect and guide implementation of cluster and rack‑scale platform software, including node provisioning, lifecycle management, telemetry, monitoring, and certification‑grade releases.
- Define and support Day‑2 operations readiness, including operational models, escalation paths, and serviceability considerations.
- Lead solution architecture for bare‑metal and rack management, including secure access, authentication, and identity integration aligned with enterprise requirements.
- Guide platform networking architecture in collaboration with infrastructure teams, including IP planning, segmentation, and integration with storage and external systems.
- Establish best practices for deployment automation, configuration management, and operational repeatability across customer environments.
- Define and evolve Inference‑as‑a‑Service architectures, integrating runtime libraries, model serving frameworks, and Kubernetes orchestration.
- Lead architecture for disaggregated and distributed inference to scale workloads across nodes while maintaining predictable latency and throughput.
- Guide design decisions for multi‑instance inference, model placement, accelerator utilization, and scheduling behavior in real‑world deployments.
- Drive architecture for inference traffic routing and load balancing, including gRPC/HTTP‑based serving and health‑aware request distribution.
- Lead performance engineering efforts across the software stack, including profiling, benchmarking, and optimization across runtime, serving, networking, and system layers.
- Translate customer and workload requirements into platform architecture decisions, feature definitions, and roadmap priorities.
- Lead deep technical discovery engagements with strategic customers, defining proof‑of‑concepts, success metrics, rollout strategies, and operational readiness checks.
- Act as a senior technical advisor in architecture reviews, workshops, and executive‑level discussions with customers and partners.
- Collaborate closely with product management to ensure the software stack evolves in alignment with market needs and deployment realities.
- Create and maintain technical documentation, including architecture diagrams, HLDs/LLDs, deployment guides, sizing guidance, and operational runbooks.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).