Senior Cluster Engineer – AI Inference Infrastructure
Listed on 2025-12-02
IT/Tech
Systems Engineer
Company: Qualcomm Technologies, Inc.
Job Area: Engineering Group, Engineering Group > Software Engineering
General Summary:
As a leading technology innovator, Qualcomm pushes the boundaries of what's possible to enable next-generation experiences and drives digital transformation to help create a smarter, connected future for all.
We are seeking a Senior Cluster Engineer to design, deploy, and manage our AI inference cluster ecosystem. This role will deliver and deploy clusters providing high availability, scalability, and performance for our customers.
Key Responsibilities:
- Design and manage large-scale AI inference clusters.
- Oversee server provisioning, networking, and OS lifecycle management in datacenters.
- Implement automation frameworks for cluster deployment and maintenance.
- Integrate out-of-band management using Redfish APIs.
- Manage and optimize Kubernetes and Slurm clusters for AI workloads.
- Ensure high-performance networking with RoCE/RDMA.
- Build telemetry and observability systems using Prometheus and OpenTelemetry.
- 2+ years of experience in infrastructure engineering or HPC environments.
- Deep expertise in Linux system administration and cluster orchestration (Kubernetes and Slurm).
- Strong knowledge of datacenter networking fundamentals and RoCE/RDMA.
- Proficiency in Python and Shell scripting for automation.
- Hands-on experience with Ansible or similar automation tools.
- Strong software engineering background (design patterns, CI/CD, testing).
- Exposure to cloud platforms (AWS, Azure, GCP) and hybrid deployments.
- Familiarity with AI inference frameworks and GPU-based workloads.
- Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.
- Master's degree in Engineering, Information Systems, Computer Science, or related field and 1+ year of Software Engineering or related work experience.
- PhD in Engineering, Information Systems, Computer Science, or related field.
- 2+ years of academic or work experience with programming languages such as C, C++, Java, Python, etc.
Qualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, Qualcomm is committed to providing an accessible process. You may request disability accommodations to assist in participating in the hiring process; Qualcomm will provide reasonable accommodations as required by law.
EEO Employer:
Qualcomm is an equal opportunity employer; all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or any other protected classification.
Qualcomm expects its employees to abide by all applicable policies and procedures, including security requirements regarding protection of confidential information and other proprietary information, to the extent those requirements are permissible under applicable law.
Pay range and Other Compensation & Benefits: $ - $
The above pay scale reflects the broad, minimum to maximum, pay scale for this job code for the location for which it has been posted. Salary is only one component of total compensation; we also offer a competitive annual discretionary bonus program and opportunity for annual RSU grants. Our benefits package supports your success at work and at home. Your recruiter can discuss details, and you can review more about our US benefits at the provided link.
If you would like more information about this role, please contact Qualcomm Careers.