Cloud Hardware Software Engineering Lead AI Compute
Listed on 2026-05-29
-
Software Development
Embedded Software Engineer, Software Engineer
Summary
Celestica is looking for talented and passionate Sr. Manager, Software Engineering. This team is responsible for designing and developing Cloud Hardware infrastructure that includes Compute, Storage, AI servers and Networking Switches. As a Sr. Manager, you will play a key role in defining and delivering state‑of‑the‑art Cloud Hardware products.
Celestica leads the charge in the AI‑driven era by developing ultra‑dense, power‑efficient, and high‑speed infrastructure solutions. Our approach is grounded in extensive engineering expertise, a proven track record of technology experience, a resilient supply chain and strong global partnerships. We collaborate with customers to design purpose‑built solutions that guarantee optimized performance, engineered reliability, and a truly agile, future‑ready foundation for tomorrow’s AI infrastructure demand.
Role OverviewAn active, hands‑on Manager who balances deep past expertise in low‑level embedded software with active, comprehensive people management. You will lead multiple high‑performance engineering teams developing firmware, drivers, and specialized software stacks for next‑generation AI Compute/Storage and Data Center hardware (CPU/GPU).
Management & Leadership Responsibilities- Team Leadership: Lead more than one high‑performance software team working across multiple firmware layers (low‑level drivers to specialized AI stacks) and be responsible for creating the Technical Roadmap.
- People Management: Set yearly budgets, plans, and team goals. Provide direction, regular performance evaluations, ongoing feedback, and drive KPI targets.
- Mentorship: Coach tech leads on strengths/weaknesses; mentor team members on technical problem‑solving, strategic thinking, and domain skills.
- Cross‑Functional & Customer Strategy: Act as a primary customer contact to review/interpret specifications. Collaborate closely with hardware design, software, program, and product management teams.
- Execution & Innovation: Take the lead in New Product Introductions (NPIs), drive continuous improvements, and contribute to industry technical forums.
- Gather system‑wide requirements and provide cross‑system technical solutions.
- Design, implement, and maintain drivers, application software, and software libraries that drive next‑generation CPU/GPU Data Center systems.
- Review and maintain the quality of production‑bound code.
- Education: Bachelor’s Degree or higher in Engineering.
- Overall
Experience:
15+ years of experience in the industry with a proven ability to lead teams. - Programming Core: 8+ years of experience in C / C++ and other low‑level programming languages and embedded systems.
- Hardware & Board Bring‑up: 5+ years of experience in hardware bring‑up, validation, troubleshooting, and resolution of hardware/software interfaces (Boot loader and high‑speed interfaces: PCIe, USB, DDR, MIPI, Ethernet).
- Driver Development: 5+ years of experience in Device Drivers (PCIe, I2C, etc.), including board bring‑up to verify functionality (Linux kernel bring‑up, enablement, and up streaming).
- Systems & Architecture Knowledge: Strong understanding of BIOS, BMC, Secure Boot, and Linux Kernel.
- Experience in Real‑Time OS (RTOS) / Linux kernel memory management, interrupts, and IOMMU.
- Soft Skills: Strong analytical and problem‑solving abilities; proven capability to articulate requirements through written architecture specs and verbal presentations.
- Familiarity with major hardware elements: CPU, DPU, memory, and NICs.
- Experience designing/developing low‑level software specifically to enable Data Center hardware.
- Prior hands‑on experience with GPU computing libraries like CUDA and ROCm is a major plus.
- Prior hands‑on with Linux Kernel/drivers, PCIe, BMC, BIOS/BSP.
- CPU/GPU based compute server firmware development.
- AI workload/orchestration.
- Networking data path knowledge.
- Strong SW engineering and product development experience.
Duties of this position are performed in a normal office environment. Duties may require extended periods of sitting and sustained visual concentration on a computer…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).