More jobs:
Senior Principal Software Engineer - AI Infrastructure Innovation
Job in
Redwood City, San Mateo County, California, 94061, USA
Listed on 2026-01-01
Listing for:
Oracle
Full Time
position Listed on 2026-01-01
Job specializations:
-
Software Development
AI Engineer, Software Engineer
Job Description & How to Apply Below
Join to apply for the Senior Principal Software Engineer - AI Infrastructure Innovation role at Oracle
5 days ago – be among the first 25 applicants
Oracle Cloud Infrastructure’s (OCI) architecture development engineering team is seeking a highly driven GPU platform software & system development engineer at the Principal Engineer level. We are at the forefront of AI innovation, exploring the next generation of AI accelerators and hardware solutions.
Responsibilities
As a Senior Principal software engineer, part of our growing team, you will be involved in evaluation, prototyping, and optimizing cutting‑edge AI hardware, including custom‑designed AI chips and systems and software to drive next‑gen Cloud AI Infrastructure platforms.
• Evaluation of system architecture and proposed implementation path analysis.
• Work directly with hardware design and development teams on architecture, implementation, development, deployment, and troubleshooting of AI hardware platforms. Collaboration is also expected with the wider Oracle engineering and operations functional groups as well as our external partners.
• Conduct comprehensive benchmarking and performance analysis of AI accelerators from emerging hardware vendors (e.g., Samba Nova, Groq).
• Compare and contrast new AI accelerators with industry‑standard hardware (e.g., NVIDIA GPUs) for training and inference workloads.
• Develop tools and processes for evaluating the performance of hardware in real‑world AI applications.
• Contribute to the design and improvement of performance optimization algorithms for AI models running on the hardware.
Our Senior Principal engineers are also the people who can work independently and provide technical leadership to the broader organization. You should have experience developing AI infrastructure and operating high‑scale services, and an understanding of how to make these cloud‑scale services resilient. The ideal candidate will be technically strong and productive; someone who knows how to balance speed and quality with iterative and incremental improvements.
You understand operational excellence and know how to infuse a culture of being proactive within your team. You recommend and justify major changes to new and existing products and establish consensus with data‑driven approaches.
Basic Qualifications
• BS or MS degree in Computer Science or relevant technical field involving coding or equivalent practical experience.
• 10+ years of total experience in software development.
• Demonstrated ability to write great code using Java, GoLang, C#, or similar OO languages.
• Solid knowledge of AI / GPU platform architecture and their capabilities.
• Experience working on large‑scale, highly distributed services infrastructure.
• Solid working experience with GPU supplier test code as well as open‑source AI test / characterization tools.
• Experience with the architecture, design, and implementation of modern server platforms consisting of multiple architectures and vendors, including x86 and ARM server architectures.
• Demonstrated experience debugging and root‑causing complex issues that may have a mix of hardware and software causes.
• Systematic problem‑solving approach, strong communication skills, a sense of ownership, and drive.
Preferred Qualifications
• Experience as technical lead on a large‑scale cloud service.
• Hands‑on experience developing and maintaining services on a public cloud platform (e.g., AWS, Azure, Oracle).
• Experience with AI accelerator chips (e.g., Samba Nova, Groq, etc.).
• Knowledge of AI accelerator benchmarks and tools for performance evaluation (e.g., MLPerf, Deep Bench).
• Understanding of AI model optimization techniques for hardware acceleration.
• Strong understanding and experience running firmware and system diagnostics tools using BMC firmware, UEFI/BIOS and Linux tools. Skilled in scripting to customize tests.
Compensation and Benefits
US:
Hiring Range in USD from: $96,800 – $251,600 per year. May be eligible for bonus, equity, and compensation deferral.
• Medical, dental, and vision insurance, including expert medical opinion.
• Short‑term…
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×