Senior Engineer, AI Systems in San Jose
Listed on 2026-06-19
-
IT/Tech
Systems Engineer, Hardware Engineer, AI Engineer (Applied/Software)
Overview
The AGI (Artificial General Intelligence) Computing Lab is dedicated to solving the complex system-level challenges posed by the growing demands of future AI/ML workloads. Our team designs and develops scalable platforms that handle the computational and memory requirements of AI/ML workloads while minimizing energy consumption and maximizing performance. We collaborate with hardware and software engineers to identify challenges and explore computing abstractions that balance hardware and software components.
We conduct research across memory, computing, interconnect, and AI/ML to ensure our platforms handle demanding workloads and advance AGI in an affordable and sustainable manner.
This role is being offered under the AGICL lab as part of DSRA. We are a research-driven systems lab working at the intersection of large models, accelerator hardware, and high-performance software stacks. Our mission is to design, prototype, and optimize next-generation AI systems through tight hardware–software co-design. Our team works hands-on with cutting-edge accelerator hardware, experimental memory systems, and emerging domain-specific (DSLs).
We build and optimize a Triton-based software stack that pushes the limits of performance, efficiency, and scalability for modern LLM workloads.
We are looking for a Senior AI Systems Engineer with deep experience in high-performance Triton kernel development on modern accelerators. In this role, you will design, analyze, and optimize performance-critical kernels used in large-scale LLM inference and training pipelines. You will work closely with hardware architects, compiler engineers, and ML researchers to identify performance bottlenecks, interpret profiling data, and co-design solutions that span software and hardware boundaries.
This role is ideal for engineers who enjoy working close to the hardware stack while still reasoning deeply about model-level abstractions.
Daily onsite presence at our San Jose, CA office / U.S. headquarters in alignment with our Flexible Work policy.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).