Lead Vector Compute Architect
Job in
Sunnyvale, Santa Clara County, California, 94087, USA
Listed on 2026-06-02
Listing for:
Bolt Graphics
Full Time
position Listed on 2026-06-02
Job specializations:
-
Engineering
Systems Engineer, Electrical Engineering, Hardware Engineer, Software Engineer
Job Description & How to Apply Below
Our Values
- Be Fearless: Unmute yourself. Test boundaries and get proven right.
- Remain Adaptable: Stay comfortable in a continuously changing world. If you're wrong, concede and move on.
- Educate Your Ego: Selflessly collaborate towards our shared purpose.
We are looking for an experienced and highly motivated Lead Vector Compute Architect to lead the architecture definition and technical direction for Bolt's next-generation GPUs. The ideal candidate will have strong expertise in data parallel compute unit architecture development, performance modeling, data path integration, and cross-functional collaboration across hardware, software, and systems teams.
This role involves defining scalable and high-performance architectures for advanced compute workloads including graphics, HPC, and system management. This role is on-site and requires someone to be local to the Bay Area.
What you'll do:
- Define data parallel microarchitecture satisfying ISA constraints.
- Drive architecture tradeoff analysis for performance, power, area, bandwidth, latency, and scalability.
- Develop and review system architecture specifications, interface definitions, and microarchitecture requirements.
- Collaborate with RTL, verification, physical design, firmware, software, and system teams throughout the development cycle.
- Lead performance modeling, workload analysis, and bottleneck identification using C/C++/System
C or similar modeling environments.
- Define memory hierarchy, coherency architecture, and cache structures.
- Work closely with verification teams to define architectural test plans and validation strategies.
- Support silicon bring-up, debug, performance tuning, and post-silicon optimization.
- Contribute to long-term technology and product roadmap planning.
- Strong understanding of modern data parallel microarchitectures and subsystem integration.
- Bachelor's or Master's degree in Electrical Engineering, Computer Engineering, Computer Science, or related field.
- 6+ years of experience in modern data parallel microarchitecture including:
- Workload characterization and profiling
- Performance modeling
- Out-of-order data dependency and control
- Utilization / occupancy optimization
- High-performance architecture design techniques
- CPU/GPU/NPU architectures
- NoC/interconnect architectures
- Cache coherency protocols (CHI/ACE/CXL)
- High-speed interfaces (PCIe, UCIe, Ethernet)
- Memory systems (DDR, LPDDR, HBM, GDDR)
- Power, performance, and area optimization
- Strong knowledge of RTL development and verification methodologies.
- Experience with architecture modeling and performance analysis tools.
- Familiarity with firmware/software interaction in complex SoC systems.
- Excellent problem-solving, communication, and leadership skills.
Benefits:
- Medical, Dental, & Vision - 100% covered premiums
- Equity - Stock Options
- 401(k) match
- WFH Hardware
Professionalism, Integrity, and Respect. As an equal opportunity employer, all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, genetic information, national origin, age, disability, or status as a protected veteran.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×