AI System Architect Job Oxford area,England UK,IT/Tech

The Role

We are looking for a System Architect who thinks in AI first and hardware second. You will own the architectural vision that bridges our AI workload requirements with our silicon and software execution. This is not a role for someone who designs chips and then asks what AI runs on them — it's a role for someone who deeply understands AI models, inference and training pipelines, and then works backwards to define the hardware and software systems that serve them best.

You will sit at the intersection of leadership, hardware engineering, and software engineering — translating high-level product strategy into concrete architectural decisions and ensuring all three teams are aligned, unblocked, and pulling in the same direction.

What You'll Do

Architecture & Technical Leadership

Define and own the end-to-end system architecture, from AI workload characterisation through to chip microarchitecture trade-offs and software stack interfaces
Drive architectural decisions by starting with AI model and operator analysis — identifying compute, memory bandwidth, data movement, and sparsity patterns that constrain and shape the hardware design
Develop and maintain architectural specifications, performance models, and design documents that serve as the single source of truth across teams
Evaluate architectural trade-offs (latency vs. throughput, on-chip vs. off-chip memory, dataflow strategies, precision formats) with a quantitative, workload-grounded methodology
Stay current with the frontier of AI research (model architectures, training techniques, inference optimisations) and translate emerging trends into architectural foresight

Cross-functional Coordination

Act as the primary technical bridge between the leadership team, hardware engineering, and software engineering — ensuring that decisions made in one domain are properly communicated, challenged, and integrated in others
Partner with the leadership team to translate business goals and product roadmap into architectural requirements and phased execution plans
Work closely with the hardware team to ensure microarchitectural decisions are grounded in realistic AI workload demands; push back constructively when hardware-centric thinking diverges from AI requirements
Collaborate with the software team to define clean hardware/software interfaces, programming models, and runtime abstractions that make the hardware genuinely usable for AI practitioners
Facilitate architectural reviews and design discussions that create shared understanding rather than siloed decision-making

Execution & Delivery

Identify and resolve cross-team dependencies and ambiguities early, before they become schedule risks
Define and track key architectural metrics and milestones throughout the chip development lifecycle (pre-RTL through tape-out and post-silicon validation)
Support benchmarking and performance analysis efforts, helping teams understand where the system delivers against AI workload targets and where it falls short

What We're Looking For

Must-Have

Deep, hands‑on understanding of modern AI/ML workloads — transformer architectures, convolutional networks, recommendation systems, or similar — including their compute and memory access patterns
Proven experience defining system or chip architecture in the context of AI/ML acceleration (inference, training, or both)
Ability to build and use analytical performance models (roofline models, memory bandwidth analysis, cycle‑accurate estimates) to guide architectural decisions
Strong communication skills with the ability to adapt technical depth for leadership, hardware engineers, and software engineers alike
Experience working across hardware and software boundaries — comfortable discussing ISA design, compiler interfaces, and runtime scheduling as well as datapath microarchitecture
Suitable University education and/or practical experience

Strong Preference For

Experience at an AI chip startup, AI hardware team at a major tech company (e.g. Google TPU, Meta MTIA, AWS Trainium/Inferentia, Tesla Dojo), or a leading fabless semiconductor company
Familiarity with AI compiler stacks (MLIR, XLA, TVM, Triton) and how they interact with hardware…