LLM Architect
1001, Lausanne, Canton de Vaud, Switzerland
Listed on 2026-05-27
IT/Tech
AI Engineer (Applied/Software), Systems Engineer, Machine Learning/ML Engineer, Data Scientist
Giotto.ai is a Switzerland-based AI company building intelligence systems for Switzerland and Europe.
Our mission is to build AI capabilities that enable Switzerland and Europe to preserve strategic independence, cultural identity, and core values while achieving world-class performance in advanced reasoning systems.
Our research has demonstrated strong performance on international benchmarks such as ARC-AGI. We are now building the next generation of production AI systems focused on reasoning, efficiency, and scalable deployment.
The Role
We are looking for a senior engineer to help architect and build large-scale AI systems around LLMs, reasoning models, and distributed inference infrastructure.
You will work on core AI architecture problems:
- scalable inference systems
- distributed training pipelines
- reasoning-oriented model architectures
- efficient serving and orchestration
- production systems for advanced AI workloads
Depending on your profile, you may operate as:
- a highly autonomous senior individual contributor
- a technical lead for a core AI initiative
- or an architect helping shape the long-term direction of our AI stack
We value people who combine deep technical judgment with strong execution.
What You’ll Work On
- Architecting production-grade LLM systems
- Designing scalable inference and training infrastructure
- Optimizing performance across GPU and distributed environments
- Building systems around reasoning and agentic workflows
- Improving efficiency, latency, reliability, and throughput
- Working closely with research teams to bring frontier ideas into production
- Contributing to the long-term technical direction of sovereign AI systems in Europe
- Python
- PyTorch
- Hugging Face ecosystem
- Large Language Models (LLMs)
- vLLM
- Distributed systems
- Ray
- Docker
- Kubernetes
- Designing or operating large-scale LLM systems
- Distributed inference or training at scale
- CUDA programming or GPU optimization
- Systems-level performance engineering
- Experience with model serving infrastructure
- Technical leadership on complex AI systems
- Research or engineering work on reasoning models
- Experience in high-performance engineering environments
Ideal Profile
- 5+ years building ML/NLP systems
- Strong expertise in Python and PyTorch
- Proven experience deploying ML systems into production
- Deep understanding of transformer architectures and modern LLM systems
- Experience with distributed compute environments
- Comfortable operating in fast-moving research + production settings
- Strong ownership mindset and ability to work from first principles
We especially appreciate candidates who have built difficult systems end-to-end and can speak concretely about trade-offs, failures, scaling challenges, and engineering decisions.
Nice to Have
- PhD in Computer Science, Mathematics, Physics, or related hard sciences
- Experience at top-tier AI labs or large-scale technology companies
- Background in systems optimization or infrastructure engineering
- Competitive programming or Olympiad background (IMO, IOI, IPhO, etc.)
- Open-source contributions in ML infrastructure or LLM tooling
We offer full-time employment in Switzerland.
Hybrid setup:
- Remote work fully supported
- Team gathers one week per month in the Swiss office
Exceptional candidates elsewhere in Europe may also be considered.
Why Giotto.ai
We are building frontier AI systems with a small, highly technical team focused on reasoning, efficiency, and sovereignty.
This is an opportunity to work on foundational AI infrastructure and architecture problems with meaningful technical ownership from day one.