×
Register Here to Apply for Jobs or Post Jobs. X

Serverless LLM Architect

Job in Edinburgh, City of Edinburgh Area, EH91, Scotland, UK
Listing for: Huawei Technologies Research & Development (UK) Ltd
Full Time position
Listed on 2025-10-08
Job specializations:
  • IT/Tech
    AI Engineer, Data Scientist
Job Description & How to Apply Below
Location: City of Edinburgh

Join to apply for the Serverless LLM Architect role at Huawei Technologies Research & Development (UK) Ltd

About Huawei Research And Development UK Limited

Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices. We have 207,000 employees and operate in over 170 countries and regions, serving more than three billion people around the world.

Our vision and mission is to bring digital to every person, home and organization for a fully connected, intelligent world. To this end, we will drive ubiquitous connectivity and promote equal access to networks; bring cloud and artificial intelligence to all four corners of the earth to provide superior computing power where you need it, when you need it; build digital platforms to help all industries and organizations become more agile, efficient, and dynamic;

redefine user experience with AI, making it more personalized for people in all aspects of their life, whether they’re at home, in the office, or on the go.

This spirit of innovation has led Huawei to work in close partnership with leading academic institutions in the UK to develop and refine the latest technologies. With a shared commitment to innovation and progress, both parties have worked together to achieve common goals and establish a strong partnership.

Huawei’s vision is a fully connected, intelligent world. To achieve this, we work to inspire passion for basic research around the world. Our combined passion drives development across the global innovation value chain. Huawei has the largest Research and Development organization in the world with 96,000+ employees in research centers around the globe.

Huawei Research And Development UK Limited Overview

We continue to explore and define new research directions and new services. We have expanded our collaborations with academic researchers; researched new network architectures, integration of communications and key enabling technologies; and developed the fundamental theories of these technologies.

Job Summary

As a pioneer in global technological innovation, Huawei is committed to advancing the development of information technologies and has made remarkable achievements in server and device services, showcasing its strong technological innovation and market reach.

Joining the Huawei Serverless LLM team, you will be in cutting-edge fields such as AI infrastructure, data systems, artificial intelligence, and cloud computing. You will work side by side with global expert teams to meet hundreds of millions of service requirements.

Key Responsibilities:

  • Use serverless methods to ensure excellent performance of the LLM service in high-concurrency scenarios, optimize the response speed and resource consumption of the LLM service, and achieve high throughput and low latency in inference.
  • Explore the next-generation distributed inference engine to ensure high reliability, scalability, and O&M convenience of the system and support large-scale LLM commercial use in the future.
  • Track the latest LLM optimization technology to ensure model performance while effectively reducing computing costs, improving loading efficiency, and achieving ultimate system throughput.
  • Identify and define future-oriented technical challenges in the serverless LLM field, and enhance technical communication and cooperation with European academia.
  • Work closely with cross-functional teams to participate in the innovation of AI infrastructure, data systems, and cloud computing technologies, and promote the commercial application and implementation of Huawei's serverless LLM architecture.

Person Specification:

Required:

  • Understand the principles and architecture design of LLMs. Have strong experience in LLM optimization and servitization, including technologies for reducing resource consumption and response delay.
  • Have a basic command of the distributed system framework and serverless architecture. Have a good command of the core concepts of distributed computing.
  • Have experience in designing and optimizing large-scale distributed cluster systems. Have a basic command of common serverless technologies such as on-demand invoking,…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary