Sustainable generative AI models

About the project

This project aims to pioneer advancements in the efficiency of Generative AI Models (GenAI), focusing on achieving lower latencies and smaller model sizes without compromising performance.

As GenAI become increasingly central to a wide range of applications, from generating images to generating videos and music, their computational demand and the time required for training and inference have escalated.

This research seeks to address these challenges by developing innovative techniques for efficiency, including architectural innovations, compression strategies, algorithmic improvements, and system-level optimizations.

The goal of this project is to enable the deployment of state-of-the-art Generative AI models across broader scenarios of computing environments, from high-end servers to consumer-level machines.

You will contribute to making GenAI more democratic, efficient, and scalable, paving the way for their application in real-time and resource-constrained scenarios.

The overall objectives of your research will be:

to develop cutting-edge techniques for model compression, such as pruning, quantization, and knowledge distillation, tailored for GenAI models
to design and experiment with new GenAI architectures that are more efficient, requiring less computational power and memory
to create new algorithms and system-wide optimizations to accelerate both training and inference processes for GenAI, making them more suitable for deployment across a variety of computing environments
to develop and utilize benchmarks and metrics specifically designed to evaluate the efficiency and performance of GenAI under various computational constraints.

Potential supervisors

Lead supervisor

Supervisors

Dr Zhiwu Huang

Lecturer

Research interests

Computer Vision
Machine Learning
Generative AI

Entry requirements

You must have a UK 2:1 honours degree, or its international equivalent.

You need to have:

strong ML background
good programming skills

Background in systems will be an additional advantage but is not necessary.

Fees and funding

We offer a range of funding opportunities for both UK and international students. Horizon Europe fee waivers automatically cover the difference between overseas and UK fees for qualifying students.

Competition-based Presidential Bursaries from the University cover the difference between overseas and UK fees for top-ranked applicants.

Competition-based studentships offered by our schools typically cover UK-level tuition fees and a stipend for living costs (minimum of £19,237 in 2024-25) for top-ranked applicants.

Funding will be awarded on a rolling basis, so apply early for the best opportunity to be considered.

How to apply

Apply now

You need to:

choose programme type (Research), 2025/26, Faculty of Engineering and Physical Sciences
select Full time or Part time
choose the relevant PhD in Computer Science
add name of the supervisor Dr Jagmohan Chauhan in section 2 of the application form

Applications should include:

personal statement
your CV (resumé)
2 academic references
degree transcripts to date

Contact us

Faculty of Engineering and Physical Sciences

If you have a general question, email our doctoral college (feps-pgr-apply@soton.ac.uk).

Project leader

For an initial conversation, please email Dr Jagmohan Chauhan: j.chauhan@soton.ac.uk

Postgraduate research project