Postgraduate research project

AI-based multi-modal 3D environment understanding and visualisation

Funding
Fully funded (UK only)
Type of degree
Doctor of Philosophy
Entry requirements
2:1 honours degree View full entry requirements
Faculty graduate school
Faculty of Engineering and Physical Sciences
Closing date

About the project

This project aims to develop an AI-based practical solution for 3D environments understanding from multi-modal (audio/visual) input data and reproducing it in a virtual or augmented reality space allowing real-time 3D interaction with spatial audio adapted to the environment and user locations.

Computer Vision is one of the most active areas where artificial intelligence (AI) is being used. This area is extremely expanding and getting a lot of interests and investments these days. Active perception of a surrounding environment through AI relies heavily on the design of architectures and their extensive training to generate compact representations. Taking advantages of recent advancements in AI technology, these representations have shown significant improvement in building new knowledge and acquiring new skills for AI agents and practical applications in our daily life.

Scene understanding, studies the task of representing a captured scene in a manner emulating human-like understanding of that space. Attaining this understanding is crucial for applications such as robotics, tele-communication, smart home, healthcare and assisted living.

In this project, You will join a team working on a pipeline for modelling and rendering of the full environment including 3D geometry, semantic objects and material attributes from multi-modal inputs such as video, audio and text. You will join this team and investigate topics in AI-based multi-modal 3D environmental scene understanding and visualisation.

Various chances to attend the British vision summer school or major international conferences such as the Conference on Computer Vision and Pattern Recognition (CVPR) and the International Conference on Computer Vision (ICCV).