About the project
In this project, several PhD students are working on a pipeline for modelling and rendering complete environments, including 3D geometry, semantic objects and material attributes, from multi-modal inputs such as video, audio and text.
Computer vision is one of the most active application areas of artificial intelligence (AI). The field is expanding rapidly and attracting considerable interest and investment.
Active perception of a surrounding environment through AI relies heavily on the design of architectures and on their extensive training to generate compact representations. Taking advantage of recent advances in deep learning, these representations have significantly improved how AI agents build new knowledge and acquire new skills, and have led to practical applications in daily life.
Scene understanding is the task of representing a captured scene in a way that emulates human understanding of that space. Attaining this understanding is crucial for applications such as robotics, telecommunication, smart homes, healthcare and assisted living.
You will join this team and investigate topics in AI-based multi-modal 3D environment model reconstruction.
Programme start dates:
- September 2025 for students on the UK student scholarship.
- April, June, or September 2025 for self-funded students.