Module overview
Welcome to the Foundations of Data Science! 'Data scientist' has been described as the sexiest job of the 21st century, with the demand for highly skilled practitioners rising quickly to leverage the increasing amount of data available for study. As the amount of data increases, so too does the need for employees who can extract meaningful insights from this data. This course is designed to introduce you to a range of topics and concepts related to the data science process. It will cover the technical pipeline from data collection, to processing, analysis and visualisation. You will be introduced to and gain knowledge of various topics such as statistics, crawling data, data visualisation, advanced databases and cloud computing, along with a toolkit to use with data (including R, D3, Google Refine and Hadoop). The course will include a mix of lectures, tutorials, hands-on exercises and invited talks from expert data science practitioners. Coursework will allow you to gain experience using the theory and techniques delivered in the lectures, while the group project will give you the chance to apply knowledge of the data science process and toolkit in the development of a data science application.