Data Science Research Methods (L7) (970G1)

15 credits, Level 7 (Masters)

Autumn teaching

This module will provide you with the practical tools and techniques required to build, analyse and interpret 'big data' datasets. It will cover all aspects of the data science process including:

  • collection
  • munging or wrangling
  • cleaning
  • exploratory data analysis
  • visualisation
  • statistical inference
  • model building
  • implications for applications in the real world.


You will be taught how to scrape data from the internet, develop and test hypotheses, use principal component analysis (PCA) to reduce dimensionality, prepare actionable plans and present your findings. In the laboratory, you will develop your Python programming skills. You will introduced to a number of fundamental standard Python libraries/toolkits for data scientists including NumPy, SciPy, PANDAS and SCIKIT-Learn. In these sessions and yout coursework, you will work with real-world datasets and apply the techniques covered in lectures to that data.

Teaching