MATH38161 Multivariate Statistics and Machine Learning
Coursework
November 2024
Overview
The coursework is a data analysis project with a written report. You will apply skills
and techniques acquired from Week 1 to Week 8 to analyse a subset of the FMNIST
dataset.
In completing this coursework, you should primarily use the techniques and methods
introduced during the course. The assessment will focus on your understanding and
demonstration of these techniques in alignment with the learning outcomes, rather
than the accuracy or exactness of the final results.
The project report will be marked out of 30. The marking scheme is detailed below.
You have twelve days to complete this coursework, with a total workload of approximately 10 hours (including preliminary coursework tasks).
Format
• Software: You should mainly use R to perform the data analysis. You may use
built-in functions from R packages or implement the algorithms with your own
code.
• Report: You may use any document preparation system of your choice but the
final document must be a single PDF in A4 format. Ensure that the text in the
PDF is machine-readable.
• Content: Your report must include the complete analysis in a reproducible format,
integrating the computer code, figures, and text in one document.
• Title Page: Show your full name and your University ID on the title page of your
report.
• Length: The recommended length is 8 pages of content (single-sided) plus the title
page. Maximum length is 10 pages of content plus the title page. Any content
exceeding 10 pages will not be marked.
Submission process and d