Teaching : Statistical Data Mining (MS1b HT2013)

Term: Hilary Term, January 14 - March 8, 2013
Lecturer: Yee Whye Teh
[website] [email]
Teaching Assistant: Yuanyuan Liu
[email]
Lectures: 1100-1200 Wednesdays (all weeks)
1100-1200 Thursdays (odd weeks)
Mathematical Institute L1
Problem Sheets:
Classes:
(Part C only)
Due 1200 Mondays (Weeks 2-8) in 1 South Parks Road
1500-1700 Wednesdays (Weeks 2-8).
Week 2: Seminar room in 2 SPR, Weeks 3-8: Seminar room in 1 SPR.
Practicals:
(Part C only)
1100-1200 Thursdays (even weeks)
1 South Parks Road Computing Lab
Miniproject:
(MSc only)
TBA
To be carried out over Easter break
Google Group: https://groups.google.com/forum/?hl=en-GB#!forum/statistical-data-mining
Not formally part of course, but you might find it useful to ask other students questions.

News

Compiled Slides

Schedule

14/1 16/1 Lecture: Intro PCA 17/1 Lecture: SVD MDS Isomap
21/1 Problem Sheet 1 23/1 Lecture: Linkage K-means VQ 24/1 Part C: Practical 1
28/1 Problem Sheet 2 30/1 Lecture: Mixtures 31/1 Lecture: decision theory
04/2 Problem Sheet 3 06/2 Lecture: LDA QDA Naive Bayes
06/2 5pm MSc Class
07/2 Part C Practical: Practical 2
11/2 Problem Sheet 4 13/2 Lecture: Bayes LogReg Evaluating 14/2 Lecture: kNN LVQ
18/2 Problem Sheet 5 20/2 Lecture: CART ModelComplexity 21/2 Part C Practical: Practical 3
25/2 Problem Sheet 6 27/2 Lecture: NeuralNets 28/2 Lecture: Bagging
04/3 Problem Sheet 7 06/3 Lecture: RandomForest Boosting 07/3 Part C Practical: Practical 4

Data

R

Some Textbooks

There are (too?) many textbooks on Data Mining. Some popular ones: