Data Mining

Statistics for Knowledge Extraction

This course is part of the UEA "Knowledge Extraction MSc". It is a course on statistics, concentrating on modelling. While it aims to be self-contained, some prior knowledge of statistics is essential. This page gives the syllabus together with the locations of some lecture material and some data sets. Short documents are written in HTML as far as possible, but we use PDF as our standard for more substantial material, so you will need a PDF reader.

Documentation

Handouts:
Syllabus (pdf)
Syllabus (html)
Introduction to probability and statistics
Generalized linear Models 2006/07

Data Sets:
Wool Data
Plane data
UK Aids data
US Aids data
Crabs data for course work 1 (2004/5)
Data for 2006/7 cw as .csv file
Data for 2006/7 cw as Excel file
Data for mothers and hypertension
Divorce data

Exercises:
Exercises
Course Work 1 2004/5
Course Work 2006/7

Computing

It is impossible to specify a single program that will satisfy all our needs. My preference is for R, details of which can be found at CRAN (the Comprehensive R Archive Network).

Dr. G.Janacek
School of Computing Sciences
UEA
Norwich NR4 7TJ
tel: +44 (0) 1603 591206
fax: +44 (0) 1603 593674