Special Topics in Computer Science:
Probabilistic Data Mining
Data mining is defined as the process of "information extraction" from a set of data. In formal terms all data mining problems can be restated as inferring model parameters, consequently we will discuss the issues related both with the choice of good -- or less adequate -- models for a given set of observation and to the problems related with the parameter estimation process given the model.
We will use the framework provided by the machine learning methodology where the emphasis is both on the models and on the type of data and observation process at hand. The illustration of methods is done with real data and realistic observation models.
Dr. Lehel Csato
The handouts of this course can be downloaded from here.
Students will have to do the following project and will have to send it to the lecturer not later than December 11, 2010. The marks for this course will be based on the project.