[Wiki] Data mining is an interdisciplinary subfield of computer science. It is the computational process of discovering patterns in large data sets involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use.
After completing this course, students will be able to:
- Describe data mining concepts and considerations.
- Select an appropriate method for a data extraction process.
- perform the CRISP process on a data mining project.
- Design and customize data mining algorithms using R library.
- Learning and discovering interesting relations between variables in large databases using Association rules.
- Create a decision support tool based on machine learning and Decision Tree techniques.
- Create scoring models optimizing direct marketing.
- Perform hierarchical clustering (also called hierarchical cluster analysis or HCA) to describe large databases.
|Course track||Labs track (Pr. Nabila ZRIRA)|
Basics about probability theory and a certain taste for Statistics, Data analysis techniques and data bases are required for this course.
The classes will be given in French by default. Slides will be in French/ English and available in PDF.
- coming soon
|21/04/17||08:00 1M to 12:00 AM||
|28/04/17||08:00 1M to 12:00 AM||
|05/05/17||08:00 1M to 12:00 AM||
|12/05/17||08:00 1M to 12:00 AM||
|19/05/17||08:00 1M to 12:00 AM||
R labs (Download here)
Last exam (Download here)
Case study R (Download here)
Project (Download here)
• Team of 2 students at most;
• The work must be provided as a report and CD containing the data and R scripts before: Thursday, December 15, 2016
• The date of the individual oral examination will be communicated later.