Data and Text Mining

Materials and instructions by Bojan Cestnik

Link to the course Data and Text Mining

Data and Text Mining course

Materials ICT2

ICT2 Course I (November 22, 2023, 15:00-18:00): Data preprocessing I

Additional materials

Intructions

QTvity collaboration:

Questions and Answers activity: QTvity
Course pwd: MPSDmKD2023

Points for QTvity collaboration during the course lectures in 2023/24:

    22. 11. 2023Total
No. Student Ans.PtsΣ andΣ pts
1Karin    
2Jordan    
3Erik    
4Maja    

Domain and data for practical demostrations

Domain description and data sets: ASHRAE - Great Energy Predictor III

Training datasets can be downloaded as zip file from ASHRAE_data.zip (108 MB).

Initial script in R can be downloaded from Kaggle_EC_begin.R.

ER diagram for three data files (tables): METER, BUILDING, WEATHER:

ER diagram of ASHRAE

Literature:

Gordon S. Linoff. Data Analysis Using SQL and Excel. Wiley, 2008.

Dorian Pyle. Data Preparation for Data Mining. Morgan Kaufmann, 1999.

Gerhard Widmer et al. In Search of the Horowitz Factor. AI Magazine, 2003.

Last update: 8. 11. 2023