space
Section for Cognitive Systems
DTU Compute

02450 Introduction to Machine Learning and Data Mining

Georgios Arvanitidis
Georgios Arvanitidis
 
Bjørn Sand Jensen
Bjørn Sand Jensen
 
Tue Herlau
Tue Herlau
 
Morten Mørup
Morten Mørup
 
Mikkel N. Schmidt
Mikkel N. Schmidt
 
_________________
Teaching Assistants:

 
Anna Emilie Jennow Wedenborg
Anna Emilie Jennow Wedenborg
 
Niccolo' Ciolli
Niccolo' Ciolli
 
Alejandro Valverde Mahou
Alejandro Valverde Mahou
 
Diego Rodriguez Gordo
Diego Rodriguez Gordo
 
Albert Kjøller Jacobsen
Albert Kjøller Jacobsen
 
Nándor Takács
Nándor Takács
 
Jeremi Wojciech Ledwon
Jeremi Wojciech Ledwon
 
Clara Davila Duarte
Clara Davila Duarte
 
Emmie Cluzel
Emmie Cluzel
 
Emma Klostergaard Christensen
Emma Klostergaard Christensen
 
Mikkel Broch-Lips
Mikkel Broch-Lips
 
Reinis Muiznieks
Reinis Muiznieks
 
Asger Bjørn Larsen
Asger Bjørn Larsen
 
David Miles-Skov
David Miles-Skov
 
Farkas Attila Jakab
Farkas Attila Jakab
 
Rémi Thomas Larnouhet
Rémi Thomas Larnouhet
 
Peter Vestereng Larsen
Peter Vestereng Larsen
 
Aleksandra Wozniak
Aleksandra Wozniak
 
Zixi Luo
Zixi Luo
 

Machine learning and data mining

The course is designed around a data modeling framework shown in the figure. Each lecture/assignment will focus a subset of the data modeling framework.

data modeling framework

We emphasize the holistic view of modeling in order to motivate and stress the relevance of individual components and building blocks, disseminate the obtained competence (see the course learning obejctives), and make them applicable for a broad spectrum of engineering problems in e.g. biomedical engineering, chemistry, electrical engineering, and informatics.

Resources

DTU Learn

If you are enrolled in the course you can access material and participate in the course through the DTU Learn homepage.

Lectures

The lectures will take place in Building 116 auditorium 81 on Tuesdays from 13:00-15:00.

Due to restrictions on the auditorium capacity, we will stream the lecture to Building 116 auditorium 82. Seats in Building 116 auditorium 81 and Building 116 auditorium 82 will be allocated on a first come, first served principle. You can use the rooms allocated to exercises to stream the lecture yourself (Zoom link on DTU Learn).

We will record the lecture and make available online on DTU Learn.

Exercises

Exercises will take place after lectures Tuesdays from 15:00-17:00.

There is room capacity for all signed up students at the exercises every week.

Please bring a laptop computer for the exercises. The exercises will be available in Matlab, R, and Python and we recommend selecting a language you are familiar with. If you are unfamiliar with any of the languages, we recommend Python. The exercise rooms are (room capacity square brackets and programming language in parentheses):

Exercises on Microsoft Teams :

Reading material, lecture slides and exercises

The course will use lecture notes and other freely available material. Lecture notes, slides, course assignment instructions etc. is available at the DTU learn course page (requires formal enrolment to the course).

Online demos

We have developed several online demos which illustrates key concepts from the course. The topics discussed currently includes PCA, regression, classification and density estimation.

Course description

A description of the course can be found at the DTU Coursebase

Online help and support

Online help and support is available through the discussion forum.

Teachers

Lecture schedule

Week Date Who Subject Reading Homework Project deadlines
(17:00 CET via DTU Learn)
1 3 September, 2024 GEAR Introduction C1
Data: Feature extraction, and visualization
2 10 September, 2024 GEAR Data, feature extraction and PCA C2, C3 P3.1, P2.1, P3.2
3 17 September, 2024 GEAR Measures of Similarity, summary statistics and probabilities C4, C5 P4.1, P4.2, P4.3
4 24 September, 2024 GEAR Probability densities and data visualization C6, C7 P6.1, P6.2, P7.1
Supervised learning: Classification and regression
5 1 October, 2024 GEAR Decision trees and linear regression C8, C9 P9.1, P8.1, P8.2 Project 1 due 3 Oct.
6 8 October, 2024 GEAR Overfitting, cross-validation and Nearest Neighbor C10, C12 P10.1, P10.2, P12.1
Holiday
7 22 October, 2024 GEAR Performance evaluation, Bayes, and Naive Bayes C11, C13 P13.1, 13.2, P12.2 Project 1 feedback available 24 Oct.
8 29 October, 2024 GEAR Artificial Neural Networks and Bias/Variance C14, C15 P15.1, P15.2, P15.3 (Project 1 resubmission due 31 Oct.)
9 5 November, 2024 GEAR AUC and ensemble methods C16, C17 P16.1, P16.2, P17.1
Unsupervised learning: Clustering and density estimation
10 12 November, 2024 BJJE K-means and hierarchical clustering C18 P18.1, P18.2, P18.3 Project 2 due 14 Nov.
11 19 November, 2024 BJJE Mixture models and density estimation C19, C20 P20.1, P19.1, P19.2
12 26 November, 2024 BJJE Association mining C21 P21.1, P18.2, P18.3 Project 2 feedback available 28 Nov.
Summary
13 3 December, 2024 GEAR Recap and discussion of the exam C1-C21 (Project 2 resubmission due 5 Dec.)

(Cx refers to Chapter x of the course notes. Px.y refers to problem number y in chapter x of the course notes.
The first listed problem will be that weeks discussion question at the exercises.)

FAQ

DTU logo space
space