DATA ANALYTICS AND ARTIFICIAL INTELLIGENCE - 1
- Academic year
- 2025/2026
- Course title in English
- DATA ANALYTICS AND ARTIFICIAL INTELLIGENCE - 1
- Course code
- EM1405 (AF:561279 AR:326500)
- Teaching language
- English
- Mode of delivery
- Blended (in person and online)
- University credits
- 6 out of 12 of DATA ANALYTICS AND ARTIFICIAL INTELLIGENCE
- Degree level
- Master's degree (DM270)
- Scientific disciplinary sector
- ING-INF/05
- Period
- 3rd Period
- Course year
- 1
- Location
- VENEZIA
- Moodle space
- Link to the course space
Role of the course within the degree programme
The goal of this course is to enable students to understand and exploit predictive data science techniques, including both supervised (classification and regression) and unsupervised (clustering) methods. The course includes the use of data mining software tools through the Python programming language.
Expected learning outcomes
Students will achieve the following learning outcomes:
Knowledge and understanding: i) understanding the principles of unsupervised learning; ii) understanding the principles of supervised learning; iii) understanding the principles of data pre-processing and feature engineering.
Applying knowledge and understanding: i) being able to apply supervised and unsupervised analysis techniques; ii) being able to use data analysis software tools (e.g., scikit-learn).
Making judgements: i) being able to choose the most appropriate method for a given problem and to evaluate its performance.
Communication: i) reporting a comprehensive comparative analysis of different data analysis methods.
Prerequisites
(even without passing the corresponding exams).
Contents
- Feature engineering: text, numerical and categorical data; importance of similarity functions.
- Unsupervised Learning: clustering algorithms, k-means, hierarchical, DBSCAN; evaluation.
- Collaborative filtering: content-based and item-based recommendation algorithms.
- Supervised Learning: regression and classification algorithms; logistic classifier, SVM; decision trees; evaluation.
- Model tuning and selection: bias and variance, overfitting, underfitting.
- Ensemble methods: Bagging, Boosting, Random Forest.
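As an illustration of the unsupervised learning topic listed above, the following is a minimal sketch (not official course material) of k-means clustering evaluated with the silhouette coefficient in scikit-learn. The synthetic dataset, the candidate values of k and all variable names are assumptions made for the example.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic data (an assumption for illustration): three well-separated 2-D blobs.
X, _ = make_blobs(
    n_samples=300,
    centers=[[-10, -10], [0, 0], [10, 10]],
    cluster_std=1.0,
    random_state=0,
)

# Standardise the features so that each one contributes equally to distances.
X_scaled = StandardScaler().fit_transform(X)

# Fit k-means for several values of k and record the mean silhouette coefficient.
scores = {}
for k in range(2, 7):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X_scaled)
    scores[k] = silhouette_score(X_scaled, labels)

# Select the k with the highest silhouette coefficient.
best_k = max(scores, key=scores.get)
for k, score in scores.items():
    print(f"k={k}: silhouette={score:.3f}")
print("best k:", best_k)
```

The silhouette coefficient is only one possible evaluation criterion; on real data the choice of k usually also depends on domain knowledge.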
Reference texts
- Python Data Science Handbook. Jake VanderPlas. O'Reilly. 2016-2021
Assessment methods
The written exam consists of questions and a short exercise on the theory of the topics covered during the course. It evaluates the theoretical knowledge gained by the student.
The project requires the student to conduct a comparative analysis of different tools applied to a specific dataset or problem.
The student must choose and justify the most appropriate solution and deliver a report, to be discussed with the teacher. The project work evaluates the student's ability to apply the theoretical knowledge to a real-world case study.
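A comparative analysis of this kind could be sketched as follows. This is only a hedged example, not the required project format: the dataset (scikit-learn's bundled breast cancer data), the three candidate models, the 5-fold cross-validation and the accuracy metric are all assumptions chosen for illustration.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Example dataset (an assumption; a real project would use its own data).
X, y = load_breast_cancer(return_X_y=True)

# Candidate models. Scaling is placed inside the pipeline so that it is
# re-fitted on the training part of each cross-validation fold.
candidates = {
    "logistic regression": make_pipeline(
        StandardScaler(), LogisticRegression(max_iter=1000)
    ),
    "SVM (RBF kernel)": make_pipeline(StandardScaler(), SVC()),
    "random forest": RandomForestClassifier(n_estimators=100, random_state=0),
}

# Evaluate every candidate with the same 5-fold cross-validation protocol.
results = {}
for name, model in candidates.items():
    fold_scores = cross_val_score(model, X, y, cv=5, scoring="accuracy")
    results[name] = (fold_scores.mean(), fold_scores.std())
    print(f"{name}: accuracy = {fold_scores.mean():.3f} +/- {fold_scores.std():.3f}")

# The model with the highest mean accuracy under this protocol.
best_name = max(results, key=lambda n: results[n][0])
print("highest mean accuracy:", best_name)
```

Mean accuracy alone rarely settles the choice; a report would normally also consider the spread across folds, other metrics, and the cost and interpretability of each model.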
Exam format
The instructor has the duty to ensure that the rules on the authenticity and originality of exam work are respected. Consequently, where irregular behaviour is suspected, the exam may include a further in-depth assessment, held at the time of the exam, which may also be carried out in a format different from those described above.
Grading scale
Written exam (mandatory)
To be taken in person at the end of the course (i.e. from the end of May).
This written exam will include three theoretical questions, about topics covered during the whole course, and a small exercise, asking the student to design from scratch a solution to a practical problem.
Each theoretical question is worth up to 4 points and the exercise is worth up to 20 points.
Students must submit the answers to the theoretical questions within 30 minutes from the start of the exam, and the whole solution must be submitted within 90 minutes overall (i.e. a student who submits the theoretical questions early will have more time available for the exercise).
Teaching methods
Teaching material is delivered through the Moodle platform.
During the course, the Python programming language is used together with the scikit-learn library. Students are encouraged to bring their own laptops.
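To give a flavour of the fit, predict and evaluate workflow that scikit-learn supports, here is a minimal sketch that trains decision trees of different depths and compares training and test accuracy. It is not taken from the course material: the synthetic dataset, the chosen depths and the variable names are assumptions made for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic, noisy binary classification data (an assumption for illustration).
X, y = make_classification(
    n_samples=500, n_features=20, n_informative=5, flip_y=0.1, random_state=0
)

# Hold out 30% of the samples to estimate performance on unseen data.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Train trees of increasing complexity; max_depth=None grows the tree fully.
accuracy = {}
for depth in (1, 3, None):
    tree = DecisionTreeClassifier(max_depth=depth, random_state=0)
    tree.fit(X_train, y_train)
    accuracy[depth] = (tree.score(X_train, y_train), tree.score(X_test, y_test))
    print(
        f"max_depth={depth}: train={accuracy[depth][0]:.3f}, "
        f"test={accuracy[depth][1]:.3f}"
    )
```

A fully grown tree typically fits the training set almost perfectly; a large gap between its training and test accuracy is a common sign of overfitting.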