LAB OF INFORMATION SYSTEMS AND ANALYTICS
- Anno accademico
- 2023/2024 Programmi anni precedenti
- Titolo corso in inglese
- LAB OF INFORMATION SYSTEMS AND ANALYTICS
- Codice insegnamento
- ET7008 (AF:386287 AR:216838)
- Lingua di insegnamento
- Inglese
- Modalità
- In presenza
- Crediti formativi universitari
- 6
- Livello laurea
- Laurea
- Settore scientifico disciplinare
- INF/01
- Periodo
- 4° Periodo
- Anno corso
- 2
- Sede
- RONCADE
- Spazio Moodle
- Link allo spazio del corso
Inquadramento dell'insegnamento nel percorso del corso di studio
In particular, students should be able to exploit modern AI approaches to extract meaningful information starting from raw data of various kind.
Such competences need a strong theoretical and practical knowledge of data analysis.
The goal of this course is to teach students methods and technologies for effective data analysis, discussing the fundamental techniques for predictive and descriptive analysis of data.
During the lectures several tools and techniques will be presented, from both theoretical and practical aspects, so that students will be able to compare such tools and extract knowledge from the presented datasets.
The results of the aforementioned analysis are exploited as a starting point for further decisions and considerations.
Risultati di apprendimento attesi
Students should also be able to produce a comparative analysis report, including data representation.
Students will achieve the following learning outcomes, divided in three main areas:
1. Knowledge and understanding:
- understanding the theoretical bases of the main algorithms presented during lectures;
- understanding principles and differences of non-supervised learning algorithms;
- understanding principles and differences of supervised learning algorithms.
2. Applying knowledge and understanding in practical situations:
- being able to apply proper supervised and unsupervised analysis techniques to data;
- being able to use data analysis software tools used during lectures (e.g., scikit-learn);
- being able to compare and correctly interpret different analysis results from different algorithms
3. Communication:
- reporting comprehensive comparative analysis among different data analysis methods;
- being able to present results with appropriate figures and diagrams.
Prerequisiti
Contenuti
- Data-driven approaches and Big Data
- What is Machine Learning and Data Mining: concepts of supervised and unsupervised approaches
- Kinds of data
- Managing a Data Science project
2. Clustering:
- Dimensionality reduction
- Clustering quality evaluation;
3. Supervised Learning
- Model training, validation and tuning; Feature Engineering
- Classification; Regression; Decision Trees;
4. Similarity Search in Text
- Text representation; Tokenization, Stemming, Lemmatization; Vector space; Similarity measures;
Testi di riferimento
Modalità di verifica dell'apprendimento
The project requires to apply data analysis methods to a given dataset of limited complexity and requires to conduct a comparative analysis of different tools applied to a specific dataset or problem.
The student must chose and motivate the most appropriate solution and deliver a report discussing a comparative analysis of the chosen methods.