DATA SCIENCE AND BUSINESS INTELLIGENCE

Academic year
2023/2024 Syllabus of previous years
Official course title
DATA SCIENCE AND BUSINESS INTELLIGENCE
Course code
EM1704 (AF:466187 AR:254568)
Modality
On campus classes
ECTS credits
6
Degree level
Master's Degree Programme (DM270)
Educational sector code
INF/01
Period
2nd Term
Course year
1
Where
VENEZIA
Moodle
Go to Moodle page
This course aims to introduce students to the founding ideas behind AI approaches for managing project data. This course focuses on the quantitative aspects of the master program, providing students with knowledge and skills in predictive data mining methods. It covers topics such as data analysis, AI approaches classification and regression, clustering and other unsupervised methods, and the exploitation of data mining software tools through Python programming. At the end of the course, students should be able to extract meaningful information from raw data and use the results of their analysis to make decisions and considerations.
At the end of the course, students should demonstrate a comprehensive understanding of the principles and applications of supervised and unsupervised learning algorithms, be able to apply the appropriate techniques to data, and manage the tools presented in lectures. They should also be able to compare different methods of data analysis, interpret results properly, and communicate their findings effectively in a comparative analysis report, including data representation.
Students should have achieved the learning outcomes of courses about Coding, and Probability and Statistics.
1. Introduction to Data Science
- Data-driven approaches and Big Data
- What is Machine Learning and Data Mining: concepts of supervised and unsupervised approaches
- Kinds of data
- Managing a Data Science project
2. Clustering:
- Dimensionality reduction
- Clustering quality evaluation;
3. Supervised Learning
- Model training, validation, and tuning; Feature Engineering
- Classification; Regression; Decision Trees;
4. Similarity Search in Text
- Text representation; Tokenization, Stemming, Lemmatization; Vector space; Similarity measures;
- Lecture notes.
- Selected readings provided during the course.
- Python Data Science Handbook. Jake VanderPlas. O'Reilly. 2016.
- Python Machine Learning - Third Edition. Sebastian Raschka, Vahid Mirjalili. Packt. 2019.
A written exam and a project are conducted to evaluate students' learning outcomes. The written exam examines the student's theoretical knowledge through questions and exercises on the topics discussed during the course. The project requires the student to compare different tools applied to a given dataset or problem, select the most suitable solution, and deliver a report. This evaluates the student's ability to apply theoretical knowledge to a practical case.
This course combines theory and hands-on sessions, using the Moodle platform to provide the material. Python programming is used to explore the topics further. Participants should bring their laptops to take advantage of the learning experience fully.
English
written and oral
Definitive programme.
Last update of the programme: 13/03/2023