STATISTICAL INFERENCE AND LEARNING

Academic year
2018/2019 Syllabus of previous years
Official course title
STATISTICAL INFERENCE AND LEARNING
Course code
CM0471 (AF:274848 AR:159110)
Modality
On campus classes
ECTS credits
6
Degree level
Master's Degree Programme (DM270)
Educational sector code
SECS-S/01
Period
1st Semester
Course year
1
Where
VENEZIA
Moodle
Go to Moodle page
This course belongs to the educational activities of the Master in Computer Science that allow the student to acquire advanced instruments for data analysis and machine learning. The objective of the course is to develop statistical skills for the analysis of high dimensional data and solve forecasting and classification problems occurring in a wide variety fields, including technology, science, medicine, economics and business.
Regular and active participation in the teaching activities offered by the course and in independent research activities will enable students to:
1. (knowledge and understanding)
- acquire knowledge and understanding regarding advanced statistical learning methods for synthesis, prediction and classification using data also in presence of complex structures and high-dimensionality
2. (applying knowledge and understanding)
- apply autonomously advanced statistical methods for synthetize information, make predictions and classifications using high-dimensional data
- apply autonomously statistical software for analysis of high-dimensional data
3. (making judgements)
- make autonomous judgements about the validity and feasability of different statistical techniques and understand the effects of these on the outcomes of the analyses
Basic knowledge of probability at the level of a Bachelor in Computer Science is assumed. List of assumed topics: events, axioms of probability, conditional probability and independence, random variables, expected value, variance, covariance and correlation, principal discrete random variables (binomial and Poisson), principal continuous random variables (uniform, normal and exponential), central limit theorem, law of large numbers. For example, the above topics are covered in Baron (2014), chapters 2-3-4.

Baron M (2014). Probability and Statistics for Computer Scientistis. Second Edition. CRC Press.
The course is divided into two parts. The first part focuses on general concepts of statistical inference. The topics of the first part are propaedeutic for the second part about statistical learning. Table of contents that will be presented and discussed through the course:
1. Review of statistical inference
-- point estimation
-- interval estimation
-- hypothesis testing
2. Statistical learning
-- linear regression
-- classification
-- resampling methods
-- linear model selection and regularization
Applications with R language (www.r-project.org) are an integral part of the two parts of the course.
- James G, Witten D, Hastie T, Tibshirani R (2015). An Introduction to Statistical Learning. 6th version. Springer. Webpage http://www-bcf.usc.edu/~gareth/ISL/ Chapters 1-6
- Additional reading and materials distributed during the course through moodle
The achievement of the course objectives is assessed through a written exams subdivided in two parts. The first part is a traditional written exam of 1.5 hours with exercises designed to evaluate the theoretical knowledge of the course topics. The second part lasts 2 hours and regards the analysis of a dataset using the R software. This second part evaluates the student’s ability to apply the methods learn during the course to real problems. The maximal score for each of two exams is 16 points. The final score is given by the sum of the two scores. A total score exceeding 30 corresponds to 30 with honors.
Conventional theoretical lectures complemented by exercise classes, discussion of case studies and computer labs. Teaching material prepared by the lecturer will be distributed during the course through the Moodle platform. The statistical software used in the course is R (www.r-project.org).
English
written

This subject deals with topics related to the macro-area "Climate change and energy" and contributes to the achievement of one or more goals of U. N. Agenda for Sustainable Development

Definitive programme.
Last update of the programme: 19/12/2018