DATA MANAGEMENT
- Academic year
- 2025/2026 Syllabus of previous years
- Official course title
- GESTIONE DEI DATI DIGITALI
- Course code
- NS001B (AF:582240 AR:328523)
- Teaching language
- Italian
- Modality
- On campus classes
- ECTS credits
- 6
- Degree level
- Minor
- Academic Discipline
- INF/01
- Period
- 2nd Semester
- Course year
- 1
- Where
- VENEZIA
- Moodle
- Go to Moodle page
Contribution of the course to the overall degree programme goals
The course aims to provide students with an introductory, methodological and applied preparation on the main processes of digital data management, with particular attention to the integration of visual data analysis tools, elementary Machine Learning techniques and applications of Generative Artificial Intelligence.
The course is designed for students from different disciplinary backgrounds and does not require prior programming skills. Through guided laboratory activities, participants will learn to build simple data analysis workflows with Orange Data Mining and to design a virtual tutor or conversational assistant with Flowise, possibly integrating documents, knowledge bases and RAG architectures.
By the end of the course, students will be able to develop a digital data management project aimed at solving a practical or knowledge-based problem, documenting the phases of data acquisition, preparation, analysis, visualization and valorization through specialist and generative AI tools.
Expected learning outcomes
By the end of the course, students will be able to:
• describe the fundamental characteristics of digital data;
• distinguish between structured, semi-structured and unstructured data;
• illustrate the main stages of the digital data management process;
• understand the role of data in analysis, forecasting and decision-making processes;
• describe, at an introductory level, the main data analysis techniques: classification, regression, clustering, exploratory analysis and visualization;
• understand the meaning of vector representation of texts, images and documents;
• distinguish between specialist Artificial Intelligence and Generative Artificial Intelligence;
• describe the general logical functioning of a Large Language Model;
• understand the concepts of prompt, embedding, vector store, retrieval and RAG architecture;
• recognize the potential, limitations and risks of AI tools applied to digital data management.
Ability to apply knowledge and understanding
Students will be able to:
• acquire and organize digital datasets according to an analysis question;
• use visual tools to explore, clean and transform digital data;
• build a data analysis workflow using Orange Data Mining;
• apply simple classification, regression, clustering or visualization techniques;
• interpret the outputs produced by an analysis workflow;
• represent results through appropriate charts, diagrams and visualizations;
• use generative AI tools to summarize, classify, reorganize and query digital content;
• design effective prompts to guide the processing of texts, data and documents;
• configure a simple virtual tutor or conversational assistant using Flowise;
• design a simple RAG architecture for querying a document base.
Judgement skills
Students will be able to:
• select appropriate tools, techniques and workflows in relation to the problem addressed;
• assess the quality of the available data and recognize their limitations, incompleteness or distortions;
• critically interpret the results produced by analysis models and AI tools;
• distinguish between correlation, forecasting, classification and explanation;
• recognize possible errors, biases, hallucinations or unverifiable responses produced by generative systems;
• assess the appropriateness of a digital solution in relation to the context of use, the target users and the decision-making or knowledge-related objectives.
Communication skills
Students will be able to:
• clearly present the stages of a digital data management project;
• describe a data analysis workflow using correct basic technical terminology;
• communicate analysis results through charts, visualizations and short reports;
• document the functioning of a virtual tutor or conversational assistant;
• explain the methodological and operational choices adopted in the two assignments;
• collaborate with peers and the instructor during exercises, reviews and project activities.
Pre-requirements
Contents
PART ONE
The digital data management process
Software tool: Orange Data Mining
1. The new digital intelligence and the data life cycle
• Data, information and knowledge
• Digital data as a cognitive and decision-making resource
• The stages of the digital data management process
• Acquisition, storage, processing, representation, activation and adaptation
• Relationship between data, models, forecasts and decisions
• Human intelligence, artificial intelligence and hybrid intelligence
2. Data acquisition, storage and quality
• Sources of digital data
• Structured, semi-structured and unstructured data
• Datasets, observations, variables and metadata
• Common formats: CSV, Excel, texts, images, documents
• Data quality criteria
• Errors, missing values, duplicates, anomalies and inconsistencies
• Logical organization of data according to the analysis
3. Exploratory analysis, pre-processing and data representation
• Initial exploration of a dataset
• Distributions, relationships, comparisons and trends
• Data cleaning and transformation
• Normalization, encoding and variable selection
• Vector representation of digital texts and images
• Introductory concepts of features, embeddings and similarity
• Use of Orange Data Mining to build visual workflows
4. Models for analysis, forecasting and visualization
• Difference between descriptive, predictive and decision-oriented analysis
• Classification: predicting categories
• Regression: predicting numerical values
• Clustering: identifying groups within data
• Time series and autoregressive models: trends, variations and forecasting
• Basic evaluation of results
• Data visualization and infographics
• Charts, dashboards and communication of results
PART TWO
Generative AI and digital data management
Software tools: Mistral AI Studio, Flowise
5. From specialist AI to generative AI
• Difference between specialist AI and generative AI
• Machine Learning, Deep Learning and Large Language Models
• Applications of generative AI to digital data management
• Processing of texts, tables, documents and semi-structured content
• Opportunities, limitations and risks of generative automation
• Critical verification of model-generated outputs
6. Logical-functional architecture of an LLM and prompting techniques
• Tokens, context and language generation
• Embeddings and semantic representation
• Training, inference and alignment
• Hallucinations, bias and output control
• Descriptive, instructional and constrained prompts
• Role prompting, few-shot prompting and structured prompts
• Prompts for analysis, summarization, classification, extraction and data transformation
7. Data processing through generative AI
• Analysis and summarization of documents
• Extraction of information from texts and digital content
• Qualitative classification of content
• Generation of tables, diagrams, reports and operational summaries
• Design of generative workflows to support knowledge-based and decision-making activities
• Use of generative AI platforms, with particular reference to Mistral AI Studio
8. Design of agents, virtual tutors and RAG architectures
• From chatbot to generative AI agent
• System instructions, objectives, constraints and conversational behavior
• Memory, tools and workflows
• Retrieval-Augmented Generation
• Indexing of documents and knowledge bases
• Vector stores and semantic search
• Design of a virtual tutor with Flowise
• Overview of multi-agent systems and orchestrators
Referral texts
1. G.B. Ronsivalle, La nuova intelligenza digitale. Come trasformare i dati in decisioni per progettare il futuro, Maggioli Editore, Apogeo Education Series, 2022.
2. G.B. Ronsivalle, I. Baccan, A. Bersan, The Orange Box. Il nuovo laboratorio di Machine Learning, Edizioni Wemole, 2024.
Supplementary teaching materials
The instructor will also provide:
• course slides;
• operational handouts;
• practice datasets;
• tutorials for Orange Data Mining;
• tutorials for Mistral AI Studio and Flowise;
• workflow examples;
• examples of prompts and virtual tutor configurations;
• documents or text corpora for RAG exercises.
Assessment methods
Step 1 — Development of a workflow with Orange Data Mining
Maximum score: 15 points
Minimum passing threshold: 9 points
The assignment consists of creating a data management and analysis workflow using Orange Data Mining.
Students will be required to:
• select or use a dataset provided by the instructor;
• formulate an analysis question;
• import and organize the dataset in Orange;
• carry out exploratory and pre-processing operations;
• apply at least one analysis, modelling or visualization technique;
• interpret the results obtained;
• produce a short descriptive report on the workflow.
Required submission
• Orange workflow file.
• Dataset used or indication of the source.
• Short descriptive report.
• Any screenshots of the main visualizations.
• Interpretative comment on the results.
Step 2 — Development of a virtual tutor with Flowise
Maximum score: 15 points
Minimum passing threshold: 9 points
The assignment consists of designing and implementing a virtual tutor or conversational assistant using Flowise.
Students will be required to:
• define an application domain;
• identify the target users and purpose of the tutor;
• design the conversational behavior of the agent;
• configure prompts, instructions and constraints;
• optionally integrate a document base through a RAG architecture;
• test the system through examples of interaction;
• document limitations, risks and possible improvements.
Required submission
• Export or documentation of the Flowise workflow.
• Description of the virtual tutor developed.
• System prompt or main instructions.
• Any document corpus used.
• Examples of questions and answers.
• Short critical report on the functioning of the system.
Type of exam
The lecturer has a duty to ensure that the rules regarding the authenticity and originality of exam tests and papers are respected. Therefore, if there is suspicion of irregular conduct, an additional assessment may be conducted, which could differ from the original exam description.
Grading scale
Step 1 — Workflow with Orange Data Mining | Maximum score: 15 points | Minimum threshold: 9 points
Step 2 — Virtual tutor with Flowise | Maximum score: 15 points | Minimum threshold: 9 points
Total: Maximum score: 30 points | Minimum threshold: 19 points
The final grade is expressed out of thirty.
Honours may be awarded in the case of particularly original, complete and rigorous assignments, capable of effectively integrating data analysis, visualization, agent design and critical evaluation of results.
The instructor may require a short supplementary interview, possibly in conjunction with the presentation of the assignments, in cases where it is necessary to further assess the authenticity, originality or actual understanding of the work carried out.
Teaching methods
• theoretical presentation sessions supported by multimedia slides;
• moments of interaction and guided discussion;
• analysis of use cases;
• operational demonstrations of the software tools;
• individual and group exercises;
• guided simulations;
• laboratory activities with Orange Data Mining;
• laboratory activities with Mistral AI Studio and Flowise;
• progressive review of the final assignments.
The course adopts a learning-by-doing approach. Each theoretical concept is connected to a practical activity, an applied example or a phase of the final project.
Asynchronous materials, video tutorials and operational worksheets may be made available to further explore the use of the software tools employed during the course.
2030 Agenda for Sustainable Development Goals
This subject deals with topics related to the macro-area "Human capital, health, education" and contributes to the achievement of one or more goals of U. N. Agenda for Sustainable Development