Master of science-level of the Bologna process in Ingegneria Informatica (Computer Engineering) - Torino Master of science-level of the Bologna process in Data Science And Engineering - Torino Master of science-level of the Bologna process in Data Science And Engineering - Torino Master of science-level of the Bologna process in Ingegneria Informatica (Computer Engineering) - Torino
The course aims to provide a comprehensive guide to the adoption of artificial intelligence algorithms in medicine. The course provides an overview of all the steps involved in processing health data, starting with raw data and developing algorithms for use in clinical and/or home settings. To this end, the most common types of data, from electronic records to images, from physical signals to physiological signals, will be described. Basic and advanced techniques for processing various data, extracting information, selecting the most meaningful ones, and using them to develop prediction algorithms will be described. Supervised learning techniques for classification and regression will be presented, along with their optimization and comprehensive validation. Finally, sensitive and important topics for proper development and use in real life will be covered. The methods learned will be put into practice during the laboratories. In addition, the seminars will present specific practical methods used in clinical and research settings.
The course aims to provide students with a solid methodological and practical foundation for the application of artificial intelligence (AI) algorithms in the medical context. The teaching is aimed at the development of transversal and specialized skills consistent with the cultural and professional profiles of the course of study, such as the analysis and interpretation of complex health data, the design of predictive models, and the critical understanding of the ethical and practical implications of AI technologies in the clinical setting. Specifically, the course contributes to:
• Improve the ability to understand and critically analyse heterogeneous biomedical data.
• Apply advanced machine learning and deep learning techniques to solve real-world problems in medical and care settings.
• Develop independent judgment in evaluating the performance and robustness of developed models, considering aspects such as generalizability, bias, and privacy.
The course is part of an educational landscape geared toward the integration of informatics, medicine, and biomedical engineering and prepares for job opportunities in areas such as clinical research, healthcare companies, innovative digital health startups, and AI technology development centers for personalized medicine.
By the end of the course, the student will have acquired knowledge and methods for comprehensive analysis of medical data, starting with raw data from different sources and providing a robust, accurate, and generalizable algorithm. The student will have a comprehensive understanding of the types of data used in medicine, their origin and structure; be able to define analysis approaches appropriate to the specific objective and depending on the data at hand; know how to process signals and images and extract information from raw data; be able to implement and optimize machine learning and deep learning models; will know how to validate and test supervised learning models accurately and robustly; will be able to evaluate the performance of a model comprehensively and objectively; will have a clear idea of how to handle challenges related to algorithm complexity, generalizability, data sparsity and imbalance, sources of bias, and privacy management.
At the end of the course, students will have acquired advanced knowledge, skills, and competencies related to the theoretical and practical foundations of AI applied to medicine. Students will:
• Have a deep understanding of the main sources of biomedical data, pre-processing and transformation techniques, performance and model evaluation methods, and emerging challenges in the application of AI in clinical settings.
• Explore heterogeneous clinical datasets, identifying typical problems related to data quality and structure; apply techniques for cleaning, transforming and representing biomedical data; implement efficient and effective machine learning and deep learning pipelines, validate and test models rigorously, comprehensively evaluating classification/regression performance and assessing bias and generalization.
• Use development tools (e.g., Python, scikit-learn, TensorFlow) to design and document robust and generalizable algorithmic solutions; collaborate on hands-on projects, contributing to the design and presentation of real-world data-driven solutions.
• Be able to integrate computer science, statistics and biomedical knowledge to contribute to the development of intelligent systems for early diagnosis, continuous monitoring and personalized medicine; work in interdisciplinary settings, communicating effectively with health care and technology professionals; evaluate the ethical, legal and social impacts of developed solutions.
Python programming; notions of statistics and machine learning
Python programming; notions of statistics, machine learning, deep learning
Module 1: Biomedical Data Sources (9 hours)
• Introduction
• Electronic Health Records (EHR)
• Medical imaging
• Biomedical signals
• Data characterization
• Public datasets
Module 2: Pre-processing (6 hours)
• Missing values handling
• Outliers detection and imputation
• Reshape and Resample
• Scaling and normalization
• Data representation and transformation
• Filtering: low-pass, high-pass, band-pass, notch
• Feature extraction
• Feature selection
Module 3: Artificial intelligence (9 hours)
• Introduction
• Fuzzy systems
• Machine learning
• Deep learning
• Training and optimization
Module 4: Performance and model evaluation (9 hours)
• Regression metrics
• Classification metrics
• Validation methods
• Test methods
• Generalization
Module 5: Challenges and Opportunities (9 hours)
• Handling data scarcity
• Handling label scarcity
• Preserving privacy
• Interpretability and Explainability
• Mitigating bias
• Edge processing
• Generative AI
Laboratories: 18 hours
• Pre-processing (part 1)
• Pre-processing (part 2)
• Machine learning model training and optimization
• Deep learning models training and optimization
• Performance evaluation, generalization
• Fairness and explainability
Seminars: 3 hours
• Module 1: Biomedical Data Sources (9 hours)
The main sources of medical data are introduced and carefully described, including structured data (i.e., electronic health records), physical/physiological signals (electrocardiography, electroencephalography, electromyography, photoplethysmography, electrodermal activity, inertial signals), and imaging (radiography, computed tomography, magnetic resonance). Particular attention is devoted to acquisition systems, data format, resolution, dimensions, units of measurement, noise sources, sampling rate, frequency range, typical signal shape, and clinical relevance. An overview of the most relevant public datasets is provided.
• Module 2: Pre-processing (9 hours)
Data cleaning methods are presented, including filtering, missing values and outliers detection and imputation. Pre-processing approaches are described, from normalization to data transformation, from feature extraction to feature selection.
• Module 3: Artificial intelligence (9 hours)
AI algorithms are presented, from simple (fuzzy systems) to complex (deep neural networks). Several machine learning models are described, with a particular focus on their hypothesis on data structure, potential and limitations, and practical tips. An overview of the training and optimization settings for complex deep learning algorithms is provided, and practical suggestions are given for developing effective, robust, and bias-free prediction models.
• Module 4: Performance and model evaluation (9 hours)
Performance evaluation metrics for classification and regression problems are described, along with their pros and cons. A comprehensive overview of validation and test methods is given, along with important considerations on their applicability and efficacy. Issues on generalization ability are discussed, as well as possible solutions.
• Module 5: Challenges and Opportunities (9 hours)
Limitations of supervised learning strategies are discussed, including handling data and label scarcity. Privacy-preserving methods (e.g. federated learning) are introduced, essential for medical applications. Attention will be given to interpretability and explainability of health-oriented diagnostic/monitoring systems. Bias identification and mitigation strategies are described. Technical requirements for edge processing and real-time prediction are listed. The potential integration of generative AI in clinical decision support systems is discussed.
• Laboratories: 12 hours
Laboratories will cover all theoretical concepts and methods, applying these in real-world medical problems. These include data pre-processing, machine/deep learning model training and optimization, performance evaluation, and fairness, explainability, and generalizability.
• Seminars: 3 hours
Seminars will provide an in-depth overview of AI applications on different data sources for specific objectives. These include AI-based analysis of multi-modal medical data for better understanding, early diagnosis, and monitoring of acute and chronic disorders.
The course consists of 42 hours of classroom lectures and 18 hours of laboratory exercises. The laboratory exercises will allow students to put into practice the concepts and methods learned in class. These activities will be preparatory to the development of an individual or group project that will contribute to the final grade. Seminars describing practical applications to real problems related to early diagnosis or continuous monitoring of chronic diseases are planned.
• Classroom lectures (45 hours): theoretical lectures dedicated to the introduction and in-depth study of the fundamental concepts of AI applied to the medical context.
• Exercises and practical labs (12 hours): computer lab activities aimed at the practical application of the theoretical content covered.
• Thematic seminars (3 hours): meetings with researchers and clinical staff, dedicated to the presentation of real case studies and applications of AI in the diagnostic, therapeutic or monitoring fields.
• Individual or group project: students will develop an application project, individually or in small groups, aimed at solving a real problem using the techniques learnt. The project will be assessed in the final examination.
- Aurelie Geron, Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems
• Lecture slides: made available weekly on the online teaching platform, to support classroom explanation.
• Interactive notebooks (Python/Jupyter): used during exercises and laboratories to explore machine learning and deep learning algorithms on clinical data.
• Handouts and summary sheets: containing theoretical summaries, glossaries of terms and conceptual diagrams.
• Recorded video lectures: available for each course module, accessible asynchronously on the e-learning platform.
• Multimedia material and links to open-access resources: scientific articles, public medical datasets, online courses, and visual content for self-study.
Supplementary material includes:
[1] Aurélien Géron, Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow, 2nd ed., O'Reilly Media, 2019 (introduce and explore the main techniques of machine learning and deep learning with practical examples)
[2] E. Topol, Deep Medicine: How Artificial Intelligence Can Make Healthcare Human Again, Basic Books, 2019 (provides application and critical context on the impact of AI in clinical practice)
Slides; Dispense; Esercitazioni di laboratorio; Video lezioni dell’anno corrente; Materiale multimediale ;
Modalità di esame: Prova scritta (in aula); Elaborato progettuale individuale; Elaborato progettuale in gruppo;
Exam: Written test; Individual project; Group project;
...
The exam consists of three parts, which contribute to the determination of the final grade (32 points max)
(a) multiple-choice written test (12 points)
(b) delivery of laboratory reports (12 points)
(c) presentation of individual or group project (8 points)
Gli studenti e le studentesse con disabilità o con Disturbi Specifici di Apprendimento (DSA), oltre alla segnalazione tramite procedura informatizzata, sono invitati a comunicare anche direttamente al/la docente titolare dell'insegnamento, con un preavviso non inferiore ad una settimana dall'avvio della sessione d'esame, gli strumenti compensativi concordati con l'Unità Special Needs, al fine di permettere al/la docente la declinazione più idonea in riferimento alla specifica tipologia di esame.
Exam: Written test; Individual project; Group project;
The examination consists of two parts:
1. Written examination (duration: 1 hour, maximum score: 20): Multiple-choice test consisting of approximately 20 questions covering the theoretical and application topics covered during the course. It is not permitted to consult texts, notes or electronic devices connected to the Internet.
2. Individual or group application project (duration 20 minutes, maximum score: 10): Development of a project with a real medical dataset, in which machine learning/deep learning methods covered in the lecture are applied and evaluated.
Honours are awarded to students who achieve maximum marks in both components and demonstrate excellence, originality and advanced mastery of the topics. This will be assessed though specific questions during the project presentation.
The examination modalities are kept the same in all appeals, even if the candidate refuses a previous mark.
In addition to the message sent by the online system, students with disabilities or Specific Learning Disorders (SLD) are invited to directly inform the professor in charge of the course about the special arrangements for the exam that have been agreed with the Special Needs Unit. The professor has to be informed at least one week before the beginning of the examination session in order to provide students with the most suitable arrangements for each specific type of exam.