Servizi per la didattica
PORTALE DELLA DIDATTICA

Data science and machine learning for engineering applications

01DSTMW, 01DSTND, 01DSTNF

A.A. 2022/23

Course Language

Inglese

Course degree

Master of science-level of the Bologna process in Ingegneria Chimica E Dei Processi Sostenibili - Torino
Master of science-level of the Bologna process in Ingegneria Energetica E Nucleare - Torino
Master of science-level of the Bologna process in Ingegneria Per L'Ambiente E Il Territorio - Torino

Borrow

01DVJNW 01DVKNW

Course structure
Teaching Hours
Lezioni 30
Esercitazioni in laboratorio 30
Teachers
Teacher Status SSD h.Les h.Ex h.Lab h.Tut Years teaching
Cerquitelli Tania Professore Associato ING-INF/05 30 0 0 0 1
Teaching assistant
Espandi

Context
SSD CFU Activities Area context
ING-INF/05 6 F - Altre attività (art. 10) Abilità informatiche e telematiche
2022/23
This course explores how new algorithms in the Data Science (DS), Machine Learning (ML), and Deep Learning (DL) disciplines can help engineering applications become smarter. These emerging disciplines could play a significant role and substantially impact various sectors. Thus, they could lead to new opportunities to develop cutting-edge and unconventional engineering applications. The course focuses on designing and implementing data-driven processes to extract knowledge from data and support decision-making. The course first introduces the data science process, focusing on its main phases. It then provides a theoretical and practical understanding of data mining and machine learning algorithms commonly used to analyze large and heterogeneous data in engineering scenarios. The course also introduces the Python language and state-of-the-art data mining and machine learning libraries to design and develop machine learning-based applications. Many laboratory sessions, based on a learning-by-doing approach, allow experimental activities on all the phases of a standard data science process - data preparation and cleaning, data exploration and characterization, machine learning algorithm selection and tuning, result assessment - using the most widespread open-source tools and libraries.
This course explores how new algorithms in the Data Science (DS), Machine Learning (ML), and Deep Learning (DL) disciplines can help engineering applications become smarter. These emerging disciplines could play a significant role and substantially impact various sectors. Thus, they could lead to new opportunities to develop cutting-edge and unconventional engineering applications. The course focuses on designing and implementing data-driven processes to extract knowledge from data and support decision-making. The course first introduces the data science process, focusing on its main phases. It then provides a theoretical and practical understanding of data mining and machine learning algorithms commonly used to analyze large and heterogeneous data in engineering scenarios. The course also introduces the Python language and state-of-the-art data mining and machine learning libraries to design and develop machine learning-based applications. Many laboratory sessions, based on a learning-by-doing approach, allow experimental activities on all the phases of a standard data science process - data preparation and cleaning, data exploration and characterization, machine learning algorithm selection and tuning, result assessment - using the most widespread open-source tools and libraries.
• Knowledge of the main phases characterizing a data science and machine learning process for a real-life engineering application. • Knowledge of the major data mining algorithms for classification, regression, clustering, and association rule mining. • Knowledge of the major machine learning and deep learning algorithms. • Knowledge of the Python language. • Knowledge of the major data mining and machine learning libraries. • Ability to design, implement and evaluate a data science process. • Ability to design, implement and evaluate a machine learning and deep learning algorithm. • Ability to use and tune data mining and machine learning algorithms. • Students will acquire all the above abilities by exploiting state-of-the-art machine learning algorithms in engineering applications.
• Knowledge of the main phases characterizing a data science and machine learning process for a real-life engineering application. • Knowledge of the major data mining algorithms for classification, regression, clustering, and association rule mining. • Knowledge of the major machine learning and deep learning algorithms. • Knowledge of the Python language. • Knowledge of the major data mining and machine learning libraries. • Ability to design, implement and evaluate a data science process. • Ability to design, implement and evaluate a machine learning and deep learning algorithm. • Ability to use and tune data mining and machine learning algorithms. • Students will acquire all the above abilities by exploiting state-of-the-art machine learning algorithms in engineering applications.
Basic programming skills.
Basic programming skills.
• Data science process: main phases (0.2 cfu) • Data collection, cleaning, transformation and enrichment, and feature engineering (0.3 cfu) • Data mining algorithms: classification and clustering (0.8 cfu) • Machine learning and deep learning algorithms (0.6 cfu) • Introduction to Python and data mining and machine learning libraries (e.g., scikit-learn) (1.0 cfu) • Case study analysis (0.6 cfu) • Design in the lab of data science process and machine learning and deep learning algorithms for engineering applications (2.5 cfu)
• Data science process: main phases (0.2 cfu) • Data collection, cleaning, transformation and enrichment, and feature engineering (0.3 cfu) • Data mining algorithms: classification and clustering (0.8 cfu) • Machine learning and deep learning algorithms (0.6 cfu) • Introduction to Python and data mining and machine learning libraries (e.g., scikit-learn) (1.0 cfu) • Case study analysis (0.6 cfu) • Design in the lab of data science process and machine learning and deep learning algorithms for engineering applications (2.5 cfu)
The course includes lectures and practices on the course topics, particularly on data science process design, data preprocessing, and machine learning and deep learning algorithms (3.5 CFU). The course includes laboratory sessions on the data science processes and machine learning algorithms for engineering applications (2.5 CFU). Laboratory sessions allow experimental activities on the most widespread tools and libraries. Students will prepare a written report on a group project assigned during the course and discuss it with the professor.
The course includes lectures and practices on the course topics, particularly on data science process design, data preprocessing, and machine learning and deep learning algorithms (3.5 CFU). The course includes laboratory sessions on the data science processes and machine learning algorithms for engineering applications (2.5 CFU). Laboratory sessions allow experimental activities on the most widespread tools and libraries. Students will prepare a written report on a group project assigned during the course and discuss it with the professor.
Copies of the slides used during the lectures, examples of written exams and exercises, and manuals for the activities in the laboratory will be made available. All teaching material is downloadable from the course website or the teaching Portal. Book (only a few chapters needed) - Tan, Steinbach and Karpatne, Kumar; 'Introduction to data mining', 2 ed., Pearson, 2019 - Bellini, Alessandro and Guidi, Andrea; ‘Python e machine learning’, McGraw-Hill Education, 2022
Copies of the slides used during the lectures, examples of written exams and exercises, and manuals for the activities in the laboratory will be made available. All teaching material is downloadable from the course website or the teaching Portal. Book (only a few chapters needed) - Tan, Steinbach and Karpatne, Kumar; 'Introduction to data mining', 2 ed., Pearson, 2019 - Bellini, Alessandro and Guidi, Andrea; ‘Python e machine learning’, McGraw-Hill Education, 2022
Modalità di esame: Elaborato progettuale in gruppo; Prova scritta in aula tramite PC con l'utilizzo della piattaforma di ateneo;
Exam: Group project; Computer-based written test in class using POLITO platform;
Gli studenti e le studentesse con disabilità o con Disturbi Specifici di Apprendimento (DSA), oltre alla segnalazione tramite procedura informatizzata, sono invitati a comunicare anche direttamente al/la docente titolare dell'insegnamento, con un preavviso non inferiore ad una settimana dall'avvio della sessione d'esame, gli strumenti compensativi concordati con l'Unità Special Needs, al fine di permettere al/la docente la declinazione più idonea in riferimento alla specifica tipologia di esame.
Exam: Group project; Computer-based written test in class using POLITO platform;
The exam includes a group project and a written part. The final score is defined by considering both the evaluation of the group project and the written part. The teacher may request an integrative test to confirm the obtained evaluation. Learning objectives assessment The written part will assess - the knowledge of the data preparation techniques and the major data mining algorithms for classification, regression, clustering, and association rule mining. - the knowledge of the machine learning and deep learning for engineering applications. The group project will assess - the ability to design, implement and evaluate a complete data science process, including the evaluation and tuning of machine learning algorithms and result assessment for a specific engineering application. - the working knowledge of the Python language and the major data mining and machine learning libraries. Exam structure and grading criteria The group project consists in designing and implementing a data science process, based on machine learning algorithms, for solving a data analytics task related to a specific engineering application. The project is assigned after 6 weeks of lectures. The evaluation of the group project is based on the performance and accuracy of the proposed solution, in terms of standard quality measures (e.g., prediction accuracy), and completeness (i.e., in depth analysis of each phase of the designed process and motivation for selecting given techniques and algorithms). The written part covers the theoretical part of the course. It includes multiple choice and box-to-fill questions related to the theoretical part of the course (data cleaning, data pre-processing, data mining algorithms, data science process, machine learning and deep learning algorithms). For multiple choice questions, wrong answers are penalized. The score of each question will be specified in the exam text. The written exam lasts 60 minutes. Textbooks, notes, electronic devices of any kind are not allowed. The maximum grade for the group project is 20. The maximum grade for the written part is 12. The final grade is given by the sum of the two parts. The exam is passed if the grade of the group project is greater than or equal to 10, the grade of the written part is greater than or equal to 8, and the overall grade is greater than or equal to 18. If the final score is strictly greater than 31 the registered score will be 30 with honor.
In addition to the message sent by the online system, students with disabilities or Specific Learning Disorders (SLD) are invited to directly inform the professor in charge of the course about the special arrangements for the exam that have been agreed with the Special Needs Unit. The professor has to be informed at least one week before the beginning of the examination session in order to provide students with the most suitable arrangements for each specific type of exam.
Esporta Word


© Politecnico di Torino
Corso Duca degli Abruzzi, 24 - 10129 Torino, ITALY
Contatti