Servizi per la didattica
PORTALE DELLA DIDATTICA

Applied data science project

01TXXSM

A.A. 2021/22

Course Language

Inglese

Course degree

Master of science-level of the Bologna process in Data Science And Engineering - Torino

Course structure
Teaching Hours
Lezioni 30
Esercitazioni in laboratorio 90
Teachers
Teacher Status SSD h.Les h.Ex h.Lab h.Tut Years teaching
Rizzo Giuseppe   Docente esterno e/o collaboratore   21 0 9 0 1
Teaching assistant
Espandi

Context
SSD CFU Activities Area context
ING-INF/05 12 D - A scelta dello studente A scelta dello studente
2021/22
The practical and theoretical knowledge acquired by the students during the master degree provide them with the tools to face and solve specific data science and data analytics problems. Further steps that students will likely take when entering the work market are: thinking in the large and applying diverse abilities, together with team mates in order to achieve objectives and build complex systems. The goal of this course is to let them face, for the first time a long running real-world project, proposed by an international company, including all the related technical and management challenges.
The course is offered in the first semester of the 2nd year. The course is taught in English. The course covers the main pillars of executing a successful data science project ranging from the computational and management aspects to the communication one. The goal of this course is to let students face, for the first time, a long running project and learn how to manage all project steps (problem specification, task assignment, design and implementation of the solution, testing, milestones management, writing of intermediate and final reports, result communication). Laboratory activities will expose students with first-hand experience in projects that use massively data science methodologies in collaboration with companies and applied research institutes with an international breadth.
• Knowledge of the agile software development methodologies • Ability to interact with a client, cope with its needs, design a data science project, assess the project performance, and deliver data analytics processes in an incremental fashion.
- Knowledge of first-hand computational tools to address Data Science projects - Knowledge of management strategies and tools - Knowledge of communication tools and skills - Knowledge of the best-practices for value-oriented project setup and execution - Hands on experience with a real data science project offered by companies or research institutes
• Advanced knowledge of data science processes, data mining and machine learning algorithms, and statistical and mathematical models.
Statistics Data mining Machine learning and Deep learning Python language Relational, NOSQL, graph databases
• Agile software development (0.5 cr.) • Scrum methodology (1.0 cfu) • Data science project development (10 cfu) • Retrospective and critical analysis (0.5 cfu)
Lectures (30 hours) - Data Science Introduction - Model and Data-centric projects - Foundation models - Sustainable development goals and data science examples - Data Science Project pillars: - Design - Develop - Manage - Communicate - 10 Practical tips - Introduction to Agile and Scrum methodologies - Collaborative and interactive project design - Collaborative and interactive project development - Google Colab setup and use - Github setup and use - Collaborative project management - Asana setup and use - How to preparing clear slides to present the project - How to write a technical report - Project champions - Project proposals Laboratory activities (90 hours) - Data science project design and development in the lab
None
None
The course includes lectures and practices on the lecture topics, and in particular on data science process design, data preprocessing and data mining algorithms. Students will prepare an individual written report on an individual project assigned during the course. The course includes laboratory sessions on data science process design and data analytics. Laboratory sessions allow experimental activities on the most widespread commercial and open-source products. The course will be structured into two phases: 1. Introductory lectures, in the classroom with the whole class 2. Project work, in labs and classroom, team based • Projects are provided by external industrial companies and involve commitment by students to develop actually working data science processes • The development will be divided into three/four development iterations (sprints) according to the Scrum development methodology. Each spring will terminate with a demonstration to the customer and teachers, followed by a retrospective (with coordinating teachers), critical appraisal and planning for next iterations.
The course includes practices on the lecture topics, laboratory sessions for the execution of the projects, and meetups with the company key resources (project managers and project leaders) to successfully execute the assigned projects.
Copies of the slides used during the lectures will be made available. All teaching material is downloadable from the course website or the teaching portal.
Copies of the slides used during the lectures will be made available. All teaching material is downloadable from the teaching portal. Reference books: - Machine Learning Yearning, by Andrew Ng - Data Science from Scratch, Joel Grus - Scrum Mastery: From Good To Great Servant-Leadership, by Geoff Watts - Agile Project Management for Dummies, by Mark C. Layton
Modalità di esame: Prova orale obbligatoria; Elaborato progettuale in gruppo;
Exam: Compulsory oral exam; Group project;
The exam includes a mandatory oral part (20% of the overall mark), and the evaluation of the team project assigned during the course (80% of the overall mark). Learning objectives assessment The oral exam covers all the theoretical concepts introduced during the course and will assess - Knowledge of the best-practices for value-oriented project setup and execution - Knowledge of the tools for executing a project - Knowledge of project management strategies - Communication skills The team project and the delivered report will assess - Ability to design, implement and evaluate a complete data science project - Ability to use first-hand computational tools to address data science projects - Ability to use management tools - Ability to use communication tools Exam structure and grading criteria The team project consists in managing, designing and implementing a data science solution for solving a data science problem assigned by companies or applied research institutes. The team project is assigned at the beginning of the course and its report must be delivered before the end of the course. The team project is also characterized by intermediate milestones. Its score is valid for the entire academic year. The evaluation of the team project is based on the performance and accuracy of the proposed solution, in terms of standard quality measures (e.g., prediction accuracy) and completeness (i.e., in depth analysis of each phase of the designed process and motivation for selecting given techniques and algorithms). The clearness and completeness of the delivered reports will also be considered. The oral exam covers all the theoretical parts of the course. The score is based on the completeness and clarity of the answers. The maximum grade for the team project is 32. The maximum grade for the oral part is 32. The final grade is given by weighted average of the two parts (0.8*grade team project + 0.2*grade oral part). The exam is passed if the grade of the team project is greater than or equal to 18 and the grade of the oral part is greater than or equal to 18.
Modalità di esame: Prova orale obbligatoria; Elaborato progettuale in gruppo;
Exam: Compulsory oral exam; Group project;
The exam includes a mandatory oral part (20% of the overall mark), and the evaluation of the team project assigned during the course (80% of the overall mark). Learning objectives assessment The oral exam covers all the theoretical concepts introduced during the course and will assess - Knowledge of the best-practices for value-oriented project setup and execution - Knowledge of the tools for executing a project - Knowledge of project management strategies - Communication skills The team project and the delivered report will assess - Ability to design, implement and evaluate a complete data science project - Ability to use first-hand computational tools to address data science projects - Ability to use management tools - Ability to use communication tools Exam structure and grading criteria The team project consists in managing, designing and implementing a data science solution for solving a data science problem assigned by companies or applied research institutes. The team project is assigned at the beginning of the course and its report must be delivered before the end of the course. The team project is also characterized by intermediate milestones. Its score is valid for the entire academic year. The evaluation of the team project is based on the performance and accuracy of the proposed solution, in terms of standard quality measures (e.g., prediction accuracy) and completeness (i.e., in depth analysis of each phase of the designed process and motivation for selecting given techniques and algorithms). The clearness and completeness of the delivered reports will also be considered. The oral exam covers all the theoretical parts of the course. The score is based on the completeness and clarity of the answers. The maximum grade for the team project is 32. The maximum grade for the oral part is 32. The final grade is given by weighted average of the two parts (0.8*grade team project + 0.2*grade oral part). The exam is passed if the grade of the team project is greater than or equal to 18 and the grade of the oral part is greater than or equal to 18.
Modalità di esame: Prova orale obbligatoria; Elaborato progettuale in gruppo;
Exam: Compulsory oral exam; Group project;
The exam includes a mandatory oral part (20% of the overall mark), and the evaluation of the team project assigned during the course (80% of the overall mark). Learning objectives assessment The oral exam covers all the theoretical concepts introduced during the course and will assess - Knowledge of the best-practices for value-oriented project setup and execution - Knowledge of the tools for executing a project - Knowledge of project management strategies - Communication skills The team project and the delivered report will assess - Ability to design, implement and evaluate a complete data science project - Ability to use first-hand computational tools to address data science projects - Ability to use management tools - Ability to use communication tools Exam structure and grading criteria The team project consists in managing, designing and implementing a data science solution for solving a data science problem assigned by companies or applied research institutes. The team project is assigned at the beginning of the course and its report must be delivered before the end of the course. The team project is also characterized by intermediate milestones. Its score is valid for the entire academic year. The evaluation of the team project is based on the performance and accuracy of the proposed solution, in terms of standard quality measures (e.g., prediction accuracy) and completeness (i.e., in depth analysis of each phase of the designed process and motivation for selecting given techniques and algorithms). The clearness and completeness of the delivered reports will also be considered. The oral exam covers all the theoretical parts of the course. The score is based on the completeness and clarity of the answers. The maximum grade for the team project is 32. The maximum grade for the oral part is 32. The final grade is given by weighted average of the two parts (0.8*grade team project + 0.2*grade oral part). The exam is passed if the grade of the team project is greater than or equal to 18 and the grade of the oral part is greater than or equal to 18.
Esporta Word


© Politecnico di Torino
Corso Duca degli Abruzzi, 24 - 10129 Torino, ITALY
Contatti