Detecting the risk of discrimination in classifiers with imbalance measures
keywords DATA QUALITY, DATA SCIENCE, OPEN DATA, OPEN GOVERNMENT DATA, SOFTWARE ENGINEERING
Reference persons MARCO TORCHIANO, ANTONIO VETRO'
Research Groups DAUIN - GR-16 - SOFTWARE ENGINEERING GROUP - SOFTENG, DAUIN - GR-22 - Nexa Center for Internet & Society - NEXA
Thesis type RESEARCH / EXPERIMENTAL, RESEARCH, INNOVATIVE
Description Open Government Data (OGD) has spread very quickly in recent years, but its quality has not kept pace with its quantity.
The ISO standards on data quality provide a useful framework for modeling and measuring data quality, and a prototype tool is available for applying some of the measures.
The goal of the thesis is to use this tool to carry out a large-scale analysis of the quality of the Italian open data published on the https://dati.gov.it/ portal.
Enhancing the tool's features will also be part of the thesis work.
Required skills Good programming skills and basic knowledge of common data analytics tools and techniques. A grade point average of 26 or higher may be used as a criterion for selecting candidates.
Notes When sending your application, we kindly ask you to attach the following information:
- a list of the exams taken in your master's degree, with grades and grade point average
- a résumé or equivalent (e.g., a LinkedIn profile), if you already have one
- the date by which you aim to graduate and an estimate of the time you can devote to the thesis in a typical week
Deadline 30/11/2023