Telecommunication Networks Group
Automatic crawler for phishing sites
Parole chiave CYBERSECURITY, MACHINE LEARNING, ARTIFICIAL NEURAL NETWORKS, PHISHING
Riferimenti MARCO MELLIA
Riferimenti esterni Idilio Drago
Gruppi di ricerca SmartData@PoliTO, Telecommunication Networks Group
Tipo tesi EXPERIMENTAL
Descrizione The thesis focuses on the design, engineering and testing of an automatic crawler that identifies possible phishing sites. Given a URL, the crawler will be able to quantify the probability that that page is i) phishing, ii) parking, iii) legitimate, iv) corrupted or other possible classes.
The system will be based on functionalities extracted from the page (number of links, number of images, etc.) integrated with external information (TLS certificate, hosting site, website country, use of content management systems, etc.).
The student will use some already available data, like https://phishtank.org, or solutions like https://conferences.sigcomm.org/imc/2022/program/#p54 to design and implement a system that automatically visits a page and returns any classification of that page.
Conoscenze richieste - Interest in cyber-security
- Interest in machine learning and AI algorithms
- Good programming skill (Python)
- Good knowledge on machine learning classifiers
Scadenza validita proposta 13/12/2023 PROPONI LA TUA CANDIDATURA