PORTALE DELLA DIDATTICA

Ricerca CERCA
  KEYWORD

Image Processing Lab (IPL)

Improved convolutional layers in deep learning via learned strides

keywords DEEP LEARNING, DEEP LEARNING, VIDEO ANALYSIS, DEEP NEURAL NETWORKS, MACHINE LEARNING, MACHINE LEARNING, ARTIFICIAL NEURAL NETWORKS

Reference persons ENRICO MAGLI

Research Groups CCNE - COMMUNICATIONS AND COMPUTER NETWORKS ENGINEERING, ICT4SS - ICT FOR SMART SOCIETIES, Image Processing Lab (IPL)

Thesis type RESEARCH

Description A recent paper [R1] has shown that convolutional layers in deep neural networks can be improved by learning the stride parameter, instead of using a fixed stride. This is implemented as a cropping window having learnable size in the Fourier domain. This method simplifies the design of a deep architecture, and achieves performance gains in several tasks.

The objective of this thesis is to develop this concept even further, employing signal processing methods to optimize the cropping stage, and adopting signal-adaptive windows. The new methods developed during the thesis will be tested on classification problems, as well as other problems to be defined.

[R1] R. Riad, O. Teboul, D. Grangier, N. Zeghidour, "Learning strides in convolutional neural networks", Proc. of ICLR 2022, winner of best paper award.

Required skills Candidate students should have some background on neural networks. Some experience of TensorFlow environment and Python programming are desirable, along with good programming skills.


Deadline 11/05/2023      PROPONI LA TUA CANDIDATURA




© Politecnico di Torino
Corso Duca degli Abruzzi, 24 - 10129 Torino, ITALY
Contatti