From CNNs to Shift-Invariant Twin Wavelet Models - EDP Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2022

From CNNs to Shift-Invariant Twin Wavelet Models

Résumé

We propose a novel antialiasing method to increase shift invariance in convolutional neural networks (CNNs). More precisely, we replace the conventional combination "real-valued convolutions + max pooling" ($\mathbb R$Max) by "complex-valued convolutions + modulus" ($\mathbb C$Mod), which produce stable feature representations for band-pass filters with well-defined orientations. In a recent work, we proved that, for such filters, the two operators yield similar outputs. Therefore, $\mathbb C$Mod can be viewed as a stable alternative to $\mathbb R$Max. To separate band-pass filters from other freely-trained kernels, in this paper, we designed a "twin" architecture based on the dual-tree complex wavelet packet transform, which generates similar outputs as standard CNNs with fewer trainable parameters. In addition to improving stability to small shifts, our experiments on AlexNet and ResNet showed increased prediction accuracy on natural image datasets such as ImageNet and CIFAR10. Furthermore, our approach outperformed recent antialiasing methods based on low-pass filtering by preserving high-frequency information, while reducing memory usage.
Fichier principal
Vignette du fichier
main.pdf (7.58 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03880520 , version 1 (01-12-2022)
hal-03880520 , version 2 (21-04-2023)

Identifiants

  • HAL Id : hal-03880520 , version 1

Citer

Hubert Leterme, Kévin Polisano, Valérie Perrier, Karteek Alahari. From CNNs to Shift-Invariant Twin Wavelet Models. 2022. ⟨hal-03880520v1⟩
98 Consultations
45 Téléchargements

Partager

Gmail Facebook X LinkedIn More