A Minimal Model for Classification of Rotated Objects with Prediction of the Angle of Rotation - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Journal of Visual Communication and Image Representation Année : 2021

A Minimal Model for Classification of Rotated Objects with Prediction of the Angle of Rotation

(1) , (2) , (1)
1
2

Résumé

In classification tasks, the robustness against various image transformations remains a crucial property of the Convolutional Neural Networks (CNNs). It can be acquired using the data augmentation. It comes, however, at the price of the risk of overfitting and a considerable increase in training time. Consequently, other ways to endow CNN with invariance to various transformations-and mainly to the rotations-is an intensive field of study. This paper presents a new reduced rotation invariant classification model composed of two parts: a feature representation mapping and a classifier. We provide an insight into the principle and we prove that the proposed model is trainable. This model is smaller in terms of trainable parameters than similar approaches, and has angular prediction capabilities. We illustrate the results on the MNIST and CIFAR-10 datasets. On MNIST, we i) achieve the state of the art of classification on MNIST-rot (with training on MNIST-rot), and ii) improve the results of classification on MNIST-rot (with training on upright MNIST). When trained on CIFAR-10 with upright samples and tested with rotated samples we improve by 20% the state of the art classification results. In all cases, we can predict the rotation angle.
Fichier principal
Vignette du fichier
JVCIR_2021.pdf (753.54 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03118567 , version 1 (22-01-2021)
hal-03118567 , version 2 (18-02-2021)

Identifiants

Citer

Rosemberg Rodriguez Salas, Petr Dokládal, Eva Dokladalova. A Minimal Model for Classification of Rotated Objects with Prediction of the Angle of Rotation. Journal of Visual Communication and Image Representation, 2021, 75, pp.103054. ⟨10.1016/j.jvcir.2021.103054⟩. ⟨hal-03118567v2⟩
213 Consultations
618 Téléchargements

Altmetric

Partager

Gmail Facebook Twitter LinkedIn More