A PAC-Bayes Analysis of Adversarial Robustness

Guillaume Vidot; Paul Viallard; Amaury Habrard; Emilie Morvant

Preprint, Working Paper. Year: 2021

Guillaume Vidot

Role: Author
PersonId : 743959
IdHAL : guillaume-vidot
ORCID : 0000-0002-4367-457X

Airbus Operation S.A.S.

Advancing Rigorous Software and System Engineering

Paul Viallard

Role: Author
PersonId : 743893
IdHAL : paul-viallard
ORCID : 0000-0003-4836-0809

Laboratoire Hubert Curien

Amaury Habrard

Role: Author
PersonId : 439
IdHAL : amaury-habrard
ORCID : 0000-0003-3038-9347
IdRef : 084103655

Laboratoire Hubert Curien

Emilie Morvant

Role: Author
PersonId : 410
IdHAL : emilie-morvant
ORCID : 0000-0002-8301-7240
IdRef : 179027468

Laboratoire Hubert Curien

Abstract

We propose the first general PAC-Bayesian generalization bounds for adversarial robustness, which estimate, at test time, how invariant a model will be to imperceptible perturbations of the input. Instead of deriving a worst-case analysis of the risk of a hypothesis over all possible perturbations, we leverage the PAC-Bayesian framework to bound the risk averaged over the perturbations for majority votes (over the whole class of hypotheses). Our theoretically founded analysis has the advantage of providing general bounds that (i) are independent of the type of perturbation (i.e., the adversarial attack), (ii) are tight thanks to the PAC-Bayesian framework, and (iii) can be directly minimized during the learning phase to obtain a model robust to different attacks at test time.

Domains

Machine Learning [stat.ML]; Artificial Intelligence [cs.AI]

Main file

arxiv.pdf (302.74 KB)

Origin: Files produced by the author(s)

Contributor: Guillaume Vidot

https://hal.science/hal-03145332

Submitted on: Thursday, February 18, 2021, 12:00:17

Last modified on: Monday, November 20, 2023, 11:44:23

Long-term archiving on: Wednesday, May 19, 2021, 18:56:59

Dates and versions

hal-03145332 , version 1 (18-02-2021)

hal-03145332 , version 2 (26-10-2021)

Identifiers

HAL Id : hal-03145332 , version 1

Cite

Guillaume Vidot, Paul Viallard, Amaury Habrard, Emilie Morvant. A PAC-Bayes Analysis of Adversarial Robustness. 2021. ⟨hal-03145332v1⟩

Collections

SMS

331 views

192 downloads