TR2021-125
Towards Universal Adversarial Examples and Defenses
-
- "Towards Universal Adversarial Examples and Defenses", IEEE Information Theory Workshop, DOI: 10.1109/ITW48936.2021.9611439, October 2021.BibTeX TR2021-125 PDF Video
- @inproceedings{Rakin2021oct,
- author = {Rakin, Adnan S and Wang, Ye and Aeron, Shuchin and Koike-Akino, Toshiaki and Moulin, Pierre and Parsons, Kieran},
- title = {Towards Universal Adversarial Examples and Defenses},
- booktitle = {IEEE Information Theory Workshop},
- year = 2021,
- month = oct,
- publisher = {IEEE},
- doi = {10.1109/ITW48936.2021.9611439},
- isbn = {978-1-6654-0312-2},
- url = {https://www.merl.com/publications/TR2021-125}
- }
,
- "Towards Universal Adversarial Examples and Defenses", IEEE Information Theory Workshop, DOI: 10.1109/ITW48936.2021.9611439, October 2021.
-
MERL Contacts:
-
Research Areas:
Artificial Intelligence, Machine Learning, Signal Processing
Abstract:
Adversarial example attacks have recently exposed the severe vulnerability of neural network models. However, most of the existing attacks require some form of target model information (i.e., weights/model inquiry/architecture) to improve the efficacy of the attack. We leverage the information-theoretic connections between robust learning and generalized rate-distortion theory to formulate a universal adversarial example (UAE) generation algorithm. Our algorithm trains an offline adversarial generator to minimize the mutual information of a given data distribution. At the inference phase, our UAE can efficiently generate effective adversary examples without high computation cost.These adversarial examples in turn allow for developing universal defense responses through adversarial training. Our experiments demonstrate promising gains in improving the training efficiency of conventional adversarial training