Abstract
Recent studies have shown that modern deep neural network classifiers are
easy to fool, provided that an adversary can slightly modify their inputs.
Many papers have proposed adversarial attacks, defenses, and methods to
measure robustness to such adversarial perturbations. However, the most
commonly considered adversarial examples are based on $\ell_p$-bounded
perturbations in the input space of the neural network, which are unlikely
to arise naturally.
Recently, especially in computer vision, researchers have discovered "natural"
or "semantic" perturbations, such as rotations, changes of brightness, or
higher-level changes, but these perturbations have not yet been systematically
used to measure the performance of classifiers. In this paper, we propose
several metrics that measure the robustness of classifiers to natural
adversarial examples, along with methods to evaluate them. These metrics,
called latent space performance metrics, rely on the ability of generative
models to capture probability distributions and are defined in the latent
spaces of these models. On three
image classification case studies, we evaluate the proposed metrics for several
classifiers, including ones trained in conventional and robust ways. We find
that the latent counterparts of adversarial robustness are associated with the
accuracy of the classifier rather than its conventional adversarial robustness,
but the latter is still reflected in the properties of the latent
perturbations that are found. In addition, our novel method of finding latent adversarial
perturbations demonstrates that these perturbations are often perceptually
small.
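As a rough sketch of the general idea (not the authors' exact procedure; the `generator`, `classifier`, and all hyperparameters below are illustrative assumptions), a latent adversarial perturbation can be searched for by gradient descent in the latent space of a pretrained generative model, penalizing the size of the perturbation so that the resulting change in the generated image stays perceptually small:

```python
import torch
import torch.nn.functional as F

def find_latent_perturbation(generator, classifier, z, label,
                             steps=200, lr=0.05, reg=0.1):
    """Gradient search for a small latent perturbation delta such that
    the classifier's prediction on generator(z + delta) flips away from
    `label`. Assumes batch size 1; all hyperparameters are illustrative."""
    delta = torch.zeros_like(z, requires_grad=True)
    opt = torch.optim.Adam([delta], lr=lr)
    for _ in range(steps):
        logits = classifier(generator(z + delta))
        if logits.argmax(dim=1).item() != label.item():
            break  # prediction changed: latent adversarial example found
        # Push the prediction away from the original label while keeping
        # the latent perturbation small (penalized by its L2 norm).
        loss = -F.cross_entropy(logits, label) + reg * delta.norm()
        opt.zero_grad()
        loss.backward()
        opt.step()
    return delta.detach()

# Hypothetical usage: sample a latent vector from the generator's prior,
# take the classifier's prediction on the generated image as the label,
# then search for a latent perturbation that flips it.
# z = torch.randn(1, 128)
# label = classifier(generator(z)).argmax(dim=1)
# delta = find_latent_perturbation(generator, classifier, z, label)
```

The norm of the found `delta` can then serve as a latent-space analogue of the perturbation budget used in conventional $\ell_p$ robustness evaluation.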
Description
[2003.01993] Metrics and methods for robustness evaluation of neural networks with generative models