Abstract
Carefully crafted, often imperceptible, adversarial perturbations have been
shown to cause state-of-the-art models to yield extremely inaccurate outputs,
rendering them unsuitable for safety-critical application domains. In addition,
recent work has shown that constraining the attack space to a low frequency
regime is particularly effective. Yet, it remains unclear whether this is due
to generally constraining the attack search space or specifically removing high
frequency components from consideration. By systematically controlling the
frequency components of the perturbation, evaluating against the top-placing
defense submissions in the NeurIPS 2017 competition, we empirically show that
performance improvements in both the white-box and black-box transfer settings
are yielded only when low frequency components are preserved. In fact, the
defended models based on adversarial training are roughly as vulnerable to low
frequency perturbations as undefended models, suggesting that the purported
robustness of state-of-the-art ImageNet defenses is reliant upon adversarial
perturbations being high frequency in nature. We do find that under
$\ell_\infty$ $\epsilon=16/255$, the competition distortion bound, low
frequency perturbations are indeed perceptible. This questions the use of the
$\ell_\infty$-norm, in particular, as a distortion metric, and, in turn,
suggests that explicitly considering the frequency space is promising for
learning robust models which better align with human perception.
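The abstract describes systematically restricting adversarial perturbations to low frequency components. The paper does not specify an implementation here, but one common way to realize such a constraint is to project a perturbation onto its low frequency subspace with a 2-D Fourier mask. The sketch below is a hypothetical illustration of that idea (the function name, `radius` parameter, and FFT-based masking are assumptions, not the authors' method, which may instead use a DCT basis):

```python
import numpy as np

def low_frequency_projection(delta, radius=8):
    """Keep only the Fourier coefficients of `delta` whose frequency
    index lies within `radius` of DC, zeroing the high frequencies."""
    f = np.fft.fft2(delta)
    h, w = delta.shape
    # Signed integer frequency indices along each axis.
    fy = np.fft.fftfreq(h) * h
    fx = np.fft.fftfreq(w) * w
    mask = (np.abs(fy)[:, None] <= radius) & (np.abs(fx)[None, :] <= radius)
    # The input is real, so discard the negligible imaginary residue.
    return np.real(np.fft.ifft2(f * mask))
```

Applying such a projection after each attack step would confine the search to a low frequency regime; combining it with a clip to the $\epsilon=16/255$ ball would match the competition's $\ell_\infty$ distortion bound.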
Description
[1903.00073] On the Effectiveness of Low Frequency Perturbations