Misc,

The mathematics of adversarial attacks in AI -- Why deep learning is unstable despite the existence of stable neural networks

A. Bastounis, A. Hansen, and V. Vlačić.
(2021)cite arxiv:2109.06098Comment: 29 pages, 1 figure.

Abstract

The unprecedented success of deep learning (DL) makes it unchallenged when it comes to classification problems. However, it is well established that the current DL methodology produces universally unstable neural networks (NNs). The instability problem has caused an enormous research effort -- with a vast literature on so-called adversarial attacks -- yet there has been no solution to the problem. Our paper addresses why there has been no solution to the problem, as we prove the following mathematical paradox: any training procedure based on training neural networks for classification problems with a fixed architecture will yield neural networks that are either inaccurate or unstable (if accurate) -- despite the provable existence of both accurate and stable neural networks for the same classification problems. The key is that the stable and accurate neural networks must have variable dimensions depending on the input, in particular, variable dimensions is a necessary condition for stability. Our result points towards the paradox that accurate and stable neural networks exist, however, modern algorithms do not compute them. This yields the question: if the existence of neural networks with desirable properties can be proven, can one also find algorithms that compute them? There are cases in mathematics where provable existence implies computability, but will this be the case for neural networks? The contrary is true, as we demonstrate how neural networks can provably exist as approximate minimisers to standard optimisation problems with standard cost functions, however, no randomised algorithm can compute them with probability better than 1/2.

BibTeX key: bastounis2021mathematics
entry type: misc
year: 2021
url: http://arxiv.org/abs/2109.06098
note: cite arxiv:2109.06098Comment: 29 pages, 1 figure

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@misc{bastounis2021mathematics, abstract = {The unprecedented success of deep learning (DL) makes it unchallenged when it comes to classification problems. However, it is well established that the current DL methodology produces universally unstable neural networks (NNs). The instability problem has caused an enormous research effort -- with a vast literature on so-called adversarial attacks -- yet there has been no solution to the problem. Our paper addresses why there has been no solution to the problem, as we prove the following mathematical paradox: any training procedure based on training neural networks for classification problems with a fixed architecture will yield neural networks that are either inaccurate or unstable (if accurate) -- despite the provable existence of both accurate and stable neural networks for the same classification problems. The key is that the stable and accurate neural networks must have variable dimensions depending on the input, in particular, variable dimensions is a necessary condition for stability. Our result points towards the paradox that accurate and stable neural networks exist, however, modern algorithms do not compute them. This yields the question: if the existence of neural networks with desirable properties can be proven, can one also find algorithms that compute them? There are cases in mathematics where provable existence implies computability, but will this be the case for neural networks? The contrary is true, as we demonstrate how neural networks can provably exist as approximate minimisers to standard optimisation problems with standard cost functions, however, no randomised algorithm can compute them with probability better than 1/2.}, added-at = {2021-09-24T21:58:54.000+0200}, author = {Bastounis, Alexander and Hansen, Anders C and Vlačić, Verner}, biburl = {https://www.bibsonomy.org/bibtex/2c8945d6d71fba255688a22dc9ac71961/stdiff}, description = {The mathematics of adversarial attacks in AI -- Why deep learning is unstable despite the existence of stable neural networks}, interhash = {87a7c4919d29d311147aef0b5c8851bb}, intrahash = {c8945d6d71fba255688a22dc9ac71961}, keywords = {neural-network}, note = {cite arxiv:2109.06098Comment: 29 pages, 1 figure}, timestamp = {2021-09-24T21:58:54.000+0200}, title = {The mathematics of adversarial attacks in AI -- Why deep learning is unstable despite the existence of stable neural networks}, url = {http://arxiv.org/abs/2109.06098}, year = 2021 }

BibSonomy

The mathematics of adversarial attacks in AI -- Why deep learning is unstable despite the existence of stable neural networks

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on