Abstract
Expressive efficiency refers to the relation between two architectures A and
B, whereby any function realized by B can be replicated by A, but there
exist functions realized by A that cannot be replicated by B unless its size
grows significantly larger. For example, it is known that deep networks are
exponentially efficient with respect to shallow networks, in the sense that a
shallow network must grow exponentially large in order to approximate the
functions represented by a deep network of polynomial size. In this work, we
extend the study of expressive efficiency to the attribute of network
connectivity, and in particular to the effect of "overlaps" in the convolutional
process, i.e., when the stride of the convolution is smaller than its filter
size (receptive field). To theoretically analyze this aspect of network
design, we focus on a well-established surrogate for ConvNets called
Convolutional Arithmetic Circuits (ConvACs), and then demonstrate empirically
that our results hold for standard ConvNets as well. Specifically, our analysis
shows that having overlapping local receptive fields, and more broadly denser
connectivity, results in an exponential increase in the expressive capacity of
neural networks. Moreover, while denser connectivity can increase the
expressive capacity, we show that the most common types of modern architectures
already exhibit an exponential increase in expressivity, without relying on
fully-connected layers.
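
To make the notion of overlap concrete, the following minimal sketch (using PyTorch, which the paper itself does not prescribe; all layer and input sizes here are illustrative assumptions) contrasts an overlapping convolution, whose stride is smaller than its filter size, with a non-overlapping one, whose stride equals its filter size:

```python
import torch
import torch.nn as nn

# Overlapping convolution: stride (1) is smaller than the filter size (3),
# so neighboring local receptive fields share input pixels.
overlapping = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=3, stride=1)

# Non-overlapping convolution: stride equals the filter size (3),
# so the receptive fields tile the input without sharing any pixels.
non_overlapping = nn.Conv2d(in_channels=3, out_channels=16, kernel_size=3, stride=3)

x = torch.randn(1, 3, 32, 32)        # one 32x32 RGB input (illustrative)
print(overlapping(x).shape)          # torch.Size([1, 16, 30, 30])
print(non_overlapping(x).shape)      # torch.Size([1, 16, 10, 10])
```

With stride 1 and a 3x3 filter, adjacent receptive fields share two of their three rows or columns of input; with stride 3 they are disjoint, which is the non-overlapping regime the analysis compares against.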