Misc,

Squeeze-and-Excitation Networks

J. Hu, L. Shen, S. Albanie, G. Sun, and E. Wu.
(2017)cite arxiv:1709.01507Comment: journal version of the CVPR 2018 paper, accepted by TPAMI.

Abstract

The central building block of convolutional neural networks (CNNs) is the convolution operator, which enables networks to construct informative features by fusing both spatial and channel-wise information within local receptive fields at each layer. A broad range of prior research has investigated the spatial component of this relationship, seeking to strengthen the representational power of a CNN by enhancing the quality of spatial encodings throughout its feature hierarchy. In this work, we focus instead on the channel relationship and propose a novel architectural unit, which we term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. We show that these blocks can be stacked together to form SENet architectures that generalise extremely effectively across different datasets. We further demonstrate that SE blocks bring significant improvements in performance for existing state-of-the-art CNNs at slight additional computational cost. Squeeze-and-Excitation Networks formed the foundation of our ILSVRC 2017 classification submission which won first place and reduced the top-5 error to 2.251%, surpassing the winning entry of 2016 by a relative improvement of ~25%. Models and code are available at https://github.com/hujie-frank/SENet.

BibTeX key: hu2017squeezeandexcitation
entry type: misc
year: 2017
url: http://arxiv.org/abs/1709.01507
note: cite arxiv:1709.01507Comment: journal version of the CVPR 2018 paper, accepted by TPAMI

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@misc{hu2017squeezeandexcitation, abstract = {The central building block of convolutional neural networks (CNNs) is the convolution operator, which enables networks to construct informative features by fusing both spatial and channel-wise information within local receptive fields at each layer. A broad range of prior research has investigated the spatial component of this relationship, seeking to strengthen the representational power of a CNN by enhancing the quality of spatial encodings throughout its feature hierarchy. In this work, we focus instead on the channel relationship and propose a novel architectural unit, which we term the "Squeeze-and-Excitation" (SE) block, that adaptively recalibrates channel-wise feature responses by explicitly modelling interdependencies between channels. We show that these blocks can be stacked together to form SENet architectures that generalise extremely effectively across different datasets. We further demonstrate that SE blocks bring significant improvements in performance for existing state-of-the-art CNNs at slight additional computational cost. Squeeze-and-Excitation Networks formed the foundation of our ILSVRC 2017 classification submission which won first place and reduced the top-5 error to 2.251%, surpassing the winning entry of 2016 by a relative improvement of ~25%. Models and code are available at https://github.com/hujie-frank/SENet.}, added-at = {2021-09-02T08:40:17.000+0200}, author = {Hu, Jie and Shen, Li and Albanie, Samuel and Sun, Gang and Wu, Enhua}, biburl = {https://www.bibsonomy.org/bibtex/2ed504063d89646b2910fbef47cc07bf8/aerover}, description = {Squeeze-and-Excitation Networks}, interhash = {a23063c0f6779478f3072610412697e1}, intrahash = {ed504063d89646b2910fbef47cc07bf8}, keywords = {cs.CV}, note = {cite arxiv:1709.01507Comment: journal version of the CVPR 2018 paper, accepted by TPAMI}, timestamp = {2021-09-02T08:40:17.000+0200}, title = {Squeeze-and-Excitation Networks}, url = {http://arxiv.org/abs/1709.01507}, year = 2017 }

BibSonomy

Squeeze-and-Excitation Networks

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on