Misc,

Detail-Preserving Pooling in Deep Networks

xxx.
(Apr 11, 2018)

Abstract

Most convolutional neural networks use some method for gradually downscaling the size of the hidden layers. This is commonly referred to as pooling, and is applied to reduce the number of parameters, improve invariance to certain distortions, and increase the receptive field size. Since pooling by nature is a lossy process, it is crucial that each such layer maintains the portion of the activations that is most important for the network's discriminability. Yet, simple maximization or averaging over blocks, max or average pooling, or plain downsampling in the form of strided convolutions are the standard. In this paper, we aim to leverage recent results on image downscaling for the purposes of deep learning. Inspired by the human visual system, which focuses on local spatial changes, we propose detail-preserving pooling (DPP), an adaptive pooling method that magnifies spatial changes and preserves important structural detail. Importantly, its parameters can be learned jointly with the rest of the network. We analyze some of its theoretical properties and show its empirical benefits on several datasets and networks, where DPP consistently outperforms previous pooling approaches.

BibTeX key: citeulike:14582396
entry type: misc
year: 2018
month: apr
day: 11
citeulike-article-id: 14582396
citeulike-linkout-1: http://arxiv.org/pdf/1804.04076
priority: 0
posted-at: 2018-05-08 09:27:19
eprint: 1804.04076
citeulike-linkout-0: http://arxiv.org/abs/1804.04076
archiveprefix: arXiv
url: http://arxiv.org/abs/1804.04076

BibSonomy

Detail-Preserving Pooling in Deep Networks

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on