Abstract
The desire to run neural networks on low-capacity edge devices has led to the
development of a wealth of compression techniques. Moonshine is a simple and
powerful example of this: one takes a large pre-trained network and substitutes
each of its convolutional blocks with a selected cheap alternative block, then
distills the resultant network with the original. However, not all blocks are
created equally; for a required parameter budget there may exist a potent
combination of many different cheap blocks. In this work, we find these by
developing BlockSwap: an algorithm for choosing networks with interleaved block
types by passing a single minibatch of training data through randomly
initialised networks and gauging their Fisher potential. We show that
block-wise cheapening yields more accurate networks than single block-type
networks across a spectrum of parameter budgets. Code is available at
https://github.com/BayesWatch/pytorch-blockswap.
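
As a rough illustration of the scoring step described above (not the paper's exact implementation), the sketch below scores a randomly initialised candidate network by passing a single minibatch through it and summing a per-channel empirical Fisher estimate over its ReLU activations. The function name `fisher_potential`, the choice of hooking ReLU modules, and the use of the squared spatially-summed activation-gradient product are assumptions for illustration; consult the linked repository for the authors' implementation.

```python
import torch.nn as nn
import torch.nn.functional as F

def fisher_potential(model, inputs, targets):
    """Score a candidate network on one minibatch (illustrative sketch)."""
    acts, grads = {}, {}

    def save_act(name):
        def forward_hook(module, inp, out):
            acts[name] = out

            def store_grad(g):
                grads[name] = g

            # Capture the gradient w.r.t. this activation on backward.
            out.register_hook(store_grad)
        return forward_hook

    handles = [m.register_forward_hook(save_act(n))
               for n, m in model.named_modules()
               if isinstance(m, nn.ReLU)]

    model.zero_grad()
    loss = F.cross_entropy(model(inputs), targets)
    loss.backward()

    total = 0.0
    for name, a in acts.items():
        g = grads[name]
        # Assumes 4-D feature maps (batch, channels, height, width):
        # square the spatially-summed activation-gradient product,
        # average over the batch, and sum over channels.
        prod = (a * g).sum(dim=(2, 3))
        total += prod.pow(2).mean(dim=0).sum().item() / 2

    for h in handles:
        h.remove()
    return total
```

Under this sketch, a search would generate random interleaved block assignments within the parameter budget, score each candidate with the same single minibatch, and distill the highest-scoring candidate from the original network.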