Effective Quantization Approaches for Recurrent Neural Networks

M. Alom, A. Moody, N. Maruyama, B. Van Essen, and T. Taha (2018). arXiv:1802.02615. Comment: 8 pages, 23 figures. Submitted to the International Joint Conference on Neural Networks (IJCNN) 2018.

Abstract

Deep learning, and in particular Recurrent Neural Networks (RNNs), has shown superior accuracy in a large variety of tasks including machine translation, language understanding, and movie frame generation. However, these deep learning approaches are very expensive in terms of computation. In most cases, Graphics Processing Units (GPUs) are used for large-scale implementations. Meanwhile, energy-efficient RNN approaches have been proposed for deploying solutions on special-purpose hardware, including Field-Programmable Gate Arrays (FPGAs) and mobile platforms. In this paper, we propose an effective quantization approach for Recurrent Neural Network (RNN) techniques including Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU), and Convolutional Long Short-Term Memory (ConvLSTM). We have implemented different quantization methods including Binary Connect {-1, 1}, Ternary Connect {-1, 0, 1}, and Quaternary Connect {-1, -0.5, 0.5, 1}. These proposed approaches are evaluated on different datasets: sentiment analysis on IMDB and video frame prediction on the moving MNIST dataset. The experimental results are compared against the full-precision versions of the LSTM, GRU, and ConvLSTM. They show promising results for both sentiment analysis and video frame prediction.
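
As a rough illustration of what the three level sets in the abstract mean in practice, the sketch below maps full-precision weights to the nearest allowed value. Only the level sets come from the abstract. The nearest-level rounding rule, the clipping to [-1, 1], the tie-breaking, and the names quantize_weight and quantize_weights are assumptions made for illustration; the abstract does not state how the authors quantize or train, so this should not be read as their procedure.

```python
# Illustrative sketch only: nearest-level weight quantization.
# The level sets are taken from the abstract; the rounding rule, the
# clipping, the tie-breaking, and all names here are assumptions.

BINARY_LEVELS = (-1.0, 1.0)
TERNARY_LEVELS = (-1.0, 0.0, 1.0)
QUATERNARY_LEVELS = (-1.0, -0.5, 0.5, 1.0)


def quantize_weight(w, levels):
    """Map one real-valued weight to the closest value in `levels`."""
    # Clip to [-1, 1] so that large weights saturate at the extreme levels.
    clipped = max(-1.0, min(1.0, w))
    # Pick the nearest level; on an exact tie, prefer the larger level.
    return min(levels, key=lambda q: (abs(clipped - q), -q))


def quantize_weights(weights, levels):
    """Quantize a flat list of weights element by element."""
    return [quantize_weight(w, levels) for w in weights]


if __name__ == "__main__":
    weights = [-0.9, -0.3, 0.0, 0.2, 0.7, 1.4]
    for name, levels in (
        ("binary", BINARY_LEVELS),
        ("ternary", TERNARY_LEVELS),
        ("quaternary", QUATERNARY_LEVELS),
    ):
        print(name, quantize_weights(weights, levels))
```

With more levels, a weight such as -0.3 is represented more faithfully (as -1 in the binary set, 0 in the ternary set, and -0.5 in the quaternary set), at the cost of more bits per weight: one bit for two levels, two bits for three or four levels.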

Description

1802.02615.pdf

Links and resources

BibTeX key: alom2018effective
entry type: misc
year: 2018
url: http://arxiv.org/abs/1802.02615
note: cite arxiv:1802.02615. Comment: 8 pages, 23 figures. Submitted to International Joint Conference on Neural Networks (IJCNN) 2018

Cite this publication

@misc{alom2018effective,
  abstract = {Deep learning, and in particular Recurrent Neural Networks (RNNs), has shown superior accuracy in a large variety of tasks including machine translation, language understanding, and movie frame generation. However, these deep learning approaches are very expensive in terms of computation. In most cases, Graphics Processing Units (GPUs) are used for large-scale implementations. Meanwhile, energy-efficient RNN approaches have been proposed for deploying solutions on special-purpose hardware, including Field-Programmable Gate Arrays (FPGAs) and mobile platforms. In this paper, we propose an effective quantization approach for Recurrent Neural Network (RNN) techniques including Long Short-Term Memory (LSTM), Gated Recurrent Units (GRU), and Convolutional Long Short-Term Memory (ConvLSTM). We have implemented different quantization methods including Binary Connect {-1, 1}, Ternary Connect {-1, 0, 1}, and Quaternary Connect {-1, -0.5, 0.5, 1}. These proposed approaches are evaluated on different datasets: sentiment analysis on IMDB and video frame prediction on the moving MNIST dataset. The experimental results are compared against the full-precision versions of the LSTM, GRU, and ConvLSTM. They show promising results for both sentiment analysis and video frame prediction.},
  added-at = {2018-02-10T12:55:43.000+0100},
  author = {Alom, Md Zahangir and Moody, Adam T and Maruyama, Naoya and Van Essen, Brian C and Taha, Tarek M.},
  biburl = {https://www.bibsonomy.org/bibtex/20e69d5e87066140ba32fac1574dd5062/jk_itwm},
  description = {1802.02615.pdf},
  interhash = {8f33feaccc7bdad07ac7b2923244900a},
  intrahash = {0e69d5e87066140ba32fac1574dd5062},
  keywords = {RNN quantization},
  note = {cite arxiv:1802.02615. Comment: 8 pages, 23 figures. Submitted to International Joint Conference on Neural Networks (IJCNN) 2018},
  timestamp = {2018-02-10T12:55:43.000+0100},
  title = {Effective Quantization Approaches for Recurrent Neural Networks},
  url = {http://arxiv.org/abs/1802.02615},
  year = 2018
}
