A Comprehensive guide to Bayesian Convolutional Neural Network with Variational Inference

K. Shridhar, F. Laumann, and M. Liwicki. (2019). Cite as arXiv:1901.02731. Comment: arXiv admin note: text overlap with arXiv:1506.02158, arXiv:1703.04977 by other authors.

Abstract

Artificial Neural Networks are connectionist systems that perform a given task by learning on examples without having prior knowledge about the task. This is done by finding an optimal point estimate for the weights in every node. Generally, networks using point estimates as weights perform well with large datasets, but they fail to express uncertainty in regions with little or no data, leading to overconfident decisions. In this paper, a Bayesian Convolutional Neural Network (BayesCNN) using Variational Inference is proposed, which introduces a probability distribution over the weights. Furthermore, the proposed BayesCNN architecture is applied to tasks like Image Classification, Image Super-Resolution and Generative Adversarial Networks. The results are compared to point-estimate-based architectures on the MNIST, CIFAR-10 and CIFAR-100 datasets for the Image Classification task, on the BSD300 dataset for the Image Super-Resolution task and on the CIFAR-10 dataset again for the Generative Adversarial Network task. BayesCNN is based on Bayes by Backprop, which derives a variational approximation to the true posterior. We, therefore, introduce the idea of applying two convolutional operations, one for the mean and one for the variance. Our proposed method not only achieves performance equivalent to frequentist inference in identical architectures but also incorporates a measurement for uncertainties and regularisation. It further eliminates the use of dropout in the model. Moreover, we predict how certain the model prediction is based on the epistemic and aleatoric uncertainties and empirically show how the uncertainty can decrease, allowing the decisions made by the network to become more deterministic as the training accuracy increases. Finally, we propose ways to prune the Bayesian architecture and to make it more computationally and time efficient.
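
The abstract's idea of "two convolutional operations, one for the mean and one for the variance" can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes a single-channel image, one kernel, and a factorised Gaussian over the kernel weights with mean `mu` and standard deviation `softplus(rho)`; the names `conv2d_valid`, `bayes_conv2d`, `mu` and `rho` are hypothetical and chosen only for illustration.

```python
import numpy as np


def conv2d_valid(x, w):
    """Plain 'valid' 2D cross-correlation of a single-channel image x with kernel w."""
    kh, kw = w.shape
    out_h = x.shape[0] - kh + 1
    out_w = x.shape[1] - kw + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.sum(x[i:i + kh, j:j + kw] * w)
    return out


def bayes_conv2d(x, mu, rho, rng):
    """Sample one output of a convolution whose weights are Gaussian.

    Assumption (not taken from the paper's code): each weight is
    N(mu, sigma^2) with sigma = softplus(rho), weights independent.
    """
    sigma = np.log1p(np.exp(rho))               # softplus keeps sigma positive
    act_mean = conv2d_valid(x, mu)              # first convolution: activation mean
    act_var = conv2d_valid(x ** 2, sigma ** 2)  # second convolution: activation variance
    eps = rng.standard_normal(act_mean.shape)   # fresh noise for every forward pass
    return act_mean + np.sqrt(act_var) * eps, act_mean, act_var


rng = np.random.default_rng(0)
x = np.arange(16.0).reshape(4, 4)
mu = np.ones((3, 3))
rho = np.zeros((3, 3))
sample, mean, var = bayes_conv2d(x, mu, rho, rng)
print(sample.shape, mean[0, 0])
```

Sampling at the activation level rather than per weight is one common way to realise this scheme; repeated forward passes give different outputs, and their spread is a rough proxy for the model's uncertainty about that input.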

Description

[1901.02731v1] A Comprehensive guide to Bayesian Convolutional Neural Network with Variational Inference

Links and resources

BibTeX key: shridhar2019comprehensive
entry type: misc
year: 2019
url: http://arxiv.org/abs/1901.02731
note: cite arxiv:1901.02731. Comment: arXiv admin note: text overlap with arXiv:1506.02158, arXiv:1703.04977 by other authors

Cite this publication

%0 Generic
%1 shridhar2019comprehensive
%A Shridhar, Kumar
%A Laumann, Felix
%A Liwicki, Marcus
%D 2019
%K Bayesian CNN DeepLearning
%T A Comprehensive guide to Bayesian Convolutional Neural Network with Variational Inference
%U http://arxiv.org/abs/1901.02731
%X Artificial Neural Networks are connectionist systems that perform a given task by learning on examples without having prior knowledge about the task. This is done by finding an optimal point estimate for the weights in every node. Generally, networks using point estimates as weights perform well with large datasets, but they fail to express uncertainty in regions with little or no data, leading to overconfident decisions. In this paper, a Bayesian Convolutional Neural Network (BayesCNN) using Variational Inference is proposed, which introduces a probability distribution over the weights. Furthermore, the proposed BayesCNN architecture is applied to tasks like Image Classification, Image Super-Resolution and Generative Adversarial Networks. The results are compared to point-estimate-based architectures on the MNIST, CIFAR-10 and CIFAR-100 datasets for the Image Classification task, on the BSD300 dataset for the Image Super-Resolution task and on the CIFAR-10 dataset again for the Generative Adversarial Network task. BayesCNN is based on Bayes by Backprop, which derives a variational approximation to the true posterior. We, therefore, introduce the idea of applying two convolutional operations, one for the mean and one for the variance. Our proposed method not only achieves performance equivalent to frequentist inference in identical architectures but also incorporates a measurement for uncertainties and regularisation. It further eliminates the use of dropout in the model. Moreover, we predict how certain the model prediction is based on the epistemic and aleatoric uncertainties and empirically show how the uncertainty can decrease, allowing the decisions made by the network to become more deterministic as the training accuracy increases. Finally, we propose ways to prune the Bayesian architecture and to make it more computationally and time efficient.

@misc{shridhar2019comprehensive,
  abstract = {Artificial Neural Networks are connectionist systems that perform a given task by learning on examples without having prior knowledge about the task. This is done by finding an optimal point estimate for the weights in every node. Generally, networks using point estimates as weights perform well with large datasets, but they fail to express uncertainty in regions with little or no data, leading to overconfident decisions. In this paper, a Bayesian Convolutional Neural Network (BayesCNN) using Variational Inference is proposed, which introduces a probability distribution over the weights. Furthermore, the proposed BayesCNN architecture is applied to tasks like Image Classification, Image Super-Resolution and Generative Adversarial Networks. The results are compared to point-estimate-based architectures on the MNIST, CIFAR-10 and CIFAR-100 datasets for the Image Classification task, on the BSD300 dataset for the Image Super-Resolution task and on the CIFAR-10 dataset again for the Generative Adversarial Network task. BayesCNN is based on Bayes by Backprop, which derives a variational approximation to the true posterior. We, therefore, introduce the idea of applying two convolutional operations, one for the mean and one for the variance. Our proposed method not only achieves performance equivalent to frequentist inference in identical architectures but also incorporates a measurement for uncertainties and regularisation. It further eliminates the use of dropout in the model. Moreover, we predict how certain the model prediction is based on the epistemic and aleatoric uncertainties and empirically show how the uncertainty can decrease, allowing the decisions made by the network to become more deterministic as the training accuracy increases. Finally, we propose ways to prune the Bayesian architecture and to make it more computationally and time efficient.},
  added-at = {2020-10-15T10:34:13.000+0200},
  author = {Shridhar, Kumar and Laumann, Felix and Liwicki, Marcus},
  biburl = {https://www.bibsonomy.org/bibtex/22cf83fe2da687353193033b7619987f9/annakrause},
  description = {[1901.02731v1] A Comprehensive guide to Bayesian Convolutional Neural Network with Variational Inference},
  interhash = {f7188f1acc08e8dd28cf3a8ca7622059},
  intrahash = {2cf83fe2da687353193033b7619987f9},
  keywords = {Bayesian CNN DeepLearning},
  note = {cite arxiv:1901.02731. Comment: arXiv admin note: text overlap with arXiv:1506.02158, arXiv:1703.04977 by other authors},
  timestamp = {2020-10-15T10:34:13.000+0200},
  title = {A Comprehensive guide to Bayesian Convolutional Neural Network with Variational Inference},
  url = {http://arxiv.org/abs/1901.02731},
  year = 2019
}

BibSonomy
