Author of the publication

Sharp Minima Can Generalize For Deep Nets

, , , and . (2017)cite arxiv:1703.04933Comment: 8.5 pages of main content, 2.5 of bibliography and 1 page of appendix.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

No persons found for author name Bengio, Yoshua
add a person with the name Bengio, Yoshua
 

Other publications of authors with the same name

Noisy Activation Functions, , , and . (2016)cite arxiv:1603.00391v3.pdf.Unitary Evolution Recurrent Neural Networks., , and . CoRR, (2015)Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation., , , , , , and . (2016)cite arxiv:1606.00776Comment: 21 pages, 2 figures, 10 tables.Building End-To-End Dialogue Systems Using Generative Hierarchical Neural Network Models, , , , and . (2015)cite arxiv:1507.04808Comment: 8 pages with references; Published in AAAI 2016 (Special Track on Cognitive Systems).Batch Normalized Recurrent Neural Networks., , , , and . CoRR, (2015)Guest Introduction: Special Issue on New Methods for Model Selection and Model Combination., and . Mach. Learn., 48 (1-3): 5-7 (2002)On the Spectral Bias of Neural Networks, , , , , , , and . (2018)cite arxiv:1806.08734Comment: 23 pages.A neural probabilistic language model, , , and . Journal of machine learning research, 3 (Feb): 1137--1155 (2003)Graph Attention Networks., , , , , and . ICLR (Poster), OpenReview.net, (2018)DEUP: Direct Epistemic Uncertainty Prediction., , , , , , , and . Trans. Mach. Learn. Res., (2023)