A. Achille and S. Soatto. (2017). arXiv:1706.01350. Comment: Deep learning, neural network, representation, flat minima, information bottleneck, overfitting, generalization, sufficiency, minimality, sensitivity, information complexity, stochastic gradient descent, regularization, total correlation, PAC-Bayes.
M. Al-Radhawi and D. Angeli. (2015). arXiv:1509.02086. Comment: Some results were published at the IEEE CDC conference, 2014. doi: 10.1109/CDC.2014.7039867.
D. Aldous. (2013). arXiv:1306.3039. Comment: Published at http://dx.doi.org/10.1214/12-STS404 in Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org).
M. Aldridge, O. Johnson, and J. Scarlett. (2019). arXiv:1902.06002. Comment: Survey paper, 140 pages, 19 figures. To be published in Foundations and Trends in Communications and Information Theory.
D. Alvarez-Melis and T. Jaakkola. (2018). arXiv:1806.08049. Comment: Presented at the 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018), Stockholm, Sweden.
S. Amari, R. Karakida, and M. Oizumi. Proceedings of Machine Learning Research, volume 89, pages 694--702. PMLR, (16--18 Apr 2019)