A. Achille and S. Soatto. (2017). arXiv:1706.01350. Comment: Deep learning, neural network, representation, flat minima, information bottleneck, overfitting, generalization, sufficiency, minimality, sensitivity, information complexity, stochastic gradient descent, regularization, total correlation, PAC-Bayes.
M. Al-Radhawi and D. Angeli. (2015). arXiv:1509.02086. Comment: Some results were published at the IEEE CDC conference, 2014. doi: 10.1109/CDC.2014.7039867.
D. Aldous. (2013). arXiv:1306.3039. Comment: Published at http://dx.doi.org/10.1214/12-STS404 in Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org).
M. Aldridge, O. Johnson, and J. Scarlett. (2019). arXiv:1902.06002. Comment: Survey paper, 140 pages, 19 figures. To be published in Foundations and Trends in Communications and Information Theory.