R. Pascanu, T. Mikolov, and Y. Bengio. (2012). arXiv:1211.5063. Comment: Improved description of the exploding gradient problem and description and analysis of the vanishing gradient problem.
X. Glorot and Y. Bengio. In Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS’10). Society for Artificial Intelligence and Statistics, 2010.