@dblp

Exponential convergence rates for Batch Normalization: The power of length-direction decoupling in non-convex optimization.

, , , , , and . AISTATS, volume 89 of Proceedings of Machine Learning Research, page 806-815. PMLR, (2019)

Links and resources

Tags