J. Werfel, X. Xie, and H. Seung. In, MIT Press, (2003) Discussion of learning curves for stochastic gradient descent.
Besides gradient-based approaches, the paper briefly describes (with additional references) weight perturbation and node perturbation approaches.
W. Wang, M. Jafari, S. Sanei, and J. Chambers. Signal Processing and Its Applications, 2003. Proceedings. Seventh International Symposium on, (July 2003)