In this post, I want to show how I use NLTK for preprocessing and tokenization, but then apply machine learning techniques (e.g. building a linear SVM using stochastic gradient descent) using Scikit-Learn.
C. Schmitz, S. Staab, R. Studer, G. Stumme, and J. Tane. Proceedings of E-Learn 2002: World Conference on E-Learning in Corporate, Government, Healthcare, & Higher Education, October 15-19 2002, Montreal, Canada, page 909-915. (2002)