In this post, I want to show how I use NLTK for preprocessing and tokenization, and then apply machine learning techniques (e.g. training a linear SVM with stochastic gradient descent) using scikit-learn.
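To make the division of labour concrete, here is a minimal sketch of that workflow. It is not a definitive recipe: the toy corpus, the labels, and the helper name `nltk_tokenize` are made up for illustration, and I picked `wordpunct_tokenize` plus the Porter stemmer only because they need no extra NLTK data downloads. NLTK handles the tokenization and stemming, and scikit-learn handles the TF-IDF features and the linear SVM, which `SGDClassifier` fits by stochastic gradient descent when `loss="hinge"`.

```python
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import SGDClassifier
from sklearn.pipeline import Pipeline

stemmer = PorterStemmer()


def nltk_tokenize(text):
    """Hypothetical helper: lowercase, split with NLTK, keep alphabetic tokens, stem."""
    tokens = wordpunct_tokenize(text.lower())
    return [stemmer.stem(tok) for tok in tokens if tok.isalpha()]


# Toy corpus and labels, invented purely for illustration.
docs = [
    "The striker scored two goals in the final match",
    "The goalkeeper saved a penalty during the game",
    "The team won the league after a late goal",
    "The new processor doubles the laptop battery life",
    "The update fixes a bug in the operating system",
    "The phone ships with a faster chip and more memory",
]
labels = ["sport", "sport", "sport", "tech", "tech", "tech"]

model = Pipeline([
    # NLTK does the tokenization; scikit-learn only builds the TF-IDF matrix.
    ("tfidf", TfidfVectorizer(tokenizer=nltk_tokenize, lowercase=False, token_pattern=None)),
    # Hinge loss with an L2 penalty gives a linear SVM trained by SGD.
    ("svm", SGDClassifier(loss="hinge", penalty="l2", random_state=0)),
])
model.fit(docs, labels)

predictions = model.predict(["the striker scored a late goal"])
print(predictions)
```

Passing the NLTK function as the vectorizer's `tokenizer` keeps all preprocessing inside the pipeline, so the same steps run at training and prediction time.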