In this post, I want to show how I use NLTK for preprocessing and tokenization, and then apply machine learning techniques (e.g. building a linear SVM with stochastic gradient descent) in Scikit-Learn.
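As a rough illustration of that workflow, here is a minimal sketch in which NLTK handles tokenization and stemming and Scikit-Learn handles feature extraction and training. The helper name `nltk_tokenize`, the toy corpus, and the choice of `wordpunct_tokenize` with `PorterStemmer` are assumptions made for the example, not necessarily what the post itself uses. `SGDClassifier` with `loss="hinge"` fits a linear SVM by stochastic gradient descent.

```python
from nltk.stem import PorterStemmer
from nltk.tokenize import wordpunct_tokenize
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import SGDClassifier
from sklearn.pipeline import Pipeline

stemmer = PorterStemmer()


def nltk_tokenize(text):
    """Hypothetical helper: lowercase, split with NLTK, keep alphabetic tokens, stem."""
    tokens = wordpunct_tokenize(text.lower())
    return [stemmer.stem(tok) for tok in tokens if tok.isalpha()]


# Toy corpus and labels, invented purely for illustration.
docs = [
    "The striker scored two goals in the final match.",
    "The goalkeeper saved a penalty during the game.",
    "The team won the league after a long season.",
    "The new processor doubles the speed of the laptop.",
    "The software update fixes several security bugs.",
    "Engineers released a faster graphics chip.",
]
labels = ["sports", "sports", "sports", "tech", "tech", "tech"]

model = Pipeline([
    # NLTK does the tokenization; the vectorizer only builds TF-IDF features.
    ("tfidf", TfidfVectorizer(tokenizer=nltk_tokenize, lowercase=False,
                              token_pattern=None)),
    # Hinge loss + SGD corresponds to a linear SVM trained stochastically.
    ("svm", SGDClassifier(loss="hinge", random_state=0)),
])

model.fit(docs, labels)
predictions = model.predict(docs)
```

Passing the NLTK function as the vectorizer's `tokenizer` keeps preprocessing and learning in a single pipeline object, so the same tokenization is applied at training and prediction time.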
C. Henning, and R. Ewerth. Proceedings of the 2017 ACM on International Conference on Multimedia Retrieval, pp. 14--22. New York, NY, USA, ACM, (2017)
Z. Yang, D. Yang, C. Dyer, X. He, A. Smola, and E. Hovy. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1480--1489. San Diego, California, Association for Computational Linguistics, (June 2016)
X. Zhang, and Y. LeCun. arXiv:1502.01710, (2015). Comment: This technical report is superseded by a paper entitled "Character-level Convolutional Networks for Text Classification", arXiv:1509.01626. It has considerably more experimental results and a rewritten introduction.
Y. Kim. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 25-29, 2014, Doha, Qatar, A meeting of SIGDAT, a Special Interest Group of the ACL, pp. 1746--1751. (2014)
S. Dori-Hacohen, and J. Allan. Proceedings of the 22nd ACM International Conference on Information & Knowledge Management, pp. 1845--1848. New York, NY, USA, ACM, (2013)
E. Loza Mencía, and J. Fürnkranz. Semantic Processing of Legal Texts, Volume 6036 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2010)
X. Li, B. Liu, and S. Ng. Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 218--228. Stroudsburg, PA, USA, Association for Computational Linguistics, (2010)
G. Forman, M. Scholz, and S. Rajaram. KDD '09: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 299--308. New York, NY, USA, ACM, (2009)
S. Feldman, M. Marin, M. Ostendorf, and M. Gupta. Proceedings of the 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4781--4784. Washington, DC, USA, IEEE Computer Society, (2009)
X. Phan, L. Nguyen, and S. Horiguchi. WWW '08: Proceeding of the 17th international conference on World Wide Web, pp. 91--100. New York, NY, USA, ACM, (2008)
E. Loza Mencía, and J. Fürnkranz. Machine Learning and Knowledge Discovery in Databases, Volume 5212 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, (2008)
L. Hirsch, R. Hirsch, and M. Saeedi. GECCO '07: Proceedings of the 9th annual conference on Genetic and evolutionary computation, 2, pp. 1604--1611. London, ACM Press, (7-11 July 2007)