In this post, I want to show how I use NLTK for preprocessing and tokenization, but then apply machine learning techniques (e.g. building a linear SVM using stochastic gradient descent) using Scikit-Learn.
D. Parks, J. Prochaska, S. Dong, and Z. Cai. (2017)cite arxiv:1709.04962Comment: 20 pages, 19 figures; submitted to MNRAS; comments welcome; code available at https://github.com/davidparks21/qso_lya_detection_pipeline.
J. Tang, H. fung Leung, Q. Luo, D. Chen, and J. Gong. IJCAI'09: Proceedings of the 21st international jont conference on Artifical intelligence, page 2089--2094. San Francisco, CA, USA, Morgan Kaufmann Publishers Inc., (2009)