tomotopy is a Python extension of tomoto (Topic Modeling Tool) which is a Gibbs-sampling based topic model library written in C++. It utilizes a vectorization of modern CPUs for maximizing speed. The current version of tomoto supports several major topic models including
In natural language understanding, there is a hierarchy of lenses through which we can extract meaning - from words to sentences to paragraphs to documents. At the document level, one of the most useful ways to understand text is by analyzing its topics.
C. Wang, и D. Blei. Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, стр. 448--456. New York, NY, USA, ACM, (2011)
Y. Chen, H. Dong, и W. Wang. Proceedings of the 2018 International Conference on Data Science and Information Technology, стр. 138--143. New York, NY, USA, ACM, (2018)