ConceptNet Numberbatch consists of state-of-the-art semantic vectors (also known as word embeddings) that can be used directly as a representation of word meanings or as a starting point for further machine learning.
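As a rough illustration of using such vectors directly as word representations, the sketch below parses a small word2vec-style text table and compares words by cosine similarity. It assumes the common text layout of a `count dim` header line followed by one `term v1 v2 ...` line per term. The toy vectors, the `load_vectors` and `cosine` helpers, and the variable names are all made up for this example; they are not real Numberbatch data and not part of any Numberbatch API.

```python
import io
import math


def load_vectors(lines):
    """Parse word2vec-style text: an optional 'count dim' header, then 'term v1 v2 ...' rows."""
    vectors = {}
    for i, line in enumerate(lines):
        parts = line.rstrip().split(" ")
        if i == 0 and len(parts) == 2:
            continue  # skip the assumed 'count dim' header line
        if len(parts) < 2:
            continue  # skip blank or malformed lines
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Hypothetical 3-dimensional toy data standing in for a real embedding file.
TOY = """3 3
cat 0.9 0.1 0.0
dog 0.8 0.2 0.1
car 0.0 0.1 0.9
"""

vectors = load_vectors(io.StringIO(TOY))
sim_cat_dog = cosine(vectors["cat"], vectors["dog"])
sim_cat_car = cosine(vectors["cat"], vectors["car"])
print(f"cat~dog: {sim_cat_dog:.3f}, cat~car: {sim_cat_car:.3f}")
```

With a real embedding file, the same `load_vectors` helper could be pointed at an opened text file instead of the in-memory string, provided the file follows the layout assumed here.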
In natural language understanding, there is a hierarchy of lenses through which we can extract meaning: from words to sentences to paragraphs to documents. At the document level, one of the most useful ways to understand a text is to analyze its topics.
G. Marco Baroni, Georgiana Dinu. 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference, (2014)
S. Bordia and S. Bowman. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, pp. 7--15. Minneapolis, Minnesota, Association for Computational Linguistics, (June 2019)
M. Artetxe, G. Labaka, I. Lopez-Gazpio, and E. Agirre. Proceedings of the 22nd Conference on Computational Natural Language Learning, pp. 282--291. Association for Computational Linguistics, (2018)
D. Tang, F. Wei, N. Yang, M. Zhou, T. Liu, and B. Qin. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 1555--1565. Baltimore, Maryland, Association for Computational Linguistics, (June 2014)