I gave an introductory talk on word embeddings in the past, and this write-up is an extended version of the part about the philosophical ideas behind word vectors.
In natural language understanding, there is a hierarchy of lenses through which we can extract meaning: from words to sentences to paragraphs to documents. At the document level, one of the most useful ways to understand text is by analyzing its topics.
M. Kusner, Y. Sun, N. Kolkin, and K. Weinberger. Proceedings of the 32nd International Conference on Machine Learning - Volume 37, pp. 957-966. JMLR.org, (2015)
M. Artetxe, G. Labaka, I. Lopez-Gazpio, and E. Agirre. Proceedings of the 22nd Conference on Computational Natural Language Learning, pp. 282-291. Association for Computational Linguistics, (2018)
L. Hettinger, A. Zehe, A. Dallmann, and A. Hotho. INFORMATIK 2019: 50 Jahre Gesellschaft für Informatik – Informatik für Gesellschaft, pp. 191-204. Bonn, Gesellschaft für Informatik e.V., (2019)
S. Bordia and S. Bowman. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, pp. 7-15. Minneapolis, Minnesota, Association for Computational Linguistics, (June 2019)