In this three-part blog series, we present a unifying perspective on pre-trained word embeddings under a general framework of matrix factorization. The most popular word embedding model, Word2vec, has…
Topic extraction is a useful unsupervised data-mining technique for discovering the underlying relationships between texts. There are many different approaches, the most popular probably being LDA, but…
C. Ding, T. Li, D. Luo, and W. Peng. SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, pp. 831–832. New York, NY, USA, ACM, (2008)
E. Gaussier and C. Goutte. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 601–602. New York, NY, USA, ACM, (2005)