In this 3-part blog series we present a unifying perspective on pre-trained word embeddings under a general framework of matrix factorization. The most popular word embedding model, Word2vec, has…
Extracting topics is a good unsupervised data-mining technique to discover the underlying relationships between texts. There are many different approaches with the most popular probably being LDA but…
C. Ding, T. Li, D. Luo, und W. Peng. SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, Seite 831--832. New York, NY, USA, ACM, (2008)
E. Gaussier, und C. Goutte. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, Seite 601--602. New York, NY, USA, ACM, (2005)