
Integrating and Evaluating Neural Word Embeddings in Information Retrieval

Guido Zuccon, Bevan Koopman, Peter Bruza, and Leif Azzopardi. Proceedings of the 20th Australasian Document Computing Symposium, pages 12:1--12:8. New York, NY, USA, ACM, 2015.
DOI: 10.1145/2838931.2838936

Abstract

Recent advances in neural language models have contributed new methods for learning distributed vector representations of words (also called word embeddings). Two such methods are the continuous bag-of-words model and the skip-gram model. These methods have been shown to produce embeddings that capture higher-order relationships between words and that are highly effective in natural language processing tasks involving word similarity and word analogy. Despite these promising results, there has been little analysis of the use of these word embeddings for retrieval. Motivated by this observation, in this paper we set out to determine how these word embeddings can be used within a retrieval model and what benefit they might bring. To this end, we use neural word embeddings within the well-known translation language model for information retrieval. This language model captures implicit semantic relations between the words in queries and those in relevant documents, thus producing more accurate estimations of document relevance. The word embeddings used to estimate neural language models produce translations that differ from those of previous translation language model approaches, and these differences deliver improvements in retrieval effectiveness. The models are robust to the choices made in building the word embeddings; indeed, our results show that the embeddings need not even be produced from the same corpus used for retrieval.
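The central step the abstract describes, plugging embedding similarities into the translation language model, can be sketched in a few lines. Below is a minimal, hypothetical Python sketch, not the authors' implementation: it assumes toy embedding vectors in place of a trained CBOW or skip-gram model, derives translation probabilities p_t(w|u) by normalising non-negative cosine similarities over the vocabulary, and scores a query as p(q|d) = prod_{w in q} sum_{u in d} p_t(w|u) * p(u|d). A real system would additionally smooth p(u|d) against a collection model.

import numpy as np

# Toy embedding vectors standing in for a trained CBOW or skip-gram model;
# the words, dimensions, and values are illustrative only.
embeddings = {
    "car":     np.array([0.9, 0.1, 0.0]),
    "vehicle": np.array([0.8, 0.2, 0.1]),
    "banana":  np.array([0.0, 0.9, 0.4]),
}

def cosine(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def translation_probs(u, vocab):
    # p_t(w|u): non-negative cosine similarity of w to u, normalised over
    # the vocabulary so the probabilities sum to 1.
    sims = {w: max(cosine(embeddings[w], embeddings[u]), 0.0) for w in vocab}
    total = sum(sims.values()) or 1.0
    return {w: s / total for w, s in sims.items()}

def tlm_score(query_terms, doc_terms, vocab):
    # log p(q|d) = sum_w log( sum_u p_t(w|u) * p_ml(u|d) ),
    # with p_ml(u|d) the maximum-likelihood document language model.
    doc_len = len(doc_terms)
    score = 0.0
    for w in query_terms:
        p_w = sum(
            translation_probs(u, vocab)[w] * doc_terms.count(u) / doc_len
            for u in set(doc_terms)
        )
        score += np.log(p_w + 1e-12)  # small floor avoids log(0)
    return score

vocab = list(embeddings)
# "car" never occurs in the document, yet the embedding-based translation
# from "vehicle" still gives the document a non-trivial score.
print(tlm_score(["car"], ["vehicle", "banana"], vocab))

This is what distinguishes the approach from exact-match language models: the translation probabilities let a document score well on query terms it never contains, provided it contains semantically related terms.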
