An Empirical Evaluation of doc2vec with Practical Insights into Document Embedding Generation

J. Lau and T. Baldwin (2016). arXiv:1607.05368. Comment: 1st Workshop on Representation Learning for NLP.

Abstract

Recently, Le and Mikolov (2014) proposed doc2vec as an extension to word2vec (Mikolov et al., 2013a) to learn document-level embeddings. Despite promising results in the original paper, others have struggled to reproduce those results. This paper presents a rigorous empirical evaluation of doc2vec over two tasks. We compare doc2vec to two baselines and two state-of-the-art document embedding methodologies. We found that doc2vec performs robustly when using models trained on large external corpora, and can be further improved by using pre-trained word embeddings. We also provide recommendations on hyper-parameter settings for general purpose applications, and release source code to induce document embeddings using our trained doc2vec models.

Links and resources

BibTeX key: lau2016empirical
entry type: misc
year: 2016
url: http://arxiv.org/abs/1607.05368
note: arXiv:1607.05368. Comment: 1st Workshop on Representation Learning for NLP

Meta data

Last update 4 years ago
Created 4 years ago
