@dallmann

Wikipedia2Vec: An Optimized Tool for Learning Embeddings of Words and Entities from Wikipedia

, , , , and . (2018)cite arxiv:1812.06280.

Abstract

We present Wikipedia2Vec, an open source tool for learning embeddings of words and entities from Wikipedia. This tool enables users to easily obtain high-quality embeddings of words and entities from a Wikipedia dump with a single command. The learned embeddings can be used as features in downstream natural language processing (NLP) models. The tool can be installed via PyPI. The source code, documentation, and pretrained embeddings for 12 major languages can be obtained at http://wikipedia2vec.github.io.

Description

Wikipedia2Vec: An Optimized Tool for Learning Embeddings of Words and Entities from Wikipedia

Links and resources

Tags

community

  • @dallmann
  • @dblp
@dallmann's tags highlighted