When a word appears in different contexts, its vector gets pulled in different directions during updates. The final vector then represents a kind of weighted average over the various contexts. Averaging vectors that point in different directions typically yields a shorter vector, and the more distinct the contexts a word appears in, the shorter its vector becomes. For a word to be usable in many different contexts, it must carry little specific meaning. Prime examples of such insignificant words are high-frequency stop words, which are indeed represented by short vectors despite their high term frequencies ...
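This shrinking effect is easy to see with a small simulation: averaging unit vectors drawn from many different directions gives a short mean vector, while averaging vectors that all point roughly the same way gives a long one. The snippet below is only an illustration of that geometric fact (with a made-up `spread` knob standing in for how varied a word's contexts are), not of word2vec's actual update rule.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 100

def mean_norm(num_contexts, spread):
    # One "context direction" per context: a shared base direction plus
    # Gaussian noise whose scale controls how different the contexts are.
    base = rng.normal(size=dim)
    base /= np.linalg.norm(base)
    vecs = base + spread * rng.normal(size=(num_contexts, dim))
    vecs /= np.linalg.norm(vecs, axis=1, keepdims=True)  # unit vectors
    return np.linalg.norm(vecs.mean(axis=0))             # length of their average

# Consistent usage (small spread) keeps the average long;
# many unrelated contexts (large spread) shrink it toward zero.
print(mean_norm(num_contexts=1000, spread=0.02))  # close to 1
print(mean_norm(num_contexts=1000, spread=5.0))   # close to 0
```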
If the downstream applications only care about the direction of the word vectors (e.g. they only pay attention to the cosine similarity of two words), then normalize and forget about length.
However, if the downstream applications are able to (or need to) take other aspects into account, such as word significance or consistency in word usage (see below), then normalization might not be such a good idea.
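As a concrete illustration of the trade-off, assuming the raw embeddings sit in a NumPy array, L2-normalizing them leaves every cosine similarity unchanged but discards the per-word length, which is exactly where the significance signal lives. The vectors here are toy values, not real embeddings.

```python
import numpy as np

def l2_normalize(vectors):
    """Scale each row to unit length; cosine similarities are unaffected."""
    norms = np.linalg.norm(vectors, axis=1, keepdims=True)
    return vectors / norms

def cosine(u, v):
    return u @ v / (np.linalg.norm(u) * np.linalg.norm(v))

# Toy vectors standing in for learned embeddings.
vectors = np.array([[0.1, 0.05, 0.02],    # a frequent, low-content word
                    [1.3, -0.9, 0.7]])    # a rarer, more specific word

unit = l2_normalize(vectors)

# Direction-only information survives normalization ...
print(cosine(vectors[0], vectors[1]))  # same value before ...
print(cosine(unit[0], unit[1]))        # ... and after normalization

# ... but the length signal is gone: every word now has norm 1.
print(np.linalg.norm(vectors, axis=1))  # differing lengths
print(np.linalg.norm(unit, axis=1))     # all ones
```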
Facebook Research recently open-sourced a great project – fastText, a fast (no surprise) and effective method for learning word representations and performing text classification. I was curious how these embeddings compare to other commonly used embeddings, and word2vec seemed like the obvious baseline, especially since fastText embeddings are an extension of word2vec.
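For the comparison itself I would reach for something like gensim, which ships trainable implementations of both models (that tooling choice is mine, not part of the fastText release). A minimal sketch of training both on the same corpus might look like this, with a tiny stand-in corpus in place of real data:

```python
from gensim.models import Word2Vec, FastText

# Tiny stand-in corpus; in practice this would be a real tokenized corpus.
sentences = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["the", "dog", "sat", "on", "the", "rug"],
    ["cats", "and", "dogs", "are", "pets"],
]

# Train both models with matching hyperparameters so the embeddings are comparable.
w2v = Word2Vec(sentences, vector_size=50, window=3, min_count=1, sg=1, epochs=50)
ft = FastText(sentences, vector_size=50, window=3, min_count=1, sg=1, epochs=50)

# Same query against both embedding spaces.
print(w2v.wv.most_similar("cat", topn=3))
print(ft.wv.most_similar("cat", topn=3))

# fastText composes vectors from character n-grams, so it can also return
# a vector for a word it never saw during training, unlike word2vec.
print(ft.wv["kitteh"])
```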