copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context

U. Khandelwal, H. He, P. Qi, and D. Jurafsky. (2018)cite arxiv:1805.04623Comment: ACL 2018.

Abstract

We know very little about how neural language models (LM) use prior linguistic context. In this paper, we investigate the role of context in an LSTM LM, through ablation studies. Specifically, we analyze the increase in perplexity when prior context words are shuffled, replaced, or dropped. On two standard datasets, Penn Treebank and WikiText-2, we find that the model is capable of using about 200 tokens of context on average, but sharply distinguishes nearby context (recent 50 tokens) from the distant history. The model is highly sensitive to the order of words within the most recent sentence, but ignores word order in the long-range context (beyond 50 tokens), suggesting the distant past is modeled only as a rough semantic field or topic. We further find that the neural caching model (Grave et al., 2017b) especially helps the LSTM to copy words from within this distant context. Overall, our analysis not only provides a better understanding of how neural LMs use their context, but also sheds light on recent success from cache-based models.

Description

[1805.04623] Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context

Links and resources

BibTeX key: khandelwal2018sharp
entry type: misc
year: 2018
url: http://arxiv.org/abs/1805.04623
note: cite arxiv:1805.04623Comment: ACL 2018

@mcreinhardt's tags highlighted

Cite this publication

search on

Meta data

Last update 3 years ago
Created 3 years ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context

Abstract

Description

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Sharp Nearby, Fuzzy Far Away: How Neural Language Models Use Context

Comments and Reviews
(0)