Knowledge Enhanced Contextual Word Representations

M. Peters, M. Neumann, R. Logan, R. Schwartz, V. Joshi, S. Singh, and N. Smith.
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 43--54. Hong Kong, China, Association for Computational Linguistics (November 2019)
DOI: 10.18653/v1/D19-1005

Abstract

Contextual word representations, typically trained on unstructured, unlabeled text, do not contain any explicit grounding to real world entities and are often unable to remember facts about those entities. We propose a general method to embed multiple knowledge bases (KBs) into large scale models, and thereby enhance their representations with structured, human-curated knowledge. For each KB, we first use an integrated entity linker to retrieve relevant entity embeddings, then update contextual word representations via a form of word-to-entity attention. In contrast to previous approaches, the entity linkers and self-supervised language modeling objective are jointly trained end-to-end in a multitask setting that combines a small amount of entity linking supervision with a large amount of raw text. After integrating WordNet and a subset of Wikipedia into BERT, the knowledge enhanced BERT (KnowBert) demonstrates improved perplexity, ability to recall facts as measured in a probing task and downstream performance on relationship extraction, entity typing, and word sense disambiguation. KnowBert's runtime is comparable to BERT's and it scales to large KBs.

BibTeX key: peters-etal-2019-knowledge
entry type: inproceedings
address: Hong Kong, China
booktitle: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
year: 2019
month: nov
pages: 43--54
publisher: Association for Computational Linguistics
DOI: 10.18653/v1/D19-1005
url: https://www.aclweb.org/anthology/D19-1005

Cite this publication

@inproceedings{peters-etal-2019-knowledge,
  abstract = {Contextual word representations, typically trained on unstructured, unlabeled text, do not contain any explicit grounding to real world entities and are often unable to remember facts about those entities. We propose a general method to embed multiple knowledge bases (KBs) into large scale models, and thereby enhance their representations with structured, human-curated knowledge. For each KB, we first use an integrated entity linker to retrieve relevant entity embeddings, then update contextual word representations via a form of word-to-entity attention. In contrast to previous approaches, the entity linkers and self-supervised language modeling objective are jointly trained end-to-end in a multitask setting that combines a small amount of entity linking supervision with a large amount of raw text. After integrating WordNet and a subset of Wikipedia into BERT, the knowledge enhanced BERT (KnowBert) demonstrates improved perplexity, ability to recall facts as measured in a probing task and downstream performance on relationship extraction, entity typing, and word sense disambiguation. KnowBert{'}s runtime is comparable to BERT{'}s and it scales to large KBs.},
  added-at = {2020-09-14T23:36:50.000+0200},
  address = {Hong Kong, China},
  author = {Peters, Matthew E. and Neumann, Mark and Logan, Robert and Schwartz, Roy and Joshi, Vidur and Singh, Sameer and Smith, Noah A.},
  biburl = {https://www.bibsonomy.org/bibtex/260918c5ccebed462ff63d11c63a8a9cb/schwemmlein},
  booktitle = {Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)},
  description = {Knowledge Enhanced Contextual Word Representations - ACL Anthology},
  doi = {10.18653/v1/D19-1005},
  interhash = {e8c27259626fab413b2e2411bd0ddd11},
  intrahash = {60918c5ccebed462ff63d11c63a8a9cb},
  keywords = {antrag bert deconspire graph kg knowledge language model nlp},
  month = nov,
  pages = {43--54},
  publisher = {Association for Computational Linguistics},
  timestamp = {2020-09-14T23:40:22.000+0200},
  title = {Knowledge Enhanced Contextual Word Representations},
  url = {https://www.aclweb.org/anthology/D19-1005},
  year = 2019
}
