The paper discusses the capabilities of large pre-trained language models and their limitations in accessing and manipulating knowledge. The authors introduce retrieval-augmented generation (RAG) models that combine pre-trained parametric and non-parametric memory for language generation. The study explores the effectiveness of RAG models in various NLP tasks and compares them with other architectures.
arXiv is a free distribution service and an open-access archive for 2,316,761 scholarly articles in the fields of physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering and systems science, and economics. Materials on this site are not peer-reviewed by arXiv.
A. Eyre and J. Binney. (2010). arXiv:1011.3672.
Comment: 23 pages, 20 figures, submitted to MNRAS. Now includes reference list, minor corrections to arXiv metadata.
C. Milne, P. Kim, J. Eddy, and N. Price. (2009). arXiv:0912.2955.
Comment: Highlighted in "In this issue" section of Biotechnology Journal 12/2009.