The paper discusses the capabilities of large pre-trained language models and their limitations in accessing and manipulating knowledge. The authors introduce retrieval-augmented generation (RAG) models that combine pre-trained parametric and non-parametric memory for language generation. The study explores the effectiveness of RAG models in various NLP tasks and compares them with other architectures.
arXiv is a free distribution service and an open-access archive for 2,316,761 scholarly articles in the fields of physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering and systems science, and economics. Materials on this site are not peer-reviewed by arXiv.
arXiv is a free distribution service and an open-access archive for 2,310,555 scholarly articles in the fields of physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering and systems science, and economics. Materials on this site are not peer-reviewed by arXiv.
H. Neilson, L. Rousseau-Nepton, S. Lawler, and K. Spekkens. (2019)cite arxiv:1910.02976Comment: 11 pages. Community paper submitted to the Canadian Long Range Plan 2020, https://casca.ca/?page_id=11499lrp2020/.
A. Cimatti, F. Fraternali, and C. Nipoti. (2019)cite arxiv:1912.06216Comment: 17 pages, 3 figures, first introductory chapter of the textbook published by Cambridge University Press. For more information https://decdb4ae-c884-4971-9114-5f11b6929fd9.filesusr.com/ugd/f44359_26d2207ea96e4f359636feb5b7473336.pdf.