Language Model Cascades

D. Dohan, W. Xu, A. Lewkowycz, J. Austin, D. Bieber, R. Lopes, Y. Wu, H. Michalewski, R. Saurous, J. Sohl-Dickstein, K. Murphy, and C. Sutton.
(2022). arXiv:2207.10342. Comment: Presented as spotlight at the Beyond Bayes workshop at ICML 2022 (https://beyond-bayes.github.io).

Abstract

Prompted models have demonstrated impressive few-shot learning abilities. Repeated interactions at test-time with a single model, or the composition of multiple models together, further expands capabilities. These compositions are probabilistic models, and may be expressed in the language of graphical models with random variables whose values are complex data types such as strings. Cases with control flow and dynamic structure require techniques from probabilistic programming, which allow implementing disparate model structures and inference strategies in a unified language. We formalize several existing techniques from this perspective, including scratchpads / chain of thought, verifiers, STaR, selection-inference, and tool use. We refer to the resulting programs as language model cascades.
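As an illustration of the probabilistic view described in the abstract, the following is a minimal sketch of a scratchpad / chain-of-thought cascade, assuming a question Q, a latent string-valued thought T, and an answer A, with p(A | Q) estimated by sampling T ~ p(T | Q) and then A ~ p(A | Q, T). The functions `sample_thought` and `sample_answer` are hypothetical stand-ins for language model calls, and the toy thoughts and weights are made up for the example; none of this is taken from the paper's own code.

```python
import random
from collections import Counter


def sample_thought(question, rng):
    """Hypothetical stand-in for sampling T ~ p(T | Q) from a language model."""
    # Toy distribution over string-valued "thoughts" (an assumption for illustration).
    thoughts = ["2 + 3 = 5", "2 + 3 = 5, so the total is 5", "2 * 3 = 6"]
    weights = [0.5, 0.3, 0.2]
    return rng.choices(thoughts, weights=weights, k=1)[0]


def sample_answer(question, thought, rng):
    """Hypothetical stand-in for sampling A ~ p(A | Q, T)."""
    # Toy rule: the answer is the last token of the thought.
    # A real model would be stochastic here; rng is kept for that case.
    return thought.replace(",", " ").split()[-1]


def estimate_answer_marginal(question, num_samples=1000, seed=0):
    """Monte Carlo estimate of p(A | Q), marginalizing over the latent thought T."""
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(num_samples):
        thought = sample_thought(question, rng)           # T ~ p(T | Q)
        answer = sample_answer(question, thought, rng)    # A ~ p(A | Q, T)
        counts[answer] += 1
    return {answer: count / num_samples for answer, count in counts.items()}


if __name__ == "__main__":
    marginal = estimate_answer_marginal("What is 2 plus 3?")
    print(marginal)
```

In this sketch, different thoughts that lead to the same answer pool their probability mass, which is the effect of treating the thought as a latent variable rather than as part of the output.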

BibTeX key: dohan2022language
entry type: article
year: 2022
url: http://arxiv.org/abs/2207.10342
note: arXiv:2207.10342. Comment: Presented as spotlight at the Beyond Bayes workshop at ICML 2022 (https://beyond-bayes.github.io)
