Author of the publication

Red Teaming Language Models with Language Models.

E. Perez, S. Huang, H. Song, T. Cai, R. Ring, J. Aslanides, A. Glaese, N. McAleese, and G. Irving. EMNLP, page 3419-3448. Association for Computational Linguistics, (2022)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to that person.

Ethan gets

Henryk Ethan Vorast

Other publications of authors with the same name

Debating with More Persuasive LLMs Leads to More Truthful Answers. A. Khan, J. Hughes, D. Valentine, L. Ruis, K. Sachan, A. Radhakrishnan, E. Grefenstette, S. Bowman, T. Rocktäschel, and E. Perez. CoRR, (2024)

Towards Understanding Sycophancy in Language Models. M. Sharma, M. Tong, T. Korbak, D. Duvenaud, A. Askell, S. Bowman, N. Cheng, E. Durmus, Z. Hatfield-Dodds, S. Johnston and 9 other author(s). CoRR, (2023)

Few-shot Adaptation Works with UnpredicTable Data. J. Chan, M. Pieler, J. Jao, J. Scheurer, and E. Perez. CoRR, (2022)

Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting. M. Turpin, J. Michael, E. Perez, and S. Bowman. CoRR, (2023)

Specific versus General Principles for Constitutional AI. S. Kundu, Y. Bai, S. Kadavath, A. Askell, A. Callahan, A. Chen, A. Goldie, A. Balwit, A. Mirhoseini, B. McLean and 26 other author(s). CoRR, (2023)

Inverse Scaling: When Bigger Isn't Better. I. McKenzie, A. Lyzhov, M. Pieler, A. Parrish, A. Mueller, A. Prabhu, E. McLean, A. Kirtland, A. Ross, A. Liu and 17 other author(s). CoRR, (2023)

Rissanen Data Analysis: Examining Dataset Characteristics via Description Length. E. Perez, D. Kiela, and K. Cho. ICML, volume 139 of Proceedings of Machine Learning Research, page 8500-8513. PMLR, (2021)

Finding Generalizable Evidence by Learning to Convince Q&A Models. E. Perez, S. Karamcheti, R. Fergus, J. Weston, D. Kiela, and K. Cho. EMNLP/IJCNLP (1), page 2402-2411. Association for Computational Linguistics, (2019)

Case-based Reasoning for Natural Language Queries over Knowledge Bases. R. Das, M. Zaheer, D. Thai, A. Godbole, E. Perez, J. Lee, L. Tan, L. Polymenakos, and A. McCallum. EMNLP (1), page 9594-9611. Association for Computational Linguistics, (2021)

ELI5: Long Form Question Answering. A. Fan, Y. Jernite, E. Perez, D. Grangier, J. Weston, and M. Auli. ACL (1), page 3558-3567. Association for Computational Linguistics, (2019)

BibSonomy

Disambiguation of "Perez, Ethan"
