Author of the publication

Red Teaming Language Models with Language Models.

, , , , , , , , and . EMNLP, page 3419-3448. Association for Computational Linguistics, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Debating with More Persuasive LLMs Leads to More Truthful Answers., , , , , , , , , and . CoRR, (2024)Towards Understanding Sycophancy in Language Models., , , , , , , , , and 9 other author(s). CoRR, (2023)Few-shot Adaptation Works with UnpredicTable Data., , , , and . CoRR, (2022)Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting., , , and . CoRR, (2023)Specific versus General Principles for Constitutional AI., , , , , , , , , and 26 other author(s). CoRR, (2023)Inverse Scaling: When Bigger Isn't Better., , , , , , , , , and 17 other author(s). CoRR, (2023)Rissanen Data Analysis: Examining Dataset Characteristics via Description Length., , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 8500-8513. PMLR, (2021)Finding Generalizable Evidence by Learning to Convince Q&A Models., , , , , and . EMNLP/IJCNLP (1), page 2402-2411. Association for Computational Linguistics, (2019)Case-based Reasoning for Natural Language Queries over Knowledge Bases., , , , , , , , and . EMNLP (1), page 9594-9611. Association for Computational Linguistics, (2021)ELI5: Long Form Question Answering., , , , , and . ACL (1), page 3558-3567. Association for Computational Linguistics, (2019)