From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Predictability and Surprise in Large Generative Models., , , , , , , , , и 20 other автор(ы). CoRR, (2022)Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training., , , , , , , , , и 29 other автор(ы). CoRR, (2024)Language Models (Mostly) Know What They Know, , , , , , , , , и 26 other автор(ы). (2022)cite arxiv:2207.05221Comment: 23+17 pages; refs added, typos fixed.Measuring Progress on Scalable Oversight for Large Language Models., , , , , , , , , и 36 other автор(ы). CoRR, (2022)Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback., , , , , , , , , и 21 other автор(ы). CoRR, (2022)The Capacity for Moral Self-Correction in Large Language Models., , , , , , , , , и 39 other автор(ы). CoRR, (2023)Language Models (Mostly) Know What They Know., , , , , , , , , и 26 other автор(ы). CoRR, (2022)In-context Learning and Induction Heads., , , , , , , , , и 16 other автор(ы). CoRR, (2022)Specific versus General Principles for Constitutional AI., , , , , , , , , и 26 other автор(ы). CoRR, (2023)Discovering Language Model Behaviors with Model-Written Evaluations., , , , , , , , , и 53 other автор(ы). ACL (Findings), стр. 13387-13434. Association for Computational Linguistics, (2023)