Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications of persons with the same name

Model evaluation for extreme risks. and 11 other author(s). CoRR, (2023)

Deep Reinforcement Learning from Human Preferences. NIPS, pp. 4299-4307. (2017)

Reflective Oracles: A Foundation for Game Theory in Artificial Intelligence. LORI, volume 9394 of Lecture Notes in Computer Science, pp. 411-415. Springer, (2015)

Provably manipulation-resistant reputation systems. COLT, volume 49 of JMLR Workshop and Conference Proceedings, pp. 670-697. JMLR.org, (2016)

Lossless Fault-Tolerant Data Structures with Additive Overhead. WADS, volume 6844 of Lecture Notes in Computer Science, pp. 243-254. Springer, (2011)

Provably Manipulation-Resistant Reputation Systems. CoRR, (2014)

Supervising strong learners by amplifying weak experts. CoRR, (2018)

Learning to summarize with human feedback. NeurIPS, (2020)

Training language models to follow instructions with human feedback. and 10 other author(s). NeurIPS, (2022)

A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models. CoRR, (2016)