From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Fine-Tuning Language Models from Human Preferences., , , , , , , и . CoRR, (2019)Electrical Flows, Laplacian Systems, and Faster Approximation of Maximum Flow in Undirected Graphs, , , , и . Proceedings of the Forty-third Annual ACM Symposium on Theory of Computing, стр. 273--282. New York, NY, USA, ACM, (2011)Learning to summarize from human feedback., , , , , , , , и . CoRR, (2020)Theano: A Python framework for fast computation of mathematical expressions, , , , , , , , , и 103 other автор(ы). (2016)cite arxiv:1605.02688Comment: 19 pages, 5 figures.Manipulation-resistant online learning.. University of California, Berkeley, USA, (2017)base-search.net (ftcdlib:qt0w22c86t).Model evaluation for extreme risks., , , , , , , , , и 11 other автор(ы). CoRR, (2023)Deep Reinforcement Learning from Human Preferences., , , , , и . NIPS, стр. 4299-4307. (2017)Lossless Fault-Tolerant Data Structures with Additive Overhead., , и . WADS, том 6844 из Lecture Notes in Computer Science, стр. 243-254. Springer, (2011)Reflective Oracles: A Foundation for Game Theory in Artificial Intelligence., , и . LORI, том 9394 из Lecture Notes in Computer Science, стр. 411-415. Springer, (2015)Provably manipulation-resistant reputation systems.. COLT, том 49 из JMLR Workshop and Conference Proceedings, стр. 670-697. JMLR.org, (2016)