From post

копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Reflective Oracles: A Foundation for Classical Game Theory.

B. Fallenstein, J. Taylor, и P. Christiano. CoRR, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

Christiano German

Christiano Pesavento

Christiano German

Christiano German

Christiano German

Другие публикации лиц с тем же именем

Model evaluation for extreme risks.T. Shevlane, S. Farquhar, B. Garfinkel, M. Phuong, J. Whittlestone, J. Leung, D. Kokotajlo, N. Marchal, M. Anderljung, N. Kolt и 11 other автор(ы). CoRR, (2023)Deep Reinforcement Learning from Human Preferences.P. Christiano, J. Leike, T. Brown, M. Martic, S. Legg, и D. Amodei. NIPS, стр. 4299-4307. (2017)Reflective Oracles: A Foundation for Game Theory in Artificial Intelligence.B. Fallenstein, J. Taylor, и P. Christiano. LORI, том 9394 из Lecture Notes in Computer Science, стр. 411-415. Springer, (2015)Provably manipulation-resistant reputation systems.P. Christiano. COLT, том 49 из JMLR Workshop and Conference Proceedings, стр. 670-697. JMLR.org, (2016)Lossless Fault-Tolerant Data Structures with Additive Overhead.P. Christiano, E. Demaine, и S. Kishore. WADS, том 6844 из Lecture Notes in Computer Science, стр. 243-254. Springer, (2011)Provably Manipulation-Resistant Reputation Systems.P. Christiano. CoRR, (2014)Supervising strong learners by amplifying weak experts.P. Christiano, B. Shlegeris, и D. Amodei. CoRR, (2018)Learning to summarize with human feedback.N. Stiennon, L. Ouyang, J. Wu, D. Ziegler, R. Lowe, C. Voss, A. Radford, D. Amodei, и P. Christiano. NeurIPS, (2020)Training language models to follow instructions with human feedback.L. Ouyang, J. Wu, X. Jiang, D. Almeida, C. Wainwright, P. Mishkin, C. Zhang, S. Agarwal, K. Slama, A. Ray и 10 other автор(ы). NeurIPS, (2022)A Connection between Generative Adversarial Networks, Inverse Reinforcement Learning, and Energy-Based Models.C. Finn, P. Christiano, P. Abbeel, и S. Levine. CoRR, (2016)

Что такое BibSonomy?: С чего начать; Кнопки для браузера; Помощь
Разработчикам: Обзор; API-документация

Контакт и защита личных данных: о нас; Cookies; Сообщить о проблеме; BibSonomy Вики

Интеграция: PUMA; Расширение для TYPO3; Плагин для; Клиент Java REST; Поддерживаемые источники; далее

О BibSonomy: Команда; Блог; Список рассылки
Социальные сети: Наш Twitter