Other publications of persons with the same name

Teaching Large Language Models to Reason with Reinforcement Learning. CoRR, (2024)

Understanding the Effects of RLHF on LLM Generalisation and Diversity. CoRR, (2023)

LLaMA: Open and Efficient Foundation Language Models. CoRR, (2023)

Generalization to New Sequential Decision Making Tasks with In-Context Learning. ICML, OpenReview.net, (2024)

Dungeons and Data: A Large-Scale NetHack Dataset. NeurIPS, (2022)

Know When To Stop: A Study of Semantic Drift in Text Generation. NAACL-HLT, pp. 3656-3671. Association for Computational Linguistics, (2024)

Llama: Open and efficient foundation language models. arXiv preprint arXiv:2302.13971, (2023)

Understanding the Effects of RLHF on LLM Generalisation and Diversity. ICLR, OpenReview.net, (2024)

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research. NeurIPS Datasets and Benchmarks, (2021)