From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective., , , , , , , , , и . CoRR, (2024)MATE: Benchmarking Multi-Agent Reinforcement Learning in Distributed Target Coverage Control., , , , , и . NeurIPS, (2022)Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction., , , , , , , и . CoRR, (2024)Red Teaming Game: A Game-Theoretic Framework for Red Teaming Language Models., , , , , , и . CoRR, (2023)OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research., , , , , , , , , и . CoRR, (2023)Proactive Multi-Camera Collaboration for 3D Human Pose Estimation., , , , и . ICLR, OpenReview.net, (2023)AI Alignment: A Comprehensive Survey., , , , , , , , , и 15 other автор(ы). CoRR, (2023)BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset., , , , , , , , , и . CoRR, (2023)Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark., , , , , , , , , и . CoRR, (2023)TorchOpt: An Efficient Library for Differentiable Optimization., , , , , , и . CoRR, (2022)