Author of the publication

ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models.

, , , , , and . CoRR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Estimation of Leaf Water Use Efficiency Threshold Values for Water Stress in Winter Wheat (Triticum aestivum L.)., , , , , , and . J. Sensors, (2020)Provable Adaptivity in Adam., , , , , , and . CoRR, (2022)Uncertainty and Explainable Analysis of Machine Learning Model for Reconstruction of Sonic Slowness Logs., , , , , , and . CoRR, (2023)Effects of Micropore Group Spacing and Irrigation Amount on Soil Respiration and Yield of Tomato with Microsprinkler Irrigation under Plastic Film in Greenhouse., , , , , , , and . J. Sensors, (2021)Adam Can Converge Without Any Modification On Update Rules., , , , and . NeurIPS, (2022)Effect of Microsprinkler Irrigation under Plastic Film on Photosynthesis and Fruit Yield of Greenhouse Tomato., , , , , , , and . J. Sensors, (2020)Why Transformers Need Adam: A Hessian Perspective., , , , , and . CoRR, (2024)Communication Efficiency Optimization of Federated Learning for Computing and Network Convergence of 6G Networks., , , , , , and . CoRR, (2023)Identification of Differentially Expressed Genes Associated with Idiopathic Pulmonary Arterial Hypertension by Integrated Bioinformatics Approaches., , and . J. Comput. Biol., 28 (1): 79-88 (2021)When Expressivity Meets Trainability: Fewer than $n$ Neurons Can Work., , , , and . NeurIPS, page 9167-9180. (2021)