Author of the publication

Reflect-RL: Two-Player Online RL Fine-Tuning for LMs.

ACL (1), pages 995-1015. Association for Computational Linguistics, (2024)

Other publications of authors with the same name

Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels. (2019)

A Reduction-based Framework for Sequential Decision Making with Delayed Feedback. CoRR, (2023)

When is particle filtering efficient for planning in partially observed linear dynamical systems? UAI, volume 161 of Proceedings of Machine Learning Research, pages 728-737. AUAI Press, (2021)

Hypothesis Transfer Learning via Transformation Functions. NIPS, pages 574-584. (2017)

On the Power of Truncated SVD for General High-rank Matrix Estimation Problems. NIPS, pages 445-455. (2017)

Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels. NeurIPS, pages 5724-5734. (2019)

Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality. NeurIPS, (2020)

Hitting Time of Stochastic Gradient Langevin Dynamics to Stationary Points: A Direct Analysis. CoRR, (2019)

On the Power of Over-parametrization in Neural Networks with Quadratic Activation. ICML, volume 80 of Proceedings of Machine Learning Research, pages 1328-1337. PMLR, (2018)

Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima. ICML, volume 80 of Proceedings of Machine Learning Research, pages 1338-1347. PMLR, (2018)