Author of the publication

Please choose the person to associate with this publication.

To distinguish between people with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.

Other publications of authors with the same name

Metric Residual Network for Sample Efficient Goal-Conditioned Reinforcement Learning. AAAI, page 8799-8806. AAAI Press, (2023)

Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds. ICLR, OpenReview.net, (2021)

HIVE: Harnessing Human Feedback for Instructional Visual Editing. CoRR, (2023)

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization. ICLR, OpenReview.net, (2024)

Demand Prediction by Incorporating Internet-of-Things Data: A Case of Automobile Repair and Maintenance Service. HICSS, page 5017-5026. ScholarSpace, (2024)

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward. CoRR, (2024)

Accountable Off-Policy Evaluation With Kernel Bellman Statistics. ICML, volume 119 of Proceedings of Machine Learning Research, page 3102-3111. PMLR, (2020)

Unsupervised Out-of-Domain Detection via Pre-trained Transformers. ACL/IJCNLP (1), page 1052-1061. Association for Computational Linguistics, (2021)

Action-dependent Control Variates for Policy Optimization via Stein Identity. ICLR (Poster), OpenReview.net, (2018)

Knowledge-guided Semantic Computing Network. CoRR, (2018)