Author of the publication

Offline stochastic shortest path: Learning, evaluation and towards optimality.

UAI, volume 180 of Proceedings of Machine Learning Research, page 2278-2288. PMLR, (2022)


Other publications of authors with the same name

Online Sparse Reinforcement Learning. CoRR, (2020)
Voting-Based Multiagent Reinforcement Learning for Intelligent IoT. IEEE Internet Things J., 8 (4): 2681-2693 (2021)
Byzantine-Robust Online and Offline Distributed Reinforcement Learning. AISTATS, volume 206 of Proceedings of Machine Learning Research, page 3230-3269. PMLR, (2023)
A Novel Privacy-Preserving Data Gathering Scheme in WSN Based on Compressive Sensing and Embedding. ICC, page 1-6. IEEE, (2019)
A Practice to Search the Summit of a DEM Using Simulated Annealing Technique. Geoinformatics, page 1-5. IEEE, (2018)
A Many-Core Accelerator Design for On-Chip Deep Reinforcement Learning. ICCAD, page 46:1-46:7. IEEE, (2020)
Visual Adversarial Examples Jailbreak Aligned Large Language Models. AAAI, page 21527-21536. AAAI Press, (2024)
Decentralized Gossip-Based Stochastic Bilevel Optimization over Communication Networks. NeurIPS, (2022)
Variational Policy Gradient Method for Reinforcement Learning with General Utilities. NeurIPS, (2020)
On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method. NeurIPS, page 2228-2240. (2021)