Author of the publication


Offline stochastic shortest path: Learning, evaluation and towards optimality.

M. Yin, W. Chen, M. Wang, and Y. Wang. UAI, volume 180 of Proceedings of Machine Learning Research, page 2278-2288. PMLR, (2022)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

Mengdi Wang

Wang Wang

Linfang Wang

Investigation of the functions of 53BP1 in DNA demethylation. L. Wang. Uni Marburg, (2009)

Paul Wang

Lien Wang

Other publications of authors with the same name

Online Sparse Reinforcement Learning. B. Hao, T. Lattimore, C. Szepesvári, and M. Wang. CoRR, (2020)

Voting-Based Multiagent Reinforcement Learning for Intelligent IoT. Y. Xu, Z. Deng, M. Wang, W. Xu, A. So, and S. Cui. IEEE Internet Things J., 8 (4): 2681-2693 (2021)

Byzantine-Robust Online and Offline Distributed Reinforcement Learning. Y. Chen, X. Zhang, K. Zhang, M. Wang, and X. Zhu. AISTATS, volume 206 of Proceedings of Machine Learning Research, page 3230-3269. PMLR, (2023)

A Novel Privacy-Preserving Data Gathering Scheme in WSN Based on Compressive Sensing and Embedding. M. Wang, D. Xiao, and Z. Ao. ICC, page 1-6. IEEE, (2019)

A Practice to Search the Summit of a DEM Using Simulated Annealing Technique. M. Wang, and K. Zhang. Geoinformatics, page 1-5. IEEE, (2018)

A Many-Core Accelerator Design for On-Chip Deep Reinforcement Learning. Y. Wang, M. Wang, B. Li, H. Li, and X. Li. ICCAD, page 46:1-46:7. IEEE, (2020)

Visual Adversarial Examples Jailbreak Aligned Large Language Models. X. Qi, K. Huang, A. Panda, P. Henderson, M. Wang, and P. Mittal. AAAI, page 21527-21536. AAAI Press, (2024)

Decentralized Gossip-Based Stochastic Bilevel Optimization over Communication Networks. S. Yang, X. Zhang, and M. Wang. NeurIPS, (2022)

Variational Policy Gradient Method for Reinforcement Learning with General Utilities. J. Zhang, A. Koppel, A. Bedi, C. Szepesvári, and M. Wang. NeurIPS, (2020)

On the Convergence and Sample Efficiency of Variance-Reduced Policy Gradient Method. J. Zhang, C. Ni, Z. Yu, C. Szepesvári, and M. Wang. NeurIPS, page 2228-2240. (2021)

BibSonomy

Disambiguation of "Wang, Mengdi"
