Author of the publication

Off-Policy Reinforcement Learning with Delayed Rewards.

, , , , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 8280-8303. PMLR, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Grid Resource Scheduling Algorithm Based on the Utility Optimization., , and . Complex (2), volume 5 of Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, page 1355-1362. Springer, (2009)Finding reinforced structural hole spanners in social networks via node embedding., , and . Intell. Data Anal., 27 (1): 297-318 (2023)Workplace loneliness, ego depletion and cyberloafing: can leader problem-focused interpersonal emotion management help?, , , and . Internet Res., 33 (4): 1473-1494 (2023)Validation of Satellite Soil Moisture Products by Sparsification of Ground Observations., , , , , and . IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens., (2024)Understanding the Importance of Single Directions via Representative Substitution., , , , , and . CoRR, (2019)Accelerating Nonconvex Learning via Replica Exchange Langevin Diffusion., , , , and . CoRR, (2020)S2OMGAN: Shortcut from Remote Sensing Images to Online Maps., , , , , , and . CoRR, (2020)Understanding the Importance of Single Directions via Representative Substitution., , , , , and . CoRR, (2018)Overcoming Long-term Catastrophic Forgetting through Adversarial Neural Pruning and Synaptic Consolidation., , , , , , and . CoRR, (2019)Split Time Series into Patches: Rethinking Long-term Series Forecasting with Dateformer., , , , and . CoRR, (2022)