Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm.

Y. Tang, T. Kozuno, M. Rowland, A. Harutyunyan, R. Munos, B. Pires, and M. Valko. ICML, volume 202 of Proceedings of Machine Learning Research, page 33657-33673. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Tadashi Obara

Tadashi Otsuru

Tadashi Makabe

Tadashi Kito

Tadashi Yamagata

Other publications of authors with the same name

Theoretical Analysis of Efficiency and Robustness of Softmax and Gap-Increasing Operators in Reinforcement Learning.T. Kozuno, E. Uchibe, and K. Doya. AISTATS, volume 89 of Proceedings of Machine Learning Research, page 2995-3003. PMLR, (2019)Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints.K. Kasaura, S. Miura, T. Kozuno, R. Yonetani, K. Hoshino, and Y. Hosoe. CoRR, (2023)Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning.T. Kozuno, D. Han, and K. Doya. CoRR, (2019)Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences.A. Chan, H. Silva, S. Lim, T. Kozuno, A. Mahmood, and M. White. CoRR, (2021)Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming.T. Kozuno, E. Uchibe, and K. Doya. CoRR, (2017)No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL.H. Wang, A. Sakhadeo, A. White, J. Bell, V. Liu, X. Zhao, P. Liu, T. Kozuno, A. Fyshe, and M. White. CoRR, (2022)Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.T. Kitamura, T. Kozuno, Y. Tang, N. Vieillard, M. Valko, W. Yang, J. Mei, P. Ménard, M. Azar, R. Munos and 5 other author(s). ICML, volume 202 of Proceedings of Machine Learning Research, page 17135-17175. PMLR, (2023)Confident Approximate Policy Iteration for Efficient Local Planning in $q^\pi$-realizable MDPs.G. Weisz, A. György, T. Kozuno, and C. Szepesvári. NeurIPS, (2022)Leverage the Average: an Analysis of Regularization in RL.N. Vieillard, T. Kozuno, B. Scherrer, O. Pietquin, R. Munos, and M. Geist. CoRR, (2020)Greedification Operators for Policy Optimization: Investigating Forward and Reverse KL Divergences.A. Chan, H. Silva, S. Lim, T. Kozuno, A. Mahmood, and M. White. J. Mach. Learn. Res., (2022)

BibSonomy

Disambiguation of "Kozuno, Tadashi"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm.

Please choose a person to relate this publication to

Tadashi Obara

Tadashi Otsuru

Tadashi Makabe

Tadashi Kito

Tadashi Yamagata

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Kozuno, Tadashi"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm.

Please choose a person to relate this publication to

Tadashi Obara

Tadashi Otsuru

Tadashi Makabe

Tadashi Kito

Tadashi Yamagata

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm.