Author of the publication

Proposal of an Action Selection Strategy with Expected Failure Probability and Its Evaluation in Multi-agent Reinforcement Learning.

, , and . EUMAS/AT, volume 10207 of Lecture Notes in Computer Science, page 172-186. Springer, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Proposal of Detour Path Suppression Method in PS Reinforcement Learning and Its Application to Altruistic Multi-agent Environment., , and . PRIMA, volume 11224 of Lecture Notes in Computer Science, page 638-645. Springer, (2018)Proposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning., and . EWRL, volume 7188 of Lecture Notes in Computer Science, page 333-344. Springer, (2011)Exploitation-Oriented Learning PS-r#., and . J. Adv. Comput. Intell. Intell. Informatics, 13 (6): 624-630 (2009)Proposal and Evaluation of an Action Selection Strategy with Expected Failure Probability in Multi-agent Learning., , and . ICA, page 127-130. IEEE, (2016)Application of Deep Reinforcement Learning to Decision-Making System based on Consciousness.. BICA*AI, volume 190 of Procedia Computer Science, page 631-636. Elsevier, (2020)Multi User Learning Agent on the Distribution of MDPs., , and . RO-MAN, page 698-703. IEEE, (2006)Reinforcement learning for penalty avoiding policy making., and . SMC, page 206-211. IEEE, (2000)Proposal of Exploitation-Oriented Learning PS-r#., and . IDEAL, volume 5326 of Lecture Notes in Computer Science, page 1-8. Springer, (2008)Proposal of PSwithEFP and its Evaluation in Multi-Agent Reinforcement Learning., , and . J. Adv. Comput. Intell. Intell. Informatics, 21 (5): 930-938 (2017)Reinforcement Learning for Penalty Avoidance in Continuous State Spaces., and . J. Adv. Comput. Intell. Intell. Informatics, 11 (6): 668-676 (2007)