Author of the publication

Learning online alignments with continuous rewards policy gradient.

, , , and . ICASSP, page 2801-2805. IEEE, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Learning online alignments with continuous rewards policy gradient., , , and . ICASSP, page 2801-2805. IEEE, (2017)Video Object Contour Tracking Using Improved Dual-Front Active Contour., , and . ICIC (2), volume 4114 of Lecture Notes in Computer Science, page 855-865. Springer, (2006)Reliability-Based and QoS-Aware Service Redundancy Backup Method in IoT-Based Smart Grid., , , , and . ICAIS (4), volume 11635 of Lecture Notes in Computer Science, page 588-598. Springer, (2019)Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling., , and . CoRR, (2019)On the Expressivity of Neural Networks for Deep Reinforcement Learning., , , , and . ICML, volume 119 of Proceedings of Machine Learning Research, page 2627-2637. PMLR, (2020)Safe Reinforcement Learning by Imagining the Near Future., , and . NeurIPS, page 13859-13869. (2021)Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees., , , , , and . ICLR (Poster), OpenReview.net, (2019)Towards Learning to Play Piano with Dexterous Hands and Touch., , , , and . IROS, page 10410-10416. IEEE, (2022)Computational identification of 48 potato microRNAs and their targets., , , , and . Comput. Biol. Chem., 33 (1): 84-93 (2009)Towards Learning to Play Piano with Dexterous Hands and Touch., , , , and . CoRR, (2021)