Author of the publication

An Off-Policy Natural Policy Gradient Method for a Partial Observable Markov Decision Process.

, , and . ICANN (2), volume 3697 of Lecture Notes in Computer Science, page 431-436. Springer, (2005)


Other publications of authors with the same name

Hierarchical lossless audio coding in terms of sampling rate and amplitude resolution., , , , and . ICASSP (5), page 409-412. IEEE, (2003)

Establishment of screening system toward discovery of kinase inhibitors using label-free on-chip phosphorylation assays., , , , , , , and . Biosyst., 97 (3): 179-185 (2009)

An Off-Policy Natural Policy Gradient Method for a Partial Observable Markov Decision Process., , and . ICANN (2), volume 3697 of Lecture Notes in Computer Science, page 431-436. Springer, (2005)

A real-time IMT-2000 audio transmission system., , , and . IEEE Trans. Consumer Electronics, 47 (4): 860-866 (2001)

Reinforcement learning for a biped robot based on a CPG-actor-critic method., , , and . Neural Networks, 20 (6): 723-735 (2007)

Fast encoding algorithms for MPEG-4 TwinVQ audio tool., , , , and . ICASSP, page 3253-3256. IEEE, (2001)

G.711.1: A wideband extension to ITU-T G.711., , , , , , , , , and 5 other author(s). EUSIPCO, page 1-5. IEEE, (2008)

Japanese large-vocabulary continuous-speech recognition using a newspaper corpus and broadcast news., , , , , , and . Speech Commun., 28 (2): 155-166 (1999)

Off-Policy Natural Policy Gradient Method for a Biped Walking Using a CPG Controller., , , , and . J. Robotics Mechatronics, 17 (6): 636-644 (2005)

An Additive Reinforcement Learning., and . ICANN (1), volume 5768 of Lecture Notes in Computer Science, page 608-617. Springer, (2009)