From post

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications of persons with the same name

Will my Spoken Dialogue System be a Slow Learner? SIGDIAL Conference, pp. 97-101. The Association for Computer Linguistics, (2013)

The Emergence of the Shape Bias Results from Communicative Efficiency. CoNLL, pp. 607-623. Association for Computational Linguistics, (2021)

When does return-conditioned supervised learning work for offline reinforcement learning? NeurIPS, (2022)

Dr Jekyll & Mr Hyde: the strange case of off-policy policy updates. NeurIPS, pp. 24442-24454. (2021)

Decentralized Exploration in Multi-Armed Bandits. ICML, volume 97 of Proceedings of Machine Learning Research, pp. 1901-1909. PMLR, (2019)

Hybridisation of expertise and reinforcement learning in dialogue systems. INTERSPEECH, pp. 2479-2482. ISCA, (2009)

Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Weighting. ICLR, OpenReview.net, (2023)

Score-based Inverse Reinforcement Learning. AAMAS, pp. 457-465. ACM, (2016)

Learning dialogue dynamics with the method of moments. SLT, pp. 98-105. IEEE, (2016)

Safe Policy Improvement with Soft Baseline Bootstrapping. ECML/PKDD (3), volume 11908 of Lecture Notes in Computer Science, pp. 53-68. Springer, (2019)