Author of the publication

BanditSum: Extractive Summarization as a Contextual Bandit.

, , , , and . EMNLP, page 3739-3748. Association for Computational Linguistics, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Uncertainty Aware Learning from Demonstrations in Multiple Contexts using Bayesian Neural Networks., , , , and . ICRA, page 768-774. IEEE, (2019)Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales., , , and . CoRR, (2019)Learning to predict phases of manipulation tasks as hidden states., , , and . ICRA, page 4009-4014. IEEE, (2014)Probabilistic interactive segmentation for anthropomorphic robots in cluttered environments., , and . Humanoids, page 169-176. IEEE, (2013)Combining Reward Information from Multiple Sources., , and . CoRR, (2021)MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning., , , , and . NeurIPS, (2020)Stable reinforcement learning with autoencoders for tactile and visual data., , , , and . IROS, page 3928-3934. IEEE, (2016)Social Navigation with Human Empowerment Driven Deep Reinforcement Learning., , and . ICANN (2), volume 12397 of Lecture Notes in Computer Science, page 395-407. Springer, (2020)Addressing function approximation error in actor-critic methods, , and . arXiv preprint arXiv:1802.09477, (2018)Reusable Options through Gradient-based Meta Learning., and . CoRR, (2022)