Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.

 

Other publications of authors with the same name

An empirical analysis of value function-based and policy search reinforcement learning., and . AAMAS (2), page 749-756. IFAAMAS, (2009)

On Building Decision Trees from Large-scale Data in Applications of On-line Advertising., , and . CIKM, page 669-678. ACM, (2014)

PAC Subset Selection in Stochastic Multi-armed Bandits., , , and . ICML, icml.cc / Omnipress, (2012)

The Second NeurIPS Tournament of Reconnaissance Blind Chess., , , , , , , , , and 3 other author(s). NeurIPS (Competition and Demos), volume 176 of Proceedings of Machine Learning Research, page 53-65. PMLR, (2021)

Lower Bounds for Policy Iteration on Multi-action MDPs., , , , , and . CDC, page 1744-1749. IEEE, (2020)

UT Austin Villa 2011: a champion agent in the RoboCup 3D soccer simulation competition., , , , , , , , and . AAMAS, page 129-136. IFAAMAS, (2012)

Direction-changing fall control of humanoid robots: theory and experiments., , , , , and . Auton. Robots, 36 (3): 199-223 (2014)

Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory., and . CoRR, (2019)

Artificial Intelligence and Life in 2030: The One Hundred Year Study on Artificial Intelligence., , , , , , , , , and 7 other author(s). CoRR, (2022)

PAC Identification of a Bandit Arm Relative to a Reward Quantile., and . AAAI, page 1777-1783. AAAI Press, (2017)