Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Planning in POMDPs Using Multiplicity Automata, , and . CoRR, (2012)Convergence time to Nash equilibrium in load balancing., , and . ACM Trans. Algorithms, 3 (3): 32 (2007)Regret to the best vs. regret to the average., , , and . Mach. Learn., 72 (1-2): 21-37 (2008)Sponsored Search with Contexts., , and . WINE, volume 4858 of Lecture Notes in Computer Science, page 312-317. Springer, (2007)Convergence Time to Nash Equilibria., , and . ICALP, volume 2719 of Lecture Notes in Computer Science, page 502-513. Springer, (2003)A network formation game for bipartite exchange economies., , and . SODA, page 697-706. SIAM, (2007)PAC Bounds for Multi-armed Bandit and Markov Decision Processes., , and . COLT, volume 2375 of Lecture Notes in Computer Science, page 255-270. Springer, (2002)Reinforcement Learning in POMDPs Without Resets., , and . IJCAI, page 690-695. Professional Book Center, (2005)Approximate Equivalence of Markov Decision Processes., and . COLT, volume 2777 of Lecture Notes in Computer Science, page 581-594. Springer, (2003)Learning Rates for Q-Learning., and . COLT/EuroCOLT, volume 2111 of Lecture Notes in Computer Science, page 589-604. Springer, (2001)