
Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently.

ICML, volume 119 of Proceedings of Machine Learning Research, pp. 1328-1337. PMLR, (2020)


Other publications by persons with the same name

Learning Approximately Optimal Contracts. SAGT, volume 13584 of Lecture Notes in Computer Science, pp. 331-346. Springer, (2022)
Following the Perturbed Leader for Online Structured Learning. ICML, volume 37 of JMLR Workshop and Conference Proceedings, pp. 1034-1042. JMLR.org, (2015)
Average reward reinforcement learning with unknown mixing times. CoRR, (2019)
Apprenticeship Learning via Frank-Wolfe. AAAI, pp. 6720-6728. AAAI Press, (2020)
Efficient Online Linear Control with Stochastic Convex Costs and Unknown Dynamics. COLT, volume 178 of Proceedings of Machine Learning Research, pp. 3589-3604. PMLR, (2022)
The Real Price of Bandit Information in Multiclass Classification. CoRR, (2024)
Incentivizing the Dynamic Workforce: Learning Contracts in the Gig-Economy. CoRR, (2018)
Faster Optimal Planning with Partial-Order Pruning. ICAPS, AAAI, (2013)
Unknown mixing times in apprenticeship and reinforcement learning. UAI, volume 124 of Proceedings of Machine Learning Research, pp. 430-439. AUAI Press, (2020)
Learning to Screen. NeurIPS, pp. 8612-8621. (2019)