копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Minimax Regret for Bandit Convex Optimisation of Ridge Functions

T. Lattimore. (2021)cite arxiv:2106.00444Comment: Correcting an (instructive) error that leads to a weaker result.

Аннотация

We analyse adversarial bandit convex optimisation with an adversary that is restricted to playing functions of the form $f_t(x) = g_t(x, þeta\rangle)$ for convex $g_t : R R$ and unknown $þeta R^d$ that is homogeneous over time. We provide a short information-theoretic proof that the minimax regret is at most $O(d n łog(n diam(K)))$ where $n$ is the number of interactions, $d$ the dimension and $diam(K)$ is the diameter of the constraint set.

Описание

[2106.00444] Minimax Regret for Bandit Convex Optimisation of Ridge Functions

Линки и ресурсы

ключ BibTeX: lattimore2021minimax
тип записи: article
год: 2021
url: http://arxiv.org/abs/2106.00444
Примечание: cite arxiv:2106.00444Comment: Correcting an (instructive) error that leads to a weaker result

тэги

@kirk86- тэги данного пользователя выделены

Цитировать эту публикацию

искать в

Метаданные

Последнее изменение 3 лет назад
Создан 3 лет назад

Комментарии и рецензии
(0)

Комментарии, или рецензии отсутствуют. Вы можете их написать!