We study an idealised sequential resource allocation problem. In each time step the learner
chooses an allocation of several resource types between a number of tasks. Assigning more
resources to a task increases the probability that it is completed. The problem is challenging
because the alignment of the tasks to the resource types is unknown and the feedback is noisy.
Our main contribution is the new setting and an algorithm with nearly-optimal
regret analysis. Along the way we draw connections to the problem of minimising regret
for stochastic linear bandits with heteroscedastic noise. We also present some new results for stochastic linear
bandits on the hypercube that significantly improve on existing work, especially in the sparse case.
%0 Conference Paper
%1 LaCrSze15
%A Lattimore, T.
%A Crammer, K.
%A Szepesvári, Cs.
%B NIPS
%D 2015
%K allocation, bandits, information, learning, linear, monitoring, online, partial, resource, stochastic, theory
%P 964--972
%T Linear Multi-Resource Allocation with Semi-Bandit Feedback
%X We study an idealised sequential resource allocation problem. In each time step the learner
chooses an allocation of several resource types between a number of tasks. Assigning more
resources to a task increases the probability that it is completed. The problem is challenging
because the alignment of the tasks to the resource types is unknown and the feedback is noisy.
Our main contribution is the new setting and an algorithm with nearly-optimal
regret analysis. Along the way we draw connections to the problem of minimising regret
for stochastic linear bandits with heteroscedastic noise. We also present some new results for stochastic linear
bandits on the hypercube that significantly improve on existing work, especially in the sparse case.
@inproceedings{LaCrSze15,
  abstract      = {We study an idealised sequential resource allocation problem. In each time step the learner
chooses an allocation of several resource types between a number of tasks. Assigning more
resources to a task increases the probability that it is completed. The problem is challenging
because the alignment of the tasks to the resource types is unknown and the feedback is noisy.
Our main contribution is the new setting and an algorithm with nearly-optimal
regret analysis. Along the way we draw connections to the problem of minimising regret
for stochastic linear bandits with heteroscedastic noise. We also present some new results for stochastic linear
bandits on the hypercube that significantly improve on existing work, especially in the sparse case.},
  added-at      = {2020-03-17T03:03:01.000+0100},
  author        = {Lattimore, Tor and Crammer, Koby and Szepesv{\'a}ri, Csaba},
  biburl        = {https://www.bibsonomy.org/bibtex/23c296df603b0d160d92a934c4d2d786c/csaba},
  booktitle     = {Advances in Neural Information Processing Systems 28 ({NIPS})},
  date-added    = {2015-12-02 01:31:55 +0000},
  date-modified = {2016-08-01 03:14:20 +0000},
  interhash     = {92f8d298a6ab4513a62caba907d720c8},
  intrahash     = {3c296df603b0d160d92a934c4d2d786c},
  keywords      = {allocation, bandits, information, learning, linear, monitoring, online, partial, resource, stochastic, theory},
  month         = dec,
  pages         = {964--972},
  pdf           = {papers/NIPS15-mr-bandit.pdf},
  timestamp     = {2020-03-17T03:03:01.000+0100},
  title         = {Linear Multi-Resource Allocation with Semi-Bandit Feedback},
  year          = {2015},
}