Inproceedings,

Delay-Tolerant Online Convex Optimization: Unified Analysis and Adaptive-Gradient Algorithms

P. Joulani, A. György, and Cs. Szepesvári.
AAAI-2016, pages 1744--1750 (November 2016).

Abstract

We present a unified, black-box-style method for developing and analyzing online convex optimization (OCO) algorithms for full-information online learning in delayed-feedback environments. Our new, simplified analysis enables us to substantially improve upon previous work and to solve a number of open problems from the literature. Specifically, we develop and analyze asynchronous AdaGrad-style algorithms from the Follow-the-Regularized-Leader (FTRL) and Mirror-Descent family that, unlike previous works, can handle projections and adapt both to the gradients and the delays, without relying on either strong convexity or smoothness of the objective function, or data sparsity. Our unified framework builds on a natural reduction from delayed-feedback to standard (non-delayed) online learning. This reduction, together with recent unification results for OCO algorithms, allows us to analyze the regret of generic FTRL and Mirror-Descent algorithms in the delayed-feedback setting in a unified manner using standard proof techniques. In addition, the reduction is exact and can be used to obtain both upper and lower bounds on the regret in the delayed-feedback setting.
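The reduction from delayed-feedback to standard online learning mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the class `OnlineGradientDescent`, the function `run_with_delays`, and the one-dimensional projected gradient-descent base learner are all hypothetical names and choices made for illustration. The sketch assumes a single base learner that is updated with each gradient at the moment it arrives, while every round plays the base learner's most recent prediction.

```python
from collections import defaultdict


class OnlineGradientDescent:
    """Hypothetical non-delayed base learner: projected OGD on [-radius, radius]."""

    def __init__(self, step_size=0.1, radius=1.0):
        self.x = 0.0
        self.step_size = step_size
        self.radius = radius

    def predict(self):
        return self.x

    def update(self, gradient):
        # Gradient step followed by projection onto the interval.
        x = self.x - self.step_size * gradient
        self.x = max(-self.radius, min(self.radius, x))


def run_with_delays(base, gradient_fns, delays):
    """Run the base learner when the feedback of round t arrives only at
    the end of round t + delays[t] (illustrative assumption).

    gradient_fns[t](x) returns the gradient of the round-t loss at x.
    Returns the list of points played in each round.
    """
    arrivals = defaultdict(list)  # arrival round -> gradients arriving then
    predictions = []
    for t, grad_fn in enumerate(gradient_fns):
        x_t = base.predict()  # play the base learner's latest prediction
        predictions.append(x_t)
        # The gradient is evaluated at the point actually played in round t.
        arrivals[t + delays[t]].append(grad_fn(x_t))
        # Pass every gradient that arrives at the end of this round to the base.
        for g in arrivals.pop(t, []):
            base.update(g)
    return predictions
```

With all delays set to zero the wrapper behaves exactly like the base learner; with positive delays the base learner simply sees the same kind of feedback later, which is the intuition behind treating the delayed problem as a non-delayed one.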

BibTeX key: JoGySz:AAAI16
entry type: inproceedings
booktitle: AAAI-2016
year: 2016
month: November
pages: 1744--1750
pdf: papers/AAAI16-stable-algs-linear.pdf
date-modified: 2016-08-01 15:25:29 +0000
date-added: 2015-12-02 00:24:21 +0000


Cite this publication

@inproceedings{JoGySz:AAAI16,
  abstract = {We present a unified, black-box-style method for developing and analyzing online convex optimization (OCO) algorithms for full-information online learning in delayed-feedback environments. Our new, simplified analysis enables us to substantially improve upon previous work and to solve a number of open problems from the literature. Specifically, we develop and analyze asynchronous AdaGrad-style algorithms from the Follow-the-Regularized-Leader (FTRL) and Mirror-Descent family that, unlike previous works, can handle projections and adapt both to the gradients and the delays, without relying on either strong convexity or smoothness of the objective function, or data sparsity. Our unified framework builds on a natural reduction from delayed-feedback to standard (non-delayed) online learning. This reduction, together with recent unification results for OCO algorithms, allows us to analyze the regret of generic FTRL and Mirror-Descent algorithms in the delayed-feedback setting in a unified manner using standard proof techniques. In addition, the reduction is exact and can be used to obtain both upper and lower bounds on the regret in the delayed-feedback setting.},
  added-at = {2020-03-17T03:03:01.000+0100},
  author = {Joulani, P. and Gy{\"o}rgy, A. and Szepesv{\'a}ri, {Cs}.},
  biburl = {https://www.bibsonomy.org/bibtex/2070b09a8a246d8857b605de6b5f38500/csaba},
  booktitle = {AAAI-2016},
  date-added = {2015-12-02 00:24:21 +0000},
  date-modified = {2016-08-01 15:25:29 +0000},
  interhash = {748ee34dfdbad684c24e0ada634de567},
  intrahash = {070b09a8a246d8857b605de6b5f38500},
  keywords = {adversarial convex delay, learning, online optimization, setting theory,},
  month = {November},
  pages = {1744--1750},
  pdf = {papers/AAAI16-stable-algs-linear.pdf},
  timestamp = {2020-03-17T03:03:01.000+0100},
  title = {Delay-Tolerant Online Convex Optimization: Unified Analysis and Adaptive-Gradient Algorithms},
  year = 2016
}
