We study optimal regret bounds for control in linear dynamical systems under
adversarially changing strongly convex cost functions, given the knowledge of
transition dynamics. This includes several well studied and fundamental
frameworks such as the Kalman filter and the linear quadratic regulator. State
of the art methods achieve regret which scales as $O(\sqrt{T})$, where $T$ is
the time horizon.
We show that the optimal regret in this setting can be significantly smaller,
scaling as $O(\text{poly}(\log T))$. This regret bound is achieved by two
different efficient iterative methods, online gradient descent and online
natural gradient.
Description
[1909.05062] Logarithmic Regret for Online Control
%0 Journal Article
%1 agarwal2019logarithmic
%A Agarwal, Naman
%A Hazan, Elad
%A Singh, Karan
%D 2019
%K bounds convergence optimization readings
%T Logarithmic Regret for Online Control
%U http://arxiv.org/abs/1909.05062
%X We study optimal regret bounds for control in linear dynamical systems under
adversarially changing strongly convex cost functions, given the knowledge of
transition dynamics. This includes several well studied and fundamental
frameworks such as the Kalman filter and the linear quadratic regulator. State
of the art methods achieve regret which scales as $O(\sqrt{T})$, where $T$ is
the time horizon.
We show that the optimal regret in this setting can be significantly smaller,
scaling as $O(\text{poly}(\log T))$. This regret bound is achieved by two
different efficient iterative methods, online gradient descent and online
natural gradient.
@article{agarwal2019logarithmic,
  abstract      = {We study optimal regret bounds for control in linear dynamical systems under
adversarially changing strongly convex cost functions, given the knowledge of
transition dynamics. This includes several well studied and fundamental
frameworks such as the Kalman filter and the linear quadratic regulator. State
of the art methods achieve regret which scales as $O(\sqrt{T})$, where $T$ is
the time horizon.
We show that the optimal regret in this setting can be significantly smaller,
scaling as $O(\text{poly}(\log T))$. This regret bound is achieved by two
different efficient iterative methods, online gradient descent and online
natural gradient.},
  added-at      = {2019-09-20T13:49:01.000+0200},
  author        = {Agarwal, Naman and Hazan, Elad and Singh, Karan},
  biburl        = {https://www.bibsonomy.org/bibtex/24ea9b19a65fc29ba0591860c8565d1f4/kirk86},
  description   = {[1909.05062] Logarithmic Regret for Online Control},
  archiveprefix = {arXiv},
  eprint        = {1909.05062},
  doi           = {10.48550/arXiv.1909.05062},
  interhash     = {21360c83df50acfe92e8a26d4394217f},
  intrahash     = {4ea9b19a65fc29ba0591860c8565d1f4},
  keywords      = {bounds convergence optimization readings},
  note          = {cite arxiv:1909.05062},
  timestamp     = {2019-11-04T11:39:12.000+0100},
  title         = {Logarithmic Regret for Online Control},
  url           = {http://arxiv.org/abs/1909.05062},
  year          = 2019
}