Jump to Content

Learning Linear-Quadratic Regulators Efficiently with only √ T Regret

ICML (2019) (to appear)
Google Scholar

Abstract

We present the first computationally-efficient algorithm with $\tO(\sqrt{T})$ regret for learning in Linear Quadratic Control systems with unknown linear dynamics and known quadratic costs.

Research Areas