SIERRA - 2016 - Rapport annuel d'activité

SIERRA

SIERRA - 2016

Project-Team Sierra

Members

Overall Objectives

Statement

Research Program

Application Domains

Application Domains

New Software and Platforms

New Results

Bilateral Contracts and Grants with Industry

Partnerships and Cooperations

Dissemination

Bibliography

Previous |

Home | Next next

Section: New Results

Harder, Better, Faster, Stronger Convergence Rates for Least-Squares Regression

In [20], we consider the optimization of a quadratic objective function whose gradients are only accessible through a stochastic oracle that returns the gradient at any given point plus a zero-mean finite variance random error. We present the first algorithm that achieves jointly the optimal prediction error rates for least-squares regression, both in terms of forgetting of initial conditions in $O (1 / n^{2})$ , and in terms of dependence on the noise and dimension $d$ of the problem, as $O (d / n)$ . Our new algorithm is based on averaged accelerated regularized gradient descent, and may also be analyzed through finer assumptions on initial conditions and the Hessian matrix, leading to dimension-free quantities that may still be small while the " optimal " terms above are large. In order to characterize the tightness of these new bounds, we consider an application to non-parametric regression and use the known lower bounds on the statistical performance (without computational limits), which happen to match our bounds obtained from a single pass on the data and thus show optimality of our algorithm in a wide variety of particular trade-offs between bias and variance.

Previous |

Home | Next next