Section: New Results
Adaptivity of averaged stochastic gradient descent to local strong convexity for logistic regression
Participant: Francis Bach.
In this work, we consider supervised learning problems such as logistic regression and study the stochastic gradient method with averaging, in the usual stochastic approximation setting where observations are used only once. We show that after N iterations, with a constant step size proportional to 1/(R²√N), where N is the number of observations and R is the maximum norm of the observations, the convergence rate is always of order O(1/√N), and improves to O(R²/(μN)), where μ is the lowest eigenvalue of the Hessian at the global optimum (when this eigenvalue is greater than R²/√N). Since μ does not need to be known in advance, this shows that averaged stochastic gradient is adaptive to unknown local strong convexity of the objective function. Our proof relies on the generalized self-concordance properties of the logistic loss and thus extends to all generalized linear models with uniformly bounded features.
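The scheme analysed above can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: a single pass over the data, a constant step size proportional to 1/(R²√N) (the constant 1/2 in the denominator is an arbitrary choice for this sketch), and the average of the iterates returned as the estimate.

```python
import numpy as np

def averaged_sgd_logistic(X, y, step=None):
    """Single-pass averaged SGD for logistic regression (illustrative sketch).

    Each observation is used exactly once; the step size is constant and
    proportional to 1/(R^2 sqrt(N)); the returned estimate is the running
    average of the iterates.
    """
    n, d = X.shape
    R2 = np.max(np.sum(X ** 2, axis=1))       # R^2: max squared norm of the observations
    if step is None:
        step = 1.0 / (2.0 * R2 * np.sqrt(n))  # constant step size, proportional to 1/(R^2 sqrt(N))
    w = np.zeros(d)
    w_bar = np.zeros(d)
    for t in range(n):
        x_t, y_t = X[t], y[t]                 # labels y_t in {-1, +1}
        # gradient of the logistic loss log(1 + exp(-y <w, x>)) at the current iterate
        g = -y_t * x_t / (1.0 + np.exp(y_t * (w @ x_t)))
        w -= step * g
        w_bar += (w - w_bar) / (t + 1)        # online average of the iterates
    return w_bar
```

On synthetic linearly separable data, the averaged iterate attains a lower empirical logistic loss than the zero vector after a single pass, even though the step size requires no knowledge of the local strong-convexity constant μ.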