SIERRA - 2015 - Annual activity report

SIERRA

SIERRA - 2015

Project-Team Sierra

Members

Overall Objectives

Statement

Research Program

Application Domains

Application Domains

Highlights of the Year

New Software and Platforms

DICA: Moment Matching for Latent Dirichlet Allocation (LDA) and Discrete Independent Component Analysis (DICA)
LinearFW: Implementation of linearly convergent versions of Frank-Wolfe
CNN-Head-Detection: Context-aware CNNs for person head detection

New Results

Bilateral Contracts and Grants with Industry

Partnerships and Cooperations

Dissemination

Bibliography

Publications of the year

Previous |

Home | Next next

Section: New Results

Batched Bandit Problems

Participant : Vianney Perchet [correspondent] .

Collaboration with Philippe Rigollet, Sylvain Chassang and Erik Snowberg.

Motivated by practical applications, chiefly clinical trials, we study in [39] the regret achievable for stochastic bandits under the constraint that the employed policy must split trials into a small number of batches. Our results show that a very small number of batches gives close to minimax optimal regret bounds. As a byproduct, we derive optimal policies with low switching cost for stochastic bandits.

Previous |

Home | Next next