POLARIS

POLARIS - 2025

2025Activity reportProject-TeamPOLARIS

RNSR:‌ 201622036M

Research center Inria Centre at Université Grenoble‌ Alpes
In partnership with:Université de Grenoble Alpes,‌ CNRS
Team name: Performance analysis and Optimization of‌ LARge Infrastructures and Systems
In collaboration with:Laboratoire‌ d'Informatique de Grenoble (LIG)

Creation of the Project-Team:‌ 2018 January 01

Each year, Inria research teams‌ publish an Activity Report presenting their work and‌ results over the reporting period. These reports follow‌ a common structure, with some optional sections depending‌ on the specific team. They typically begin by‌ outlining the overall objectives and research programme, including‌ the main research themes, goals, and methodological approaches.‌ They also describe the application domains targeted by‌ the team, highlighting the scientific or societal contexts‌ in which their work is situated.

The reports‌ then present the highlights of the year, covering‌ major scientific achievements, software developments, or teaching contributions.‌ When relevant, they include sections on software, platforms,‌ and open data, detailing the tools developed and‌ how they are shared. A substantial part is‌ dedicated to new results, where scientific contributions are‌ described in detail, often with subsections specifying participants‌ and associated keywords.

Finally, the Activity Report addresses‌ funding, contracts, partnerships, and collaborations at various levels,‌ from industrial agreements to international cooperations. It also‌ covers dissemination and teaching activities, such as participation‌ in scientific events, outreach, and supervision. The document‌ concludes with a presentation of scientific production, including‌ major publications and those produced during the year.‌

Keywords

Computer Science and Digital Science

A1.2. Networks‌
A1.3.5. Cloud
A1.3.6. Fog, Edge
A1.6. Green Computing‌
A3.4. Machine learning and statistics
A3.5.2. Recommendation systems‌
A5.2. Data visualization
A6. Modeling, simulation and control
A6.2.3. Probabilistic methods
A6.2.4.‌ Statistical methods
A6.2.6. Optimization‌
A6.2.7. HPC for machine‌‌ learning
A8.2. Optimization
A8.9. Performance evaluation
A8.11. Game‌ Theory
A9.2. Machine learning‌
A9.9. Distributed AI, Multi-agent‌‌

1 Team members, visitors,‌ external collaborators

Research Scientists‌‌

Arnaud Legrand [Team leader, CNRS,‌ Senior Researcher, HDR‌]
Jonatha Anselmi [‌‌INRIA, Researcher, until Jul 2025]‌
Dorian Baudry [INRIA‌, Researcher, from‌‌ Jul 2025 until Jul 2025]
Mathieu Besancon‌ [INRIA, Researcher‌, until Jul 2025‌‌]
Nicolas Gast [INRIA, Researcher,‌ until Jul 2025,‌ HDR]
Bruno Gaujal‌‌ [INRIA, Senior Researcher, until Jul‌ 2025, HDR]‌
Panayotis Mertikopoulos [CNRS‌‌, Senior Researcher, until Jul 2025,‌ HDR]
Bary Pradelski‌ [CNRS, Researcher‌‌, until Jul 2025]

Faculty Members

Romain‌ Couillet [UGA,‌ Professor, until Mar‌‌ 2025]
Vincent Danjean [UGA, Associate‌ Professor, until Feb‌ 2025]
Kevin Marquet‌‌ [UGA, Associate Professor]
Florence Perronnin‌ [UGA, Associate‌ Professor]
Jean-Marc Vincent‌‌ [UGA, Associate Professor, until Jul‌ 2025]
Philippe Waille‌ [UGA, Associate‌‌ Professor, until Feb 2025]

Post-Doctoral Fellows‌

Mete Ahunbay [CNRS‌, Post-Doctoral Fellow,‌‌ from Apr 2025]
Victor Boone [CNRS‌, Post-Doctoral Fellow,‌ until Jun 2025]‌‌
Dheeraj Narasimha [INRIA, Post-Doctoral Fellow,‌ until Jun 2025]‌

PhD Students

Rim Alhajal‌‌ [CNRS, until Jul 2025]
Helene‌ Arvis [EDF,‌ CIFRE]
Achille Baucher‌‌ [UGA, until Sep 2025]
Pierre-Louis‌ Cauvin [CNRS,‌ until Jul 2025]‌‌
Romain Cravic [INRIA, until Jul 2025‌]
Valentin Girard [‌UGA]
Galaad Langlois‌‌ [ENS DE LYON, from Sep 2025‌]
Hugo Lebeau [‌Univ. Grenoble Alpes,‌‌ until Jan 2025]
Davide Legacci [UGA‌, until Oct 2025‌]
Hubert Villuendas [‌‌UGA, until Jul 2025]

Interns and‌ Apprentices

Samuel Bounan [‌INRIA, Intern,‌‌ from Mar 2025 until Jul 2025]
Samuel‌ Bounan [INRIA,‌ Intern, until Feb‌‌ 2025]
Joel Charles-Rebuffe [ENS DE LYON‌, Intern, from‌ Apr 2025 until Jul‌‌ 2025]
Leandre Cheruy [UGA, from‌ Apr 2025 until Jul‌ 2025]
Hamidou Diallo‌‌ [INRIA, Intern, from Jun 2025‌ until Jul 2025]‌
Hamidou Diallo [INRIA‌‌, Intern, from Feb 2025 until May‌ 2025]
Karl Gottlieb‌ [UGA, from‌‌ Feb 2025 until Aug 2025]
Djamel Rassem‌ Lamouri [INRIA,‌ Intern, from Feb‌‌ 2025 until Jul 2025‌]
Galaad Langlois [ENS DE LYON,‌ Intern, until Mar 2025]
Adrien Obrecht‌ [ENS DE LYON, Intern, until‌ Jul 2025]

Administrative Assistants

Luce Coelho [‌INRIA]
Annie Simon [INRIA]

2‌ Overall objectives

Note: The POLARIS project started‌ in January 2016 and has ended in December‌ 2025. This 10 year project has been an‌ amazing human and scientific journey. Over this period,‌ this group has enriched with many new members,‌ both full-time researchers (6), postdocs & PhD students‌ (60), and interns. The scope of scientific interests‌ in POLARIS has continued to broaden (from HPC‌ to learning, including aspects of privacy, statistics, energy‌ optimization, combinatorial optimization, environmental and sustainability concerns, etc.),‌ leading to a restructuring into two new Inria‌ teams: GHOST (which has been created on August‌ 1st 2025) and ADN (which has been created‌ on January 1st 2026). Some members of POLARIS‌ have also joined other teams such as CORSE‌ (compilers), DATAMOVE (HPC), and DRAKKAR (networking and security).‌ For convenience, the activity the members of the‌ GHOST project until the end of 2025 (August-December‌ 2025) has been reattached to the 2025 POLARIS‌ activity report.

The GHOST (Games, Mathematical Optimization, and‌ Stochastic Systems) targets the study of random dynamical‌ systems, which are ubiquitous in many fields of‌ computer science and applied mathematics: In machine learning‌ for example, they are used to analyze learning‌ algorithms and provide concrete convergence and generalization guarantees;‌ in queuing theory, they are used to model,‌ characterize, and optimize the performance of distributed systems;‌ in game theory, they model the behavior of‌ autonomous agents that are competing or cooperating to‌ improve their performance; etc. GHOST will work at‌ the interface of stochastic modeling, dynamical systems, online‌ learning, game theory and optimization, and their aim‌ will be to (a) design mathematical and algorithmic‌ methods for studying the dynamics of complex systems‌ in the presence of randomness and uncertainty; and‌ (b) to use these methods to optimize performance,‌ design new learning algorithms and optimize decision-making in‌ all its aspects. The applications of GHOST mostly‌ revolve around complex resource allocation problems such as‌ job allocation in distributed computing resources, improving the‌ methods used to train complex machine learning models,‌ and the areas of energy management and scheduling‌ in electrical networks.

The research of the ADN‌ (Anthropocene, Degrowth, and ICT) builds on the observation‌ that the environmental impacts of human activities have‌ increased so dramatically since the beginning of the‌ Industrial Revolution that they now represent a major‌ driver of the Earth system, prompting the use‌ of the term Anthropocene to describe this new‌ epoch. Which role do Information and Communication Technologies‌ (ICT) play in this, and how could they‌ most ignificantly contribute to mitigation and adaptation strategies‌ for tackling these environmental impacts? The ADN project‌ team seeks to address this question by rethinking‌ ICT through a strong sustainability via degrowth approach. For this, the members‌ of this project will‌ (1) study the place‌‌ and contribution of digital technologies in prospective scenarios,‌ (2) taking into account‌ their political nature through‌‌ a value in design approach, and (3) with‌ a focus on key‌ software technologies and infrastructures‌‌ as commons and dedigitization.

2.1 Context

Large distributed‌ infrastructures are rampant in‌ our society. Numerical simulations‌‌ form the basis of computational sciences and high‌ performance computing infrastructures have‌ become scientific instruments with‌‌ similar roles as those of test tubes or‌ telescopes. Cloud infrastructures are‌ used by companies in‌‌ such an intense way that even the shortest‌ outage quickly incurs the‌ loss of several millions‌‌ of dollars. But every citizen also relies on‌ (and interacts with) such‌ infrastructures via complex wireless‌‌ mobile embedded devices whose nature is constantly evolving.‌ In this way, the‌ advent of digital miniaturization‌‌ and interconnection has enabled our homes, power stations,‌ cars and bikes to‌ evolve into smart grids‌‌ and smart transportation systems that should be optimized‌ to fulfill societal expectations.‌

Our dependence and intense‌‌ usage of such gigantic systems obviously leads to‌ very high expectations in‌ terms of performance. Indeed,‌‌ we strive for low-cost and energy-efficient systems that‌ seamlessly adapt to changing‌ environments that can only‌‌ be accessed through uncertain measurements. Such digital systems‌ also have to take‌ into account both the‌‌ users' profile and expectations to efficiently and fairly‌ share resources in an‌ online way. Analyzing, designing‌‌ and provisioning such systems has thus become a‌ real challenge.

Such systems‌ are characterized by their‌‌ ever-growing size, intrinsic heterogeneity and distributedness,‌ user-driven requirements, and an‌ unpredictable variability that renders‌‌ them essentially stochastic. In such contexts, many‌ of the former design‌ and analysis hypotheses (homogeneity,‌‌ limited hierarchy, omniscient view, optimization carried out by‌ a single entity, open-loop‌ optimization, user outside of‌‌ the picture) have become obsolete, which calls for‌ radically new approaches. Properly‌ studying such systems requires‌‌ a drastic rethinking of fundamental aspects regarding the‌ system's observation (measure, trace,‌ methodology, design of experiments),‌‌ analysis (modeling, simulation, trace analysis and visualization), and‌ optimization (distributed, online, stochastic).‌

2.2 Objectives

The goal‌‌ of the POLARIS project is to contribute to‌ the understanding of the‌ performance of very large‌‌ scale distributed systems by applying ideas from diverse‌ research fields and application‌ domains. We believe that‌‌ studying all these different aspects at once without‌ restricting to specific systems‌ is the key to‌‌ push forward our understanding of such challenges and‌ to propose innovative solutions.‌ This is why we‌‌ intend to investigate problems arising from application domains‌ as varied as large‌ computing systems, wireless networks,‌‌ smart grids and transportation systems.

The members of‌ the POLARIS project cover‌ a very wide spectrum‌‌ of expertise in performance evaluation and models, distributed‌ optimization, and analysis of‌ HPC middleware. Specifically, POLARIS'‌‌ members have worked extensively on:

Experiment design:
Experimental‌ methodology, measuring/monitoring/tracing tools, experiment‌ control, design of experiments,‌‌ and reproducible research, especially‌ in the context of large computing infrastructures (such‌ as computing grids, HPC, volunteer computing and embedded‌ systems).
Trace Analysis:
Parallel application visualization (paje, triva/viva,‌ framesoc/ocelotl, ...), characterization of failures in large distributed‌ systems, visualization and analysis for geographical information systems,‌ spatio-temporal analysis of media events in RSS flows‌ from newspapers, and others.
Modeling and Simulation:
Emulation,‌ discrete event simulation, perfect sampling, Markov chains, Monte‌ Carlo methods, and others.
Optimization:
Stochastic approximation, mean‌ field limits, game theory, discrete and continuous optimization,‌ learning and information theory.

2.3 Contribution to AI/Learning‌

AI and Learning is everywhere now. Let us‌ clarify how our research activities are positioned with‌ respect to this trend.

A first line of‌ research in POLARIS is devoted to the use‌ of statistical learning techniques (Bayesian inference) to model‌ the expected performance of distributed systems, to build‌ aggregated performance views, to feed simulators of such‌ systems, or to detect anomalous behaviours.

In a‌ distributed context it is also essential to design‌ systems that can seamlessly adapt to the workload‌ and to the evolving behaviour of its components‌ (users, resources, network). Obtaining faithful information on the‌ dynamic of the system can be particularly difficult,‌ which is why it is generally more efficient‌ to design systems that dynamically learn the best‌ actions to play through trial and errors. A‌ key characteristic of the work in the POLARIS‌ project is to leverage regularly game-theoretic modeling to‌ handle situations where the resources or the decision‌ is distributed among several agents or even situations‌ where a centralised decision maker has to adapt‌ to strategic users.

An important research direction in‌ POLARIS is thus centered on reinforcement learning (Multi-armed‌ bandits, Q-learning, online learning) and active learning in‌ environments with one or several of the following‌ features:

Feedback is limited (e.g., gradient or even‌ stochastic gradients are not available, which requires for‌ example to resort to stochastic approximations);
Multi-agent setting‌ where each agent learns, possibly not in a‌ synchronised way (i.e., decisions may be taken asynchronously,‌ which raises convergence issues);
Delayed feedback (avoid oscillations‌ and quantify convergence degradation);
Non stochastic (e.g., adversarial)‌ or non stationary workloads (e.g., in presence of‌ shocks);
Systems composed of a very large number‌ of entities, that we study through mean field‌ approximation (mean-field games and mean field control).

As‌ a side effect, many of the gained insights‌ can often be used to dramatically improve the‌ scalability and the performance of the implementation of‌ more standard machine or deep learning techniques over‌ supercomputers.

The POLARIS members are thus particularly interested‌ in the design and analysis of adaptive learning‌ algorithms for multi-agent systems, i.e. agents that seek‌ to progressively improve their performance on a specific‌ task. The resulting algorithms should not only learn‌ an efficient (Nash) equilibrium but they should also‌ be capable of doing so quickly (low regret),‌ even when facing the difficulties associated to a‌ distributed context (lack of coordination, uncertain world, information delay, limited feedback, …)‌

In the rest of‌ this document, we describe‌‌ in detail our new results in the above‌ areas.

3 Research program‌

3.1 Performance Evaluation

Participants:‌‌ Jonatha Anselmi, Vincent Danjean, Nicolas Gast‌, Guillaume Huard,‌ Arnaud Legrand, Florence‌‌ Perronnin, Jean-Marc Vincent.

Project-team positioning

Evaluating‌ the scalability, robustness, energy‌ consumption and performance of‌‌ large infrastructures such as exascale platforms and clouds‌ raises severe methodological challenges.‌ The complexity of such‌‌ platforms mandates empirical evaluation but direct experimentation via‌ an application deployment on‌ a real-world testbed is‌‌ often limited by the few platforms available at‌ hand and is even‌ sometimes impossible (cost, access,‌‌ early stages of the infrastructure design, etc.). Furthermore,‌ such experiments are costly,‌ difficult to control and‌‌ therefore difficult to reproduce. Although many of these‌ digital systems have been‌ built by human, they‌‌ have reached such a complexity level that we‌ are no longer able‌ to study them like‌‌ artificial systems and have to deal with the‌ same kind of experimental‌ issues as natural sciences.‌‌ The development of a sound experimental methodology for‌ the evaluation of resource‌ management solutions is among‌‌ the most important ways to cope with the‌ growing complexity of computing‌ environments. Although computing environments‌‌ come with their own specific challenges, we believe‌ such general observation problems‌ should be addressed by‌‌ borrowing good practices and techniques developed in many‌ other domains of science,‌ in particular (1) Predictive‌‌ Simulation, (2) Trace Analysis and Visualization, and (3)‌ the Design of Experiments.‌

Scientific achievements

Large computing‌‌ systems are particularly complex to understand because of‌ the interplay between their‌ discrete nature (originating from‌‌ deterministic computer programs) and their stochastic nature (emerging‌ from the physical world,‌ long distance interactions, and‌‌ complex hardware and software stacks). A first line‌ of research in POLARIS‌ is devoted to the‌‌ design of relatively simple statistical models of key‌ components of distributed systems‌ and their exploitation to‌‌ feed simulators of such systems, to build aggregated‌ performance views, and to‌ detect anomalous behaviors.

Predictive‌‌ Simulation

Unlike direct experimentation via an application deployment‌ on a real-world testbed,‌ simulation enables fully repeatable‌‌ and configurable experiments that can often be conducted‌ quickly for arbitrary hypothetical‌ scenarios. In spite of‌‌ these promises, current simulation practice is often not‌ conducive to obtaining scientifically‌ sound results. To date,‌‌ most simulation results in the parallel and distributed‌ computing literature are obtained‌ with simulators that are‌‌ ad hoc, unavailable, undocumented, and/or no longer maintained.‌ As a result, most‌ published simulation results build‌‌ on throw-away (short-lived and non validated) simulators that‌ are specifically designed for‌ a particular study, which‌‌ prevents other researchers from building upon it. There‌ is thus a strong‌ need for recognized simulation‌‌ frameworks by which simulation results can be reproduced,‌ further analyzed and improved.‌

Many simulators of MPI‌‌ applications have been developed by renowned HPC groups‌ (e.g., at SDSC 112‌, BSC 45,‌‌ UIUC 120, Sandia‌ Nat. Lab. 118, ORNL 46 or ETH‌ Zürich 81) but most of them build‌ on restrictive network and application modeling assumptions that‌ generally prevent to faithfully predict execution times, which‌ limits the use of simulation to indication of‌ gross trends at best.

The SimGrid simulation toolkit,‌ whose development started more than 20 years ago‌ in UCSD, is a renowned project which gathers‌ more than 1,700 citations and has supported the‌ research of at least 550 articles. The most‌ important contribution of POLARIS to this project in‌ the last years has been to improve the‌ quality of SimGrid to the point where it‌ can be used effectively on a daily basis‌ by practitioners to accurately reproduce the dynamic of‌ real HPC systems. In particular, SMPI 54,‌ a simulator based on SimGrid that simulates unmodified‌ MPI applications written in C/C++ or FORTRAN, has‌ now become a very unique tool allowing to‌ faithfully study particularly complex scenario such as legacy‌ Geophysics application that suffers from spatial and temporal‌ load balancing problem 85, 84 or the‌ HPL benchmark 52, 53. We have‌ shown that the performance (both for time and‌ energy consumption 80) predicted through our simulations‌ was systematically within a few percents of real‌ experiments, which allows to reliably tune the applications‌ at very low cost. This capacity has also‌ been leveraged to study (through StarPU-SimGrid) complex‌ and modern task-based applications running on heterogeneous sets‌ of hybrid (CPUs + GPUs) nodes 99.‌ The phenomenon studied through this approach would be‌ particularly difficult to study through real experiments but‌ yet allow to address real problems of these‌ applications. Finally, SimGrid is also heavily used through‌ BatSim, a batch simulator developed in the‌ DATAMOVE team and which leverages SimGrid, to investigate‌ the performance of machine learning strategies in a‌ batch scheduling context 88, 121.

Trace‌ Analysis and Visualization

Many monolithic visualization tools have‌ been developed by renowned HPC groups since decades‌ (e.g., BSC 103, Jülich and TU Dresden‌ 98, 48, UIUC 79, 107‌, 83 and ANL 119) but most‌ of these tools build on the classical information‌ visualization 109 that consists in always first presenting‌ an overview of the data, possibly by plotting‌ everything if computing power allows, and then to‌ allow users to zoom and filter, providing details‌ on demand. However in our context, the amount‌ of data comprised in such traces is several‌ orders of magnitude larger than the number of‌ pixels on a screen and displaying even a‌ small fraction of the trace leads to harmful‌ visualization artifacts. Such traces are typically made of‌ events that occur at very different time and‌ space scales and originate from different sources, which‌ hinders classical approaches, especially when the application structure‌ departs from classical MPI programs with a BSP/SPMD‌ structure. In particular, modern HPC applications that build on a task-based runtime‌ and run on hybrid‌ nodes are particularly challenging‌‌ to analyze. Indeed, the underlying task-graph is dynamically‌ scheduled to avoid spurious‌ synchronizations, which prevents classical‌‌ visualizations to exploit and reveal the application structure.‌

In 62, we‌ explain how modern data‌‌ analytics tools can be used to build, from‌ heterogeneous information sources, custom,‌ reproducible and insightful visualizations‌‌ of task-based HPC applications at a very low‌ development cost in the‌ StarVZ framework. By specifying‌‌ and validating statistical models of the performance of‌ HPC applications/systems, we manage‌ to identify when their‌‌ behavior departs from what is expected and detect‌ performance anomalies. This approach‌ has first been applied‌‌ to state-of-the art linear algebra libraries in 62‌ and more recently to‌ a sparse direct solver‌‌ 96. In both cases, we have been‌ able to identify and‌ fix several non-trivial anomalies‌‌ that had not been noticed even by the‌ application and runtime developers.‌ Finally, these models not‌‌ only allow to reveal when applications depart from‌ what is expected but‌ also to summarize the‌‌ execution by focusing on the most important features,‌ which is particularly useful‌ when comparing two executions.‌‌

Design of Experiments and Reproducibility

Part of our‌ work is devoted to‌ the control of experiments‌‌ on both classical (HPC) and novel (IoT/Fog in‌ a smart home context)‌ infrastructures. To this end,‌‌ we heavily rely on experimental testbeds such as‌ Grid5000 and FIT-IoTLab that‌ can be well-controlled but‌‌ real experiments are nonetheless quite resource-consuming. Design of‌ experiments has been successfully‌ applied in many fields‌‌ (e.g., agriculture, chemistry, industrial processes) where experiments are‌ considered expensive. Building on‌ concrete use cases, we‌‌ explore how Design of Experiments and Reproducible Research‌ techniques can be used‌ to (1) design transparent‌‌ auto-tuning strategies of scientific computation kernels 47,‌ 108 (2) set up‌ systematic performance non regression‌‌ tests on Grid5000 (450 nodes for 1.5 year)‌ and detect many abnormal‌ events (related to bios‌‌ and system upgrades, cooling, faulty memory and power‌ instability) that had a‌ significant effect on the‌‌ nodes, from subtle performance changes of 1% to‌ much more severe degradation‌ of more than 10%,‌‌ and had yet been unnoticed by both Grid’5000‌ technical team and Grid’5000‌ users (3) design and‌‌ evaluate the performance of service provisioning strategies 56‌, 55 in Fog‌ infrastructures.

3.2 Asymptotic Methods‌‌

Participants: Jonatha Anselmi, Romain Couillet, Nicolas‌ Gast, Bruno Gaujal‌, Florence Perronnin,‌‌ Jean-Marc Vincent.

Project-team positioning

Stochastic models often‌ suffer from the curse‌ of dimensionality: their complexity‌‌ grows exponentially with the number of dimensions of‌ the system. At the‌ same time, very large‌‌ stochastic systems are sometimes easier to analyze: it‌ can be shown that‌ some classes of stochastic‌‌ systems simplify as their dimension goes to infinity‌ because of averaging effects‌ such as the law‌‌ of large numbers, or the central limit theorem.‌ This forms the basis‌ of what is called‌‌ an asymptotic method,‌ which consists in studying what happens when a‌ system gets large in order to build an‌ approximation that is easier to study or to‌ simulate.

Within the team, the research that we‌ conduct in this axis is to foster the‌ applicability of these asymptotic methods to new application‌ areas. This leads us to work on the‌ application of classical methods to new problems, but‌ also to develop new approximation methods that take‌ into account special features of the systems we‌ study (i.e., moderate number of dimensions, transient behavior,‌ random matrices). Typical applications are mean field method‌ for performance evaluation, application to distributed optimization, and‌ more recently statistical learning. One originality of our‌ work is to quantify precisely what is the‌ error made by such approximations. This allows us‌ to define refinement terms that lead to more‌ accurate approximations.

Scientific achievements

Refined mean field approximation‌

Mean field approximation is a well-known technique in‌ statistical physics, that was originally introduced to study‌ systems composed of a very large number of‌ particles (say $n > 10^{20}$ ). The‌ idea of this approximation is to assume that‌ objects are independent and only interact between them‌ through an average environment (the mean field). Nowadays,‌ variants of this technique are widely applied in‌ many domains: in game theory for instance (with‌ the example of mean field games), but also‌ to quantify the performance of distributed algorithms. Mean‌ field approximation is often justified by showing that‌ a system of $n$ well-mixed interacting objects converges‌ to its deterministic mean field approximation as $n‌$ goes to infinity. Yet, this does not explain‌ why mean field approximation provides a very accurate‌ approximation of the behavior of systems composed by‌ a few hundreds of objects or less. Until‌ recently, this was essentially an open question.

In‌ 64, we give a partial answer to‌ this question. We show that, for most of‌ the mean field models used for performance evaluation,‌ the error made when using a mean field‌ approximation is a $Θ (1 / n‌)$ . This results greatly improved compared to‌ previous work that showed that the error made‌ by mean field approximation was smaller than $O‌ (1 / \sqrt{n})$ . On the‌ contrary, we obtain the exact rate of accuracy.‌ This result came from the use of Stein's‌ method that allows one to quantify precisely the‌ distance between two stochastic processes. Subsequently, in 68‌, we show that the constant in the‌ $Θ (1 / n)$ can be‌ computed numerically by a very efficient algorithm. By‌ using this, we define the notion of refined‌ approximation which consists in adding the $1 /‌ n$ -correction term. This methods can also be‌ generalize to higher order extensions or 70,‌ 63.

Design and analysis of distributed control‌ algorithms

Mean field approximation is widely used in‌ the performance evaluation community to analyze and design distributed control algorithms. Our‌ contribution in this domain‌ has covered mainly two‌‌ applications: cache replacement algorithms and load balancing algorithms.‌

Cache replacement algorithms are‌ widely used in content‌‌ delivery networks. In 50, 72, 71‌, we show how‌ mean field and refined‌‌ mean field approximation can be used to evaluate‌ the performance of list-based‌ cache replacement algorithms. In‌‌ particular, we show that such policies can outperform‌ the classically used LRU‌ algorithm. A methodological contribution‌‌ of our work is that, when evaluating precisely‌ the behavior of such‌ a policy, the refined‌‌ mean field approximation is both faster and more‌ accurate than what could‌ be obtained with a‌‌ stochastic simulator.

Computing resources are often spread across‌ many machines. An efficient‌ use of such resources‌‌ requires the design of a good load balancing‌ strategy, to distribute the‌ load among the available‌‌ machines. In 43, 44, 42,‌ we study two paradigms‌ that we use to‌‌ design asymptotically optimal load balancing policies where a‌ central broker sends tasks‌ to a set of‌‌ parallel servers. We show in 43, 42‌ that combining the classical‌ round-robin allocation plus an‌‌ evaluation of the tasks sizes can yield a‌ policy that has a‌ zero delay in the‌‌ large system limit. This policy is interesting because‌ the broker does not‌ need any feedback from‌‌ the servers. At the same time, this policy‌ needs to estimate or‌ know job durations, which‌‌ is not always possible. A different approach is‌ used in 44 where‌ we consider a policy‌‌ that does not need to estimate job durations‌ but that uses some‌ feedback from the servers‌‌ plus a memory of where jobs where send.‌ We show that this‌ paradigm can also be‌‌ used to design zero-delay load balancing policies as‌ the system size grows‌ to infinity.

Mean field‌‌ games

Various notions of mean field games have‌ been introduced in the‌ years 2000-2010 in theoretical‌‌ economics, engineering or game theory. A mean field‌ game is a game‌ in which an individual‌‌ tries to maximize its utility while evolving in‌ a population of other‌ individuals whose behavior are‌‌ not directly affected by the individual. An equilibrium‌ is a population dynamics‌ for which a selfish‌‌ individual would behave as the population. In 58‌, we develop the‌ notion of discrete space‌‌ mean field games, that is more amenable to‌ study than the previously‌ introduced notions of mean‌‌ field games. This leads to two interesting contributions:‌ mean field games are‌ not always the limits‌‌ of stochastic games as the number of players‌ grow 57, mean‌ field games can be‌‌ used to study how much vaccination should be‌ subsidized to encourage people‌ to adapt a socially‌‌ optimal behaviour 73.

3.3 Distributed Online Optimization‌ and Learning in Games‌

Participants: Nicolas Gast,‌‌ Romain Couillet, Bruno Gaujal, Arnaud Legrand‌, Patrick Loiseau,‌ Panayotis Mertikopoulos, Bary‌‌ Pradelski.

Project-team positioning‌

Online learning concerns the study of repeated decision-making‌ in changing environments. Of course, depending on the‌ context, the words “learning” and “decision-making” may refer‌ to very different things: in economics, this could‌ mean predicting how rational agents react to market‌ drifts; in data networks, it could mean adapting‌ the way packets are routed based on changing‌ traffic conditions; in machine learning and AI applications,‌ it could mean training a neural network or‌ the guidance system of a self-driving car; etc.‌ In particular, the changes in the learner's environment‌ could be either exogenous (that is, independent of‌ the learner's decisions, such as the weather affecting‌ the time of travel), or endogenous (i.e., they‌ could depend on the learner's decisions, as in‌ a game of poker), or any combination thereof.‌ However, the goal for the learner(s) is always‌ the same: to make more informed decisions that‌ lead to better rewards over time.

The study‌ of online learning models and algorithms dates back‌ to the seminal work of Robbins, Nash and‌ Bellman in the 50's, and it has since‌ given rise to a vigorous research field at‌ the interface of game theory, control and optimization,‌ with numerous applications in operations research, machine learning,‌ and data science. In this general context, our‌ team focuses on the asymptotic behavior of online‌ learning and optimization algorithms, both single- and multi-agent:‌ whether they converge, at what speed, and/or what‌ type of non-stationary, off-equilibrium behaviors may arise when‌ they do not.

The focus of POLARIS on‌ game-theoretic and Markovian models of learning covers a‌ set of specific challenges that dovetail in a‌ highly synergistic manner with the work of other‌ learning-oriented teams within Inria (like SCOOL in Lille,‌ SIERRA in Paris, and THOTH in Grenoble), and‌ it is an important component of Inria's activities‌ and contributions in the field (which includes major‌ industrial stakeholders like Google / DeepMind, Facebook, Microsoft,‌ Amazon, and many others).

Scientific achievements

Our team's‌ work on online learning covers both single- and‌ multi-agent models; in the sequel, we present some‌ highlights of our work structured along these basic‌ axes.

In the single-agent setting, an important problem‌ in the theory of Markov decision processes –‌ i.e., discrete-time control processes with decision-dependent randomness –‌ is the so-called “restless bandit” problem. Here, the‌ learner chooses an action – or “arm” –‌ from a finite set, and the mechanism determining‌ the action's reward changes depending on whether the‌ action was chosen or not (in contrast to‌ standard Markov problems where the activation of an‌ arm does not have this effect). In this‌ general setting, Whittle conjectured – and Weber and‌ Weiss proved – that Whittle's eponymous index policy‌ is asymptotically optimal. However, the result of Weber‌ and Weiss is purely asymptotic, and the rate‌ of this convergence remained elusive for several decades.‌ This gap was finally settled in a series‌ of POLARIS papers 6667, where we showed that Whittle indices‌ (as well as other‌ index policies) become optimal‌‌ at a geometric rate under the same technical‌ conditions used by Weber‌ and Weiss to prove‌‌ Whittle's conjecture, plus a technical requirement on the‌ non-singularity of the fixed‌ point of the mean-field‌‌ dynamics. We also propose the first sub-cubic algorithm‌ to compute Whittle and‌ Gittins indexes. As for‌‌ reinforcement learning in Markovian bandits, we have shown‌ that Bayesian and optimistic‌ approaches do not use‌‌ the structure of Markovian bandits similarly: While Bayesian‌ learning has both a‌ regret and a computational‌‌ complexity that scales linearly with the number of‌ arms, optimistic approaches all‌ incur an exponential computation‌‌ time, at least in their current versions 65‌.

In the multi-agent‌ setting, our work has‌‌ focused on the following fundamental question:

Does the‌ concurrent use of (‌possibly optimal) single-agent‌‌ learning algorithms

ensure convergence to Nash equilibrium in‌ multi-agent, game-theoretic environments?

Conventional‌ wisdom might suggest a‌‌ positive answer to this question because of the‌ following “folk theorem”: under‌ no-regret learning, the agents'‌‌ empirical frequency of play converges to the game's‌ set of coarse correlated‌ equilibria. However, the actual‌‌ implications of this result are quite weak: First,‌ it concerns the empirical‌ frequency of play and‌‌ not the day-to-day sequence of actions employed by‌ the players. Second, it‌ concerns coarse correlated equilibria‌‌ which may be supported on strictly dominated strategies‌ – and are thus‌ unacceptable in terms of‌‌ rationalizability. These realizations prompted us to make a‌ clean break with conventional‌ wisdom on this topic,‌‌ ultimately showing that the answer to the above‌ question is, in general,‌ “no”: specifically, 93,‌‌ 91 showed that the (optimal) class of “follow-the-regularized-leader”‌ (FTRL) learning algorithms leads‌ to Poincaré recurrence even‌‌ in simple, $2 \times 2$ min-max games, thus‌ precluding convergence to Nash‌ equilibrium in this context.‌‌

This negative result generated significant interest in the‌ literature as it contributed‌ in shifting the focus‌‌ towards identifying which Nash equilibria may arise as‌ stable limit points of‌ FTRL algorithms and dynamics.‌‌ Earlier work by POLARIS on the topic 49‌, 94, 95‌ suggested that strict Nash‌‌ equilibria play an important role in this question.‌ This suspicion was recently‌ confirmed in a series‌‌ of papers 61, 78 where we established‌ a sweeping negative result‌ to the effect that‌‌ mixed Nash equilibria are incompatible with no-regret learning‌. Specifically, we showed‌ that any Nash equilibrium‌‌ which is not strict cannot be stable and‌ attracting under the dynamics‌ of FTRL, especially in‌‌ the presence of randomness and uncertainty. This result‌ has significant implications for‌ predicting the outcome of‌‌ a multi-agent learning process because, combined with 94‌, it establishes the‌ following far-reaching equivalence: a‌‌ state is asymptotically stable under no-regret learning if‌ and only if it‌ is a strict Nash‌‌ equilibrium.

Going beyond finite games, this further raised‌ the question of what‌ type of non-convergent behaviors‌‌ can be observed in‌ continuous games – such as the class of‌ stochastic min-max problems that are typically associated to‌ generative adversarial networks (GANs) in machine learning. This‌ question was one of our primary collaboration axes‌ with EPFL, and led to a joint research‌ project focused on the characterization of the convergence‌ properties of zeroth-, first-, and (scalable) second-order methods‌ in non-convex/non-concave problems. In particular, we showed in‌ 82 that these state-of-the-art min-max optimization algorithms may‌ converge with arbitrarily high probability to attractors that‌ are in no way min-max optimal or even‌ stationary – and, in fact, may not even‌ contain a single stationary point (let alone a‌ Nash equilibrium). Spurious convergence phenomena of this type‌ can arise even in two-dimensional problems, a fact‌ which corroborates the empirical evidence surrounding the formidable‌ difficulty of training GANs.

3.4 Responsible Computer Science‌

Participants: Nicolas Gast, Romain Couillet, Bruno‌ Gaujal, Arnaud Legrand, Patrick Loiseau,‌ Panayotis Mertikopoulos, Bary Pradelski.

Project-team positioning‌

The topics in this axis emerge from current‌ social and economic questions rather than from a‌ fixed set of mathematical methods. To this end‌ we have identified large trends such as energy‌ efficiency, fairness, privacy, and the growing number of‌ new market places. In addition, COVID has posed‌ new questions that opened new paths of research‌ with strong links to policy making.

Throughout these‌ works, the focus of the team is on‌ modeling aspects of the aforementioned problems, and obtaining‌ strong theoretical results that can give high-level guidelines‌ on the design of markets or of decision-making‌ procedures. Where relevant, we complement those works by‌ measurement studies and audits of existing systems that‌ allow identifying key issues. As this work is‌ driven by topics, rather than methods, it allows‌ for a wide range of collaborations, including with‌ enterprises (e.g., Naverlabs), policy makers, and academics from‌ various fields (economics, policy, epidemiology, etc.).

Other teams‌ at Inria cover some of the societal challenges‌ listed here (e.g., PRIVATICS, COMETE) but rather in‌ isolation. The specificity of POLARIS resides in the‌ breadth of societal topics covered and of the‌ collaborations with non-CS researchers and non-research bodies; as‌ well as in the application of methods such‌ as game theory to those topics.

Scientific achievements‌

Algorithmic fairness

As algorithmic decision-making became increasingly omnipresent‌ in our daily lives (in domains ranging from‌ credits to advertising, hiring, or medicine); it also‌ became increasingly apparent that the outcome of algorithms‌ can be discriminatory for various reasons. Since 2016,‌ the scientific community working on the problem of‌ algorithmic fairness has been exponentially increasing. In this‌ context, in the early days, we worked on‌ better understanding the extent of the problem through‌ measurement in the case of social networks 111‌. In particular, in this work, we showed‌ that in advertising platforms, discrimination can occur from‌ multiple different internal processes that cannot be controlled,‌ and we advocate for measuring discrimination on the outcome directly. Then we‌ worked on proposing solutions‌ to guarantee fair representation‌‌ in online public recommendations (aka trending topics on‌ Twitter) 51. This‌ is an example of‌‌ an application in which it was observed that‌ recommendations are typically biased‌ towards some demographic groups.‌‌ In this work, our proposed solution draws an‌ analogy between recommendation and‌ voting and builds on‌‌ existing works on fair representation in voting. Finally,‌ in most recent times,‌ we worked on better‌‌ understanding the sources of discrimination, in the particular‌ simple case of selection‌ problems, and the consequences‌‌ of fixing it. While most works attribute discrimination‌ to implicit bias of‌ the decision maker 87‌‌, we identified a fundamentally different source of‌ discrimination: Even in the‌ absence of implicit bias‌‌ in a decision maker’s estimate of candidates’ quality,‌ the estimates may differ‌ between the different groups‌‌ in their variance—that is, the decision maker’s ability‌ to precisely estimate a‌ candidate’s quality may depend‌‌ on the candidate’s group 60. We show‌ that this differential variance‌ leads to discrimination for‌‌ two reasonable baseline decision makers (group-oblivious and Bayesian‌ optimal). Then we analyze‌ the consequence on the‌‌ selection utility of imposing fairness mechanisms such as‌ demographic parity or its‌ generalization; in particular we‌‌ identify some cases for which imposing fairness can‌ improve utility. In 59‌, we also study‌‌ similar questions in the two-stage setting, and derive‌ the optimal selector and‌ the “price of local‌‌ fairness’’ one pays in utility by imposing that‌ the interim stage be‌ fair.

Privacy and transparency‌‌ in social computing system

Online services in general,‌ and social networks in‌ particular, collect massive amounts‌‌ of data about their users (both online and‌ offline). It is critical‌ that (i) the users’‌‌ data is protected so that it cannot leak‌ and (ii) users can‌ know what data the‌‌ service has about them and understand how it‌ is used—this is the‌ transparency requirement. In this‌‌ context, we did two kinds of work. First,‌ we studied social networks‌ through measurement, in particular‌‌ using the use case of Facebook. We showed‌ that their advertising platform,‌ through the PII1‌‌-based targeting option, allowed attackers to discover some‌ personal data of users‌ 113. We also‌‌ proposed an alternative design—valid for any system that‌ proposed PII-based targeting—and proved‌ that it fixes the‌‌ problem. We then audited the transparency mechanisms of‌ the Facebook ad platform,‌ specifically the “Ad Preferences’’‌‌ page that shows what interests the platform inferred‌ about a user, and‌ the “Why am I‌‌ seeing this’’ button that gives some reasons why‌ the user saw a‌ particular ad. In both‌‌ cases, we laid the foundation for defining the‌ quality of explanations and‌ we showed that the‌‌ explanations given were lacking key desirable properties (they‌ were incomplete and misleading,‌ they have since been‌‌ changed) 41. A follow-up work shed further‌ light on the typical‌ uses of the platform‌‌ 40. In another‌ work, we proposed an innovative protocol based on‌ randomized withdrawal to protect public posts deletion privacy‌ 97. Finally, in 69, we study‌ an alternative data sharing ecosystem where users can‌ choose the precision of the data they give.‌ We model it as a game and show‌ that, if users are motivated to reveal data‌ by a public good component of the outcome’s‌ precision, then certain basic statistical properties (the optimality‌ of generalized least squares in particular) no longer‌ hold.

Online markets

Market design operates at the‌ intersection of computer science and economics and has‌ become increasingly important as many markets are redesigned‌ on digital platforms. Studying markets for commodities, in‌ an ongoing project we evaluate how different fee‌ models alter strategic incentives for both buyers and‌ sellers. We identify two general classes of fees:‌ for one, strategic manipulation becomes infeasible as the‌ market grows large and agents therefore have no‌ incentive to misreport their true valuation. On the‌ other hand, strategic manipulation is possible and we‌ show that in this case agents aim to‌ maximally shade their bids. This has immediate implications‌ for the design of such markets. By contrast,‌ 92 considers a matching market where buyers and‌ sellers have heterogeneous preferences over each other. Traders‌ arrive at random to the market and the‌ market maker, having limited information, aims to optimize‌ when to open the market for a clearing‌ event to take place. There is a tradeoff‌ between thickening the market (to achieve better matches)‌ and matching quickly (to reduce waiting time of‌ traders in the market). The tradeoff is made‌ explicit for a wide range of underlying preferences.‌ These works are adding to an ongoing effort‌ to better understand and design markets 10489‌.

COVID

The COVID-19 pandemic has put humanity‌ to one of the defining challenges of its‌ generation and as such naturally trans-disciplinary efforts have‌ been necessary to support decision making. In a‌ series of articles 106102 we proposed Green‌ Zoning. `Green zones’–areas where the virus is under‌ control based on a uniform set of conditions–can‌ progressively return to normal economic and social activity‌ levels, and mobility between them is permitted. By‌ contrast, stricter public health measures are in place‌ in ‘red zones’, and mobility between red and‌ green zones is restricted. France and Spain were‌ among the first countries to introduce green zoning‌ in April 2020. The initial success of this‌ proposal opened up the way to a large‌ amount of follow-up work analyzing and proposing various‌ tools to effectively deploy different tools to combat‌ the pandemic (e.g., focus-mass testing 105 and a‌ vaccination policy 100). In a joint work‌ with a group of leading economists, public health‌ researchers and sociologists it was found that countries‌ that opted to aim to eliminate the virus‌ fared better not only for public health, but‌ also for the economy and civil liberties 101. Overall this work‌ has been characterized by‌ close interactions with policy‌‌ makers in France, Spain and the European Commission‌ as well as substantial‌ activity in public discourse‌‌ (via TV, newspapers and radio).

Energy efficiency

Our‌ work on energy efficiency‌ spanned multiple different areas‌‌ and applications such as embedded systems and smart‌ grids. Minimizing the energy‌ consumption of embedded systems‌‌ with real-time constraints is becoming more important for‌ ecological as well as‌ practical reasons since batteries‌‌ are becoming standard power supplies. Dynamically changing the‌ speed of the processor‌ is the most common‌‌ and efficient way to reduce energy consumption 110‌. In fact, this‌ is the reason why‌‌ modern processors are equipped with Dynamic Voltage and‌ Frequency Scaling (DVFS) technology‌ 117. In a‌‌ stochastic environment, with random job sizes and arrival‌ times, combining hard deadlines‌ and energy minimization via‌‌ DVFS-based techniques is difficult because forcing hard deadlines‌ requires considering the worst‌ cases, hardly compatible with‌‌ random dynamics. Nevertheless, progress have been made to‌ solve these types of‌ problems in a series‌‌ of papers using constrained Markov decision processes, both‌ on the theoretical side‌ (proving existence of optimal‌‌ policies and showing their structure 76, 74‌, 75) as‌ well as on the‌‌ experimental side (showing the gains of optimal policies‌ over classical solutions 77‌).

In the context‌‌ of a collaboration with Enedis and Schneider Electric‌ (via the Smart Grid‌ chair of Grenoble-INP), we‌‌ also study the problem of using smart meters‌ to optimize the behavior‌ of electrical distribution networks.‌‌ We made three kinds of contributions on this‌ subject: (1) how to‌ design efficient control strategies‌‌ in such a system 114, 116,‌ 115, (2) how‌ to co-simulate an electrical‌‌ network and a communication network 86, and‌ (3) what is the‌ performance of the communication‌‌ protocol (PLC G3) used by the Linky smart‌ meters 90.

4‌ Application domains

4.1 Large‌‌ Computing Infrastructures

Supercomputers typically comprise thousands to millions‌ of multi-core CPUs with‌ GPU accelerators interconnected by‌‌ complex interconnection networks that are typically structured as‌ an intricate hierarchy of‌ network switches. Capacity planning‌‌ and management of such systems not only raises‌ challenges in term of‌ computing efficiency but also‌‌ in term of energy consumption. Most legacy (SPMD)‌ applications struggle to benefit‌ from such infrastructure since‌‌ the slightest failure or load imbalance immediately causes‌ the whole program to‌ stop or at best‌‌ to waste resources. To scale and handle the‌ stochastic nature of resources,‌ these applications have to‌‌ rely on dynamic runtimes that schedule computations and‌ communications in an opportunistic‌ way. Such evolution raises‌‌ challenges not only in terms of programming but‌ also in terms of‌ observation (complexity and dynamicity‌‌ prevents experiment reproducibility, intrusiveness hinders large scale data‌ collection, ...) and analysis‌ (dynamic and flexible application‌‌ structures make classical visualization and simulation techniques totally‌ ineffective and require to‌ build on ad hoc‌‌ information on the application‌ structure).

4.2 Next-Generation Wireless Networks

Considerable interest has‌ arisen from the seminal prediction that the use‌ of multiple-input, multiple-output (MIMO) technologies can lead to‌ substantial gains in information throughput in wireless communications,‌ especially when used at a massive level. In‌ particular, by employing multiple inexpensive service antennas, it‌ is possible to exploit spatial multiplexing in the‌ transmission and reception of radio signals, the only‌ physical limit being the number of antennas that‌ can be deployed on a portable device. As‌ a result, the wireless medium can accommodate greater‌ volumes of data traffic without requiring the reallocation‌ (and subsequent re-regulation) of additional frequency bands. In‌ this context, throughput maximization in the presence of‌ interference by neighboring transmitters leads to games with‌ convex action sets (covariance matrices with trace constraints)‌ and individually concave utility functions (each user's Shannon‌ throughput); developing efficient and distributed optimization protocols for‌ such systems is one of the core objectives‌ of the research theme presented in Section 3.3‌.

Another major challenge that occurs here is‌ due to the fact that the efficient physical‌ layer optimization of wireless networks relies on perfect‌ (or close to perfect) channel state information (CSI),‌ on both the uplink and the downlink. Due‌ to the vastly increased computational overhead of this‌ feedback – especially in decentralized, small-cell environments –‌ the continued transition to fifth generation (5G) wireless‌ networks is expected to go hand-in-hand with distributed‌ learning and optimization methods that can operate reliably‌ in feedback-starved environments. Accordingly, one of POLARIS' application-driven‌ goals will be to leverage the algorithmic output‌ of Theme 5 into a highly adaptive resource‌ allocation framework for next-géneration wireless systems that can‌ effectively "learn in the dark", without requiring crippling‌ amounts of feedback.

4.3 Energy and Transportation

Smart‌ urban transport systems and smart grids are two‌ examples of collective adaptive systems. They consist of‌ a large number of heterogeneous entities with decentralised‌ control and varying degrees of complex autonomous behaviour.‌ We develop an analysis tool to help to‌ reason about such systems. Our work relies on‌ tools from fluid and mean-field approximation to build‌ decentralized algorithms that solve complex optimization problems. We‌ focus on two problems: decentralized control of electric‌ grids and capacity planning in vehicle-sharing systems to‌ improve load balancing.

4.4 Social Computing Systems

Social‌ computing systems are online digital systems that use‌ personal data of their users at their core‌ to deliver personalized services directly to the users.‌ They are omnipresent and include for instance recommendation‌ systems, social networks, online medias, daily apps, etc.‌ Despite their interest and utility for users, these‌ systems pose critical challenges of privacy, security, transparency,‌ and respect of certain ethical constraints such as‌ fairness. Solving these challenges involves a mix of‌ measurement and/or audit to understand and assess issues,‌ and modeling and optimization to propose and calibrate‌ solutions.

5 Social and environmental responsibility

5.1 Footprint‌ of research activities

We try to keep the carbon footprint of the‌ team has low as‌ possible by a stricter‌‌ laptop renewal policy and by reducing plane travels‌ (e.g., using visioconference or‌ sometimes by avoiding publishing‌‌ our research in conferences that would take place‌ on the other side‌ of the planet).

Our‌‌ team does not train heavy ML models requiring‌ important processing power although‌ some of us perform‌‌ computer science experiments, mostly using the Grid5000 platforms.‌ We keep this usage‌ very reasonable and rely‌‌ on cheaper alternatives (e.g., simulations) as much as‌ possible.

5.2 Impact of‌ research results

Jean-Marc Vincent‌‌ is heavily engaged since several years in the‌ training of computer science‌ teachers at the elementary/middle/high‌‌ school levels. Among one of his many activities,‌ we can mention his‌ involvement in the design‌‌ of the Numérique et Sciences Informatiques, NSI :‌ les fondamentaux MOOC.‌

6 Highlights of the‌‌ year

6.1 Awards

The article Multi-agent learning under‌ uncertainty: Recurrence vs. concentration‌ 20Panayotis Mertikopoulos and‌‌ his co-authors was selected for a spotlight at‌ NeurIPS 2025.

The article‌ Does Stochastic Gradient really‌‌ succeed for Bandits? 37 by Dorian Baudry and‌ his coauthors has been‌ admited for oral presentation‌‌ at the NeurIPS 2025 conference. There were 21575‌ valid paper submissions to‌ the NeurIPS Main Track‌‌ this year, of which the program committee accepted‌ 5290 (24.52%) papers in‌ total, with breakdown of‌‌ 4525 as posters, 688 as spotlight and 77‌ as oral.

Mathieu Besançon‌ has been a member‌‌ of the team that won the 2025 Mixed-Integer‌ Programming Workshop Computational Competition‌.

Victor Boone received‌‌ the 2025 UGA Academic Thesis Prize for his‌ research work among PhDs‌ graduating in 2024.

6.2‌‌ PhD defense

Hugo Lebeau has defended his PhD‌ thesis on March 2025‌ entitled Random Matrix and‌‌ Tensor Models for Large Data Processing 29.‌

Romain Cravic has defended‌ his PhD thesis on‌‌ November 2025 entitled Learning in stochastic games 28‌.

7 Latest software‌ developments, platforms, open data‌‌

7.1 Latest software developments

7.1.1 SimGrid

Keywords:
Large-scale‌ Emulators, Grid Computing, Distributed‌ Applications
Scientific Description:

SimGrid‌‌ is a toolkit that provides core functionalities for‌ the simulation of distributed‌ applications in heterogeneous distributed‌‌ environments. The simulation engine uses algorithmic and implementation‌ techniques toward the fast‌ simulation of large systems‌‌ on a single machine. The models are theoretically‌ grounded and experimentally validated.‌ The results are reproducible,‌‌ enabling better scientific practices.

Its models of networks,‌ cpus and disks are‌ adapted to (Data)Grids, P2P,‌‌ Clouds, Fogs, Clusters and HPC, allowing multi-domain studies.‌ It can be used‌ either to simulate algorithms‌‌ and prototypes of applications, or to emulate real‌ MPI applications through the‌ virtualization of their communication,‌‌ or to formally assess algorithms and applications that‌ can run in the‌ framework.

The formal verification‌‌ module explores all possible message interleavings in the‌ application, searching for states‌ violating the provided properties.‌‌ This tool can be used to assess safety‌ properties over arbitrary and‌ legacy codes, thanks to‌‌ a system-level introspection tool‌ that provides a finely detailed view of the‌ running application to the model checker. This can‌ for example be leveraged to verify arbitrary MPI‌ code written in C/C++/Fortran.
Functional Description:
SimGrid is‌ a simulation toolkit that provides core functionalities for‌ the simulation of distributed applications in large scale‌ heterogeneous distributed environments.
Release Contributions:

Breaking the seal:‌ v4.0 was not the final release.

* Allow‌ one to unseal netzones to modify the platform‌ even after the simulation start. * The model-checker‌ can now report memory race conditions (see tutorial).‌ * Pip builds should now work out of‌ the box. * (+ the usual bug fixes‌ overall, and improvements to the Java/Python bindings).
News‌ of the Year:

There were 2 major releases‌ in 2025. We released v4.0 in March, embodying‌ 10 years of development. This turns SimGrid into‌ a mature and stable research instrument. The users‌ can easily extend this tool to adapt it‌ to their specific research, while trusting its software‌ implementation. Release v4.1 was published in November, showing‌ that the development did not stall even if‌ the framework is mostly in maintenance mode. The‌ performance simulation mode was extended to allow modifications‌ of the platform topology during the simulation.

Most‌ of our work this year is related to‌ the use of SimGrid for performance prediction of‌ HPC applications and capacity planning of supercomputers.
URL:‌
https://simgrid.org/
Publication:
hal-04909441
Contact:
Martin Quinson
Participants:
Mathieu‌ Laurent, Anne-Cécile Orgerie, Arnaud Legrand, Augustin Degomme, Arnaud‌ Giersch, Frédéric Suter, Martin Quinson, Samuel Thibault
Partners:‌
CNRS, ENS Rennes

7.1.2 PSI

Name:
Perfect Simulator‌
Keywords:
Markov model, Simulation
Functional Description:
Perfect simulator‌ is a simulation software of markovian models. It‌ is able to simulate discrete and continuous time‌ models to provide a perfect sampling of the‌ stationary distribution or directly a sampling a functional‌ of this distribution by using coupling from the‌ past. The simulation kernel is based on the‌ CFTP algorithm, and the internal simulation of transitions‌ on the Aliasing method.
URL:
https://gitlab.inria.fr/PSI/psi3/
Contact:
Jean-Marc‌ Vincent

7.1.3 marmoteCore

Name:
Markov Modeling Tools and‌ Environments - the Core
Keywords:
Modeling, Stochastic models,‌ Markov model
Functional Description:

marmoteCore is a C++‌ environment for modeling with Markov chains. It consists‌ in a reduced set of high-level abstractions for‌ constructing state spaces, transition structures and Markov chains‌ (discrete-time and continuous-time). It provides the ability of‌ constructing hierarchies of Markov models, from the most‌ general to the particular, and equip each level‌ with specifically optimized solution methods.

This software was‌ started within the ANR MARMOTE project: ANR-12-MONU-00019.
URL:‌
https://gitlab.inria.fr/PSI/marmotecore/
Publications:
hal-01651940, hal-01276456
Contact:
Alain Jean-Marie‌
Participants:
Alain Jean-Marie, Hlib Mykhailenko, Benjamin Briot, Franck‌ Quessette, Issam Rabhi, Jean-Marc Vincent, Jean-Michel Fourneau
Partners:‌
Université de Versailles St-Quentin-en-Yvelines, Université Paris Nanterre

7.1.4‌ MarTO

Name:
Markov Toolkit for Markov models simulation:‌ perfect sampling and Monte Carlo simulation
Keywords:
Perfect‌ sampling, Markov model
Functional Description:
MarTO is a‌ simulation software of markovian models. It is able to simulate discrete and‌ continuous time models to‌ provide a perfect sampling‌‌ of the stationary distribution or directly a sampling‌ of functional of this‌ distribution by using coupling‌‌ from the past. The simulation kernel is based‌ on the CFTP algorithm,‌ and the internal simulation‌‌ of transitions on the Aliasing method. This software‌ is a rewrite, more‌ efficient and flexible, of‌‌ PSI
URL:
https://gitlab.inria.fr/MarTo/marto
Contact:
Vincent Danjean

7.1.5 GameSeer‌

Keyword:
Game theory
Functional‌ Description:
GameSeer is a‌‌ tool for students and researchers in game theory‌ that uses Mathematica to‌ generate phase portraits for‌‌ normal form games under a variety of (user-customizable)‌ evolutionary dynamics. The whole‌ point behind GameSeer is‌‌ to provide a dynamic graphical interface that allows‌ the user to employ‌ Mathematica's vast numerical capabilities‌‌ from a simple and intuitive front-end. So, even‌ if you've never used‌ Mathematica before, you should‌‌ be able to generate fully editable and customizable‌ portraits quickly and painlessly.‌
URL:
http://polaris.imag.fr/panayotis.mertikopoulos/publications/s01-gameseer/
Contact:
Panayotis‌‌ Mertikopoulos

7.1.6 rmf_tool

Name:
A library to Compute‌ (Refined) Mean Field Approximations‌
Keyword:
Mean Field
Functional‌‌ Description:

The tool accepts three model types:

-‌ homogeneous population processes (HomPP)‌

- density dependent population‌‌ processes (DDPPs)

- heterogeneous population models (HetPP)

In‌ particular, it provides a‌ numerical algorithm to compute‌‌ the constant of the refined mean field approximation‌ provided in the paper‌ "A Refined Mean Field‌‌ Approximation" by N. Gast and B. Van Houdt,‌ SIGMETRICS 2018, and a‌ framework to compute heterogeneous‌‌ mean field approximations as proposed in "Mean Field‌ and Refined Mean Field‌ Approximations for Heterogeneous Systems:‌‌ It Works!" by N. Gast and S. Allmeier,‌ SIGMETRICS 2022.
URL:
https://github.com/ngast/rmf_tool‌
Publications:
hal-01622054, tel-02509756‌‌, hal-03485044
Contact:
Nicolas Gast

7.1.7 SCIP

Name:‌
Solving Constraint Integer Programs‌
Keywords:
Linear optimization, Mathematical‌‌ Optimization, Mixed Integer Programming
Functional Description:
SCIP is‌ currently one of the‌ fastest non-commercial solvers for‌‌ mixed integer programming (MIP) and mixed integer nonlinear‌ programming (MINLP). It is‌ also a framework for‌‌ constraint integer programming and branch-cut-and-price. It allows for‌ total control of the‌ solution process and the‌‌ access of detailed information down to the guts‌ of the solver.
Release‌ Contributions:

SCIP 10.0.0

Features‌‌ and Performance Improvements

Exact Solving:

- added numerically‌ exact solving mode for‌ mixed-integer linear programs to‌‌ the core framework including certification of branch-and-bound phase,‌ - core extensions (new‌ wrapper struct SCIP_RATIONAL for‌‌ rational arithmetic currently based on Boost, GMP, and‌ MPFR, new data structure‌ SCIP_LPEXACT for handling rational‌‌ LP relaxation and computing safe dual bounds, new‌ interfaces to exact LP‌ solvers SoPlex and QSopt_ex,‌‌ safe dualproof version of conflict analysis, new data‌ structure SCIP_CERTIFICATE for certificate‌ printing/proof logging) - new‌‌ plugins: new constraint handler "exactlinear" for handling linear‌ constraints with rational data,‌ new constraint handler "exactsol"‌‌ to post-process and repair solutions from floating-point heuristics,‌ - plugins revised for‌ numerically exact solving mode:‌‌ adjusted readers for MPS, LP, CIP, OPB/WBO, and‌ ZIMPL files, extended presolver‌ "milp" to perform rational‌‌ presolving with PaPILO, adjusted‌ constraint handler "integral" and default reliability pseudo-cost branching‌ rule "relpscost", extended Gomory cut separator to separate‌ and certify numerically safe MIR cuts, adjusted all‌ primal heuristics (except for five dedicated MINLP heuristics),‌ new interfaces to exact LP solvers SoPlex and‌ QSopt_ex

Symmetry Handling

- added more techniques to‌ handle reflection symmetries, in particular, for orbitopes with‌ column reflections and matrices whose rows and columns‌ can be permuted by a symmetry - Dejavu‌ can be used to compute symmetries, the source‌ code is shipped with SCIP and incorporates sassy‌ - implemented symmetry detection callbacks for disjunction and‌ superindicator constraint handlers - detailed information about applied‌ symmetry handling techniques can be printed to the‌ terminal - improve memory usage by introducing different‌ constraint handlers for full orbitopes and packing/partitioning orbitopes‌ - symmetry detection no longer treats implicit integer‌ variables separately, but computes symmetries based on the‌ variable type inferred from variable bounds and implied‌ integrality - extended the statistics to also include‌ information about the number of variables (per type)‌ affected by symmetry - implemented method to compute‌ new permutations from a given list of symmetry‌ group generators - cons_orbisack, cons_orbitope_full, cons_orbitope_pp, and cons_symresack‌ now try to replace the stored aggregated variables‌ by active ones at the end of presolving,‌ this should reduce the size of copies of‌ the presolved problem simplified symmetry detection graphs in‌ case all edges have the same color

Presolve:‌

- distinguish implicit integrality of variables into strong‌ and weak type, depending on whether integrality is‌ implied for all feasible or only at least‌ one optimal solution - added a new presolver‌ "implint", which detects implied integral variables by detecting‌ (transposed) network submatrices in the problem, for now,‌ this plugin is disabled by default - added‌ support for (transposed) network matrix detection allow multi-aggregation‌ of unbounded slack variables, which may enable more‌ bound tightening due to a reduction in the‌ number of unbounded variables resolve all fixings in‌ xor constraints also for an available integer variable‌
URL:
https://www.scipopt.org/
Contact:
Mathieu Besancon
Partners:
TU Darmstadt,‌ RWTH Aachen University, Friedrich-Alexander-Universität Erlangen-Nürnberg, Eindhoven University of‌ Technology, University of Twente, University of Bayreuth, Forschungscampus‌ Modal

8 New results

The new results produced‌ by the team in 2025 can be grouped‌ into the following categories.

8.1 Scheduling in Data‌ Centers

Participants: Jonatha Anselmi, Bruno Gaujal,‌ Nicolas Gast.

Queuing theory is a general‌ modeling framework, originally developed by Erlang to model‌ the system of calls at the Copenhagen Telephone‌ Exchange Company, and which has later been extensively‌ used to optimize telecommunications, traffic, the design of‌ factories and shops, etc. It is particularly suited‌ to the modeling and optimization of the operation‌ of data-centers, clouds or HPC centers and has‌ lead to the development of a variety of‌ effective and low-cost scheduling strategies. We contribute to‌ this framework and extend it in relation with‌ recent online learning techniques as well as with characteristics of modern workload.‌

Non-Stationary Gradient Descent for‌ Optimal Auto-Scaling in Serverless‌‌ Platforms

To efficiently manage serverless computing platforms, a‌ key aspect is the‌ auto-scaling of services, i.e.,‌‌ the set of computational resources allocated to a‌ service adapts over time‌ as a function of‌‌ the traffic demand. The objective is to find‌ a compromise between user-perceived‌ performance and energy consumption.‌‌ In 3, we consider the scale-per-request auto-scaling‌ pattern and investigate how‌ many function instances (or‌‌ servers) should be spawned each time an unfortunate‌ job arrives, i.e., a‌ job that finds all‌‌ servers busy upon its arrival. We address this‌ problem by following a‌ stochastic optimization approach: we‌‌ develop a stochastic gradient descent scheme of the‌ Kiefer-Wolfowitz type that applies‌ over a single run‌‌ of the state evolution. At each iteration, the‌ proposed scheme computes an‌ estimate of the number‌‌ of servers to spawn each time an unfortunate‌ job arrives to minimize‌ some cost function. Under‌‌ natural assumptions, we show that the sequence of‌ estimates produced by our‌ scheme is asymptotically optimal‌‌ almost surely. In addition, we prove that its‌ convergence rate is $O‌ (n^{- 2‌‌ / 3})$ where $n$ is the number‌ of iterations.

From a‌ mathematical point of view,‌‌ the stochastic optimization framework induced by auto-scaling exhibits‌ non-standard aspects that we‌ approach from a general‌‌ point of view. We consider the setting where‌ a controller can only‌ get samples of the‌‌ transient – rather than stationary – behavior of‌ the underlying stochastic system.‌ To handle this difficulty,‌‌ we develop arguments that exploit properties of the‌ mixing time of the‌ underlying Markov chain. By‌‌ means of numerical simulations, we validate the proposed‌ approach and quantify its‌ gain with respect to‌‌ common existing scale-up rules.

Autoscaling in Serverless Platforms‌ via Online Learning with‌ Convergence Guarantees

As the‌‌ adoption of serverless computing platforms continue to grow,‌ designing autoscaling policies that‌ strike the right balance‌‌ between energy efficiency and user-perceived performance has become‌ a central challenge. In‌ 35, we propose‌‌ an online learning algorithm with theoretical convergence guarantees‌ that dynamically tunes control‌ parameters in a serverless‌‌ autoscaling environment. The proposed algorithm, grounded in stochastic‌ gradient descent, learns online-during‌ the actual operation of‌‌ the platform-the optimal values of three key control‌ parameters: (i) the target‌ stock size of prewarmed‌‌ (idle) functions, (ii) the threshold triggering provisioning actions,‌ and (iii) the expiration‌ rate of idle resources.‌‌ We prove that, under Markovian dynamics, the algorithm‌ converges to the parameter‌ set that minimizes a‌‌ cost function capturing the tradeoff between energy consumption‌ and response latency. In‌ addition, we demonstrate that‌‌ its structure naturally supports parallelization, significantly accelerating convergence.‌

Extensive numerical experiments show‌ that our method outperforms‌‌ existing baselines, including recent deep learning-based approaches, even‌ under non-Markovian settings-highlighting both‌ its robustness and practical‌‌ viability for next-generation serverless infrastructures.

Energy-Optimal Scheduling with‌ Variable Processing Speed: The‌ Role of Task Size‌‌ Variability

In 2,‌ we study the execution of a single task‌ with an unknown size on a server with‌ variable processing speed. Our goal is to analyze‌ structural properties of the optimal energy consumption under‌ the optimal speed profile that minimizes the expected‌ energy consumption while meeting a hard deadline constraint.‌ Specifically, we investigate how the task size probability‌ distribution impacts the overall energy.

Under mild assumptions,‌ our main result shows that the expected energy‌ consumption induced by the optimal speed profile preserves‌ the convex increasing order with respect to the‌ task size distribution. Then, we leverage this property‌ to derive simple bounds and conduct a worst-case‌ analysis. In particular, we derive a simple, general‌ formula for the energy gap induced by the‌ 'best' and 'worst' task size distributions, expressed in‌ terms of the support and expectation of the‌ task size.

Time-Constrained Energy Minimization for Online Execution‌ of a Stochastic DAG Task

In 30,‌ We study the problem of energy-efficient online execution‌ of a complex task on a server with‌ variable processing speed. The task consists of a‌ set of stochastic elementary jobs structured as a‌ Directed Acyclic Graph (DAG), where each job's execution‌ may reveal new information that influences future scheduling‌ decisions. Our objective is to determine an online‌ speed control policy that minimizes the expected energy‌ consumption while ensuring that the task completes before‌ a strict deadline. Leveraging tools from convex optimization,‌ the optimality principle, and backward induction, we derive‌ a structural characterization of the optimal policy. We‌ find that this is linked to a set‌ of second-order differential equations and exhibits a non-trivial‌ form. Building on this result, we develop a‌ discretization-based algorithm that efficiently approximates the optimal policy.‌ The proposed algorithm is provably asymptotically exact in‌ the discretization step and has computational complexity $O‌ (| A | + K N)‌$ , where $| A |$ and $K$ denote‌ the number of edges and vertices (i.e., jobs)‌ in the underlying DAG, and $N$ is the‌ discretization granularity. Our results offer a principled and‌ computationally efficient solution framework for online execution of‌ structured stochastic workloads under strict energy and timing‌ constraints.

8.2 Performance evaluation of Large Systems

Participants:‌ Nicolas Gast, Arnaud Legrand.

8.2.1 Experimental‌ practices and Simulation

Lowering entry barriers to developing‌ custom simulators of distributed applications and platforms with‌ SimGrid

Researchers in parallel and distributed computing (PDC)‌ often resort to simulation because experiments conducted using‌ a simulator can be for arbitrary experimental scenarios,‌ are less resource-, labor-, and time-consuming than their‌ real-world counterparts, and are perfectly repeatable and observable.‌ Many frameworks have been developed to ease the‌ development of PDC simulators, and these frameworks provide‌ different levels of accuracy, scalability, versatility, extensibility, and‌ usability. The SimGrid framework 39 has been used‌ by many PDC researchers to produce a wide‌ range of simulators for over two decades. Its‌ popularity is due to a large emphasis placed on accuracy, scalability, and‌ versatility, and is in‌ spite of shortcomings in‌‌ terms of extensibility and usability. Although SimGrid provides‌ sensible simulation models for‌ the common case, it‌‌ was difficult for users to extend these models‌ to meet domain-specific needs.‌ Furthermore, SimGrid only provided‌‌ relatively low-level simulation abstractions, making the implementation of‌ a simulator of a‌ complex system a labor-intensive‌‌ undertaking. In 6 we describe developments in the‌ last decade that have‌ contributed to vastly improving‌‌ extensibility and usability, thus lowering or removing entry‌ barriers for users to‌ develop custom SimGrid simulators.‌‌

Journée thématique du GDR RSD : Experimental Practices‌ in the Systems and‌ Networks Community

33 This‌‌ working group met on September 14, 2024, in‌ Paris to discuss experimental‌ practices in the Systems‌‌ and Networks communities. The purpose of this document‌ is fourfold:

To reflect‌ on the practices for‌‌ conducting experimental research in this community;
To share‌ the best practices we‌ have identified with the‌‌ community;
List available resources (apart from platforms, which‌ are the subject of‌ a separate workshop);
Propose‌‌ recommendations for disseminating these best practices and creating‌ an “experimental culture.”

8.2.2‌ Mean Field

As a‌‌ system of stochastically interacting entities grows large, the‌ size of its state-space‌ quickly explodes (curse of‌‌ dimensionality) and both its analysis and optimization become‌ intractable. Fortunately, symmetries and‌ regularities can be exploited,‌‌ in particular through the so-called Mean Field approximation,‌ which averages out the‌ state over degrees of‌‌ freedom. This technique from statistical physics is particularly‌ effective when studying computer‌ systems and over the‌‌ years, we have refined such approximation, proposed extensions,‌ and developed fine analysis‌ methods.

Accuracy of the‌‌ Graphon Mean Field Approximation for Interacting Particle Systems‌

In 1 we consider‌ a system of $N‌‌$ particles whose interactions are characterized by a (weighted)‌ graph $G^{N}$ .‌ Each particle is a‌‌ node of the graph with an internal state.‌ This state changes according‌ to Markovian dynamics that‌‌ depend on the states of neighboring particles. We‌ study the limiting properties‌ of the state dynamics,‌‌ focusing on the dense graph regime, in which‌ the average degree of‌ a node grows linearly‌‌ with $N$ . We show that, when ${G‌}^{N}$ converges to a‌ piecewise Lipschitz graphon $G‌‌$ , the behavior of the system converges to‌ a deterministic limit, the‌ graphon mean field approximation.‌‌ We obtain convergence rates depending on the system‌ size N and cut-norm‌ distance between $G^{N‌‌}$ and G. We apply these results for two‌ subcases: when $G^{N‌}$ is a discretization of‌‌ the graph $G$ with individually weighted edges; when‌ $G^{N}$ is a‌ random graph obtained by‌‌ sampling edges with probabilities obtained from $G$ .‌ In the case of‌ weighted interactions, we obtain‌‌ a bound of order $O (1 /‌ N)$ . In‌ the random graph case,‌‌ the error is of order $O (\sqrt{log‌ (N) /‌ N})$ with high‌‌ probability. We illustrate the‌ applicability of our results and the numerical efficiency‌ of the approximation through two examples: a graph-based‌ load-balancing model and a heterogeneous bike-sharing system.

8.3‌ Reinforcement Learning and MDP

Participants: Victor Boone,‌ Dorian Baudry, Nicolas Gast, Bruno Gaujal‌.

A limitation of the classical queuing theory‌ framework is that policies may require knowledge of‌ the system parameters whereas such parameters are rarely‌ known in advance, hence our interest in reinforcement‌ learning technique and Markov Decision Processes. Over the‌ years, we have developed analysis and modeling techniques‌ specifically suited to queuing systems whose state space‌ quickly explodes (learning in queues), which makes classical‌ RL approaches or results inappropriate.

Logarithmic regret of‌ exploration in average reward Markov decision processes

In‌ average reward Markov decision processes, state-of-the-art algorithms for‌ regret minimization follow a well-established framework: They are‌ model-based, optimistic and episodic. First, they maintain a‌ confidence region from which optimistic policies are computed‌ using a well-known subroutine called Extended Value Iteration‌ (EVI). Second, these policies are used over time‌ windows called episodes, each ended by the Doubling‌ Trick (DT) rule or a variant thereof. In‌ 13, without modifying EVI, we show that‌ there is a significant advantage in replacing (DT)‌ by another simple rule, that we call the‌ Vanishing Multiplicative (VM) rule. When managing episodes with‌ (VM), the algorithm's regret is, both in theory‌ and in practice, as good if not better‌ than with (DT), while the one-shot behavior is‌ greatly improved. More specifically, the management of bad‌ episodes (when sub-optimal policies are being used) is‌ much better under (VM) than (DT) by making‌ the regret of exploration logarithmic rather than linear.‌ These results are made possible by a new‌ in-depth understanding of the contrasting behaviors of confidence‌ regions during good and bad episodes.

8.4 Optimization‌ Techniques

Participants: Mathieu Besançon, Dorian Baudry,‌ Panayotis Mertikopoulos.

Optimization arises in almost every‌ domain of computer science and requires a variety‌ of techniques, to which we contribute both from‌ a theoretical and practical perspective.

8.4.1 Langevin Sampling‌ and Large Deviation Techniques

The global convergence of‌ stochastic gradient descent in non-convex landscapes: Sharp estimates‌ via large deviations

In 11, we examine‌ the time it takes for stochastic gradient descent‌ (SGD) to reach the global minimum of a‌ general, non-convex loss function. We approach this question‌ through the lens of randomly perturbed dynamical systems‌ and large deviations theory, and we provide a‌ tight characterization of the global convergence time of‌ SGD via matching upper and lower bounds. These‌ bounds are dominated by the most "costly" set‌ of obstacles that the algorithm may need to‌ overcome in order to reach a global minimizer‌ from a given initialization, coupling in this way‌ the global geometry of the underlying loss landscape‌ with the statistics of the noise entering the‌ process. Finally, motivated by applications to the training‌ of deep neural networks, we also provide a series of refinements and‌ extensions of our analysis‌ for loss functions with‌‌ shallow local minima.

Tamed Langevin sampling under weaker‌ conditions

Motivated by applications‌ to deep learning which‌‌ often fail standard Lipschitz smoothness requirements, we examine‌ in 22 the problem‌ of sampling from distributions‌‌ that are not log-concave and are only weakly‌ dissipative, with log-gradients allowed‌ to grow superlinearly at‌‌ infinity. In terms of structure, we only assume‌ that the target distribution‌ satisfies either a log-Sobolev‌‌ or a Poincaré inequality and a local Lipschitz‌ smoothness assumption with modulus‌ growing possibly polynomially at‌‌ infinity. This set of assumptions greatly exceeds the‌ operational limits of the‌ "vanilla" unadjusted Langevin algorithm‌‌ (ULA), making sampling from such distributions a highly‌ involved affair. To account‌ for this, we introduce‌‌ a taming scheme which is tailored to the‌ growth and decay properties‌ of the target distribution,‌‌ and we provide explicit non-asymptotic guarantees for the‌ proposed sampler in terms‌ of the Kullback-Leibler (KL)‌‌ divergence, total variation, and Wasserstein distance to the‌ target distribution.

Contractive kinetic‌ Langevin samplers beyond global‌‌ Lipschitz continuity

In 36, we examine the‌ problem of sampling from‌ log-concave distributions with (possibly)‌‌ superlinear gradient growth under kinetic (underdamped) Langevin algorithms.‌ Using a carefully tailored‌ taming scheme, we propose‌‌ two novel discretizations of the kinetic Langevin SDE,‌ and we show that‌ they are both contractive‌‌ and satisfy a log-Sobolev inequality. Building on this,‌ we establish a series‌ of non-asymptotic bounds in‌‌ 2-Wasserstein distance between the law reached by each‌ algorithm and the underlying‌ target measure.

8.4.2 Frank-Wolfe‌‌

Frank-Wolfe Algorithms: Sparsity Guarantees and an Application to‌ Robust Optimization

Frank-Wolfe (FW)‌ methods are a class‌‌ of nonlinear optimization algorithms over a compact constraint‌ set leveraging first-order information‌ of the objective and‌‌ linear optimization on the constraints. They have risen‌ in popularity in the‌ last decade for their‌‌ applications in operations research and learning. In 27‌, we first present‌ a recently proposed enhancement‌‌ of all Frank-Wolfe algorithms that ensure that the‌ iterates remain sparse, in‌ the sense that they‌‌ are formed as a convex combination of a‌ low number of vertices.‌ We then view an‌‌ application of FW to robust optimization. Sparsity Guarantees‌ for Frank-Wolfe. In a‌ first part, we present‌‌ recent progress on the pivoting framework that modifies‌ Frank-Wolfe variants, ensuring the‌ sparsity of the iterate.‌‌ We then introduce the so-called active set identification‌ property of some FW‌ variants and how they‌‌ can be leveraged to ensure sparser iterates in‌ a so-called pivoting framework.‌ Frank-Wolfe for robust optimization‌‌ under an oracle model. In the second part‌ of the talk, we‌ focus on recent applications‌‌ of FW or FW-inspired methods to tackle robust‌ optimization under an oracle‌ setting. We highlight how‌‌ different approaches from the literature are connected through‌ the lens of these‌ algorithms or are dual‌‌ of each other, including a recent simplicial decomposition,‌ a cutting plane, and‌ a FW approach with‌‌ Nesterov smoothing.

Improved algorithms‌ and novel applications of the FrankWolfe.jl library

Frank-Wolfe‌ (FW) algorithms have emerged as an essential class‌ of methods for constrained optimization, especially on large-scale‌ problems. In 4, we summarize the algorithmic‌ design choices and progress made in the last‌ years of the development of FrankWolfe.jl, a‌ Julia package gathering high-performance implementations of state-of-the-art FW‌ variants. We review key use cases of the‌ library in the recent literature, which match its‌ original dual purpose: first, becoming the de-facto toolbox‌ for practitioners applying FW methods to their problem,‌ and second, offering a modular ecosystem to algorithm‌ designers who experiment with their own variants and‌ implementations of algorithmic blocks. Finally, we demonstrate the‌ performance of several FW variants on important problem‌ classes in several experiments, which we curated in‌ a separate repository for continuous benchmarking.

Efficient Quadratic‌ Corrections for Frank-Wolfe Algorithms

In 16, we‌ develop a Frank-Wolfe algorithm with corrective steps, generalizing‌ previous algorithms including blended conditional gradients, blended pairwise‌ conditional gradients, and fully-corrective Frank-Wolfe. For this, we‌ prove tight convergence guarantees together with an optimal‌ face identification property. Furthermore, we propose two highly‌ efficient corrective steps for convex quadratic objectives based‌ on linear optimization or linear system solving, akin‌ to Wolfe's minimum-norm point, and show that they‌ converge in finite time under suitable conditions. Beyond‌ optimization problems that are directly quadratic, we revisit‌ two algorithms - split conditional gradient and second-order‌ conditional gradient sliding - which can leverage quadratic‌ corrections to accelerate their quadratic subproblems. We demonstrate‌ improved convergence rates for the first and broader‌ applicability for the second, which may be of‌ independent interest. Finally, we show substantial computational speedups‌ for Frank-Wolfe-based algorithms with quadratic corrections across the‌ considered problem classes.

Secant Line Search for Frank-Wolfe‌ Algorithms

In 17, we present a new‌ step-size strategy based on the secant method for‌ Frank-Wolfe algorithms. This strategy, which requires mild assumptions‌ about the function under consideration, can be applied‌ to any Frank-Wolfe algorithm. It is as effective‌ as full line search and, in particular, allows‌ for adapting to the local smoothness of the‌ function but comes with a significantly reduced computational‌ cost, leading to higher effective rates of convergence.‌ We provide theoretical guarantees and demonstrate the effectiveness‌ of the strategy through numerical experiments.

The Pivoting‌ Framework: Frank-Wolfe Algorithms with Active Set Size Control‌

In 25, we propose the pivoting meta‌ algorithm (PM) to enhance optimization algorithms that generate‌ iterates as convex combinations of vertices of a‌ feasible region $C \subseteq ℝ^{n}$ , including‌ Frank-Wolfe (FW) variants. PM guarantees that the active‌ set (the set of vertices in the convex‌ combination) of the modified algorithm remains as small‌ as $dim (C) + 1$ as‌ stipulated by Carathéodory’s theorem. PM achieves this by‌ reformulating the active set expansion task into an‌ equivalent linear program, which can be efficiently solved‌ using a single pivot step akin to the primal simplex algorithm; the‌ convergence rate of the‌ original algorithms are maintained.‌‌ Furthermore, we establish the connection between PM and‌ active set identification, in‌ particular showing under mild‌‌ assumptions that PM applied to the away-step Frank-Wolfe‌ algorithm (AFW) or the‌ blended pairwise Frank-Wolfe algorithm‌‌ (BPFW) bounds the active set size by the‌ dimension of the optimal‌ face plus 1. We‌‌ provide numerical experiments to illustrate practicality and efficacy‌ on active set size‌ reduction.

8.4.3 Mixed Integer‌‌ Optimization

Convex mixed-integer optimization with Frank–Wolfe methods

Mixed-integer‌ nonlinear optimization encompasses a‌ broad class of problems‌‌ that present both theoretical and computational challenges. We‌ propose in 8 a‌ new type of method‌‌ to solve these problems based on a branch-and-bound‌ algorithm with convex node‌ relaxations. These relaxations are‌‌ solved with a Frank-Wolfe algorithm over the convex‌ hull of mixed-integer feasible‌ points instead of the‌‌ continuous relaxation via calls to a mixed-integer linear‌ solver as the linear‌ minimization oracle. The proposed‌‌ method computes feasible solutions while working on a‌ single representation of the‌ polyhedral constraints, leveraging the‌‌ full extent of mixed-integer linear solvers without an‌ outer approximation scheme and‌ can exploit inexact solutions‌‌ of node subproblems.

The SCIP Optimization Suite 10.0‌

The SCIP Optimization Suite‌ provides a collection of‌‌ software packages for mathematical optimization, centered around the‌ constraint integer programming (CIP)‌ framework SCIP. 34 discusses‌‌ the enhancements and extensions included in SCIP Optimization‌ Suite 10.0. The updates‌ in SCIP 10.0 include‌‌ a new solving mode for exactly solving rational‌ mixed-integer linear programs, a‌ new presolver for detecting‌‌ implied integral variables, a novel cut-based conflict analysis‌ and separator for flower‌ inequalities, two new heuristics,‌‌ a novel tool for explaining infeasibility, a new‌ interface for nonlinear solvers‌ as well as improvements‌‌ in symmetry handling, branching strategies, and SCIP's Benders'‌ decomposition framework. SCIP Optimization‌ Suite 10.0 also includes‌‌ new and improved features in the the presolving‌ library PaPILO, the parallel‌ framework UG, and the‌‌ decomposition framework GCG. Moreover, the SCIP Optimization Suite‌ 10.0 contains MIP-DD, the‌ first open-source delta debugger‌‌ for mixed-integer programming solvers. These additions and enhancements‌ have resulted in an‌ overall performance improvement of‌‌ SCIP in terms of solving time, number of‌ nodes in the branch-and-bound‌ tree, as well as‌‌ the reliability of the solver.

8.4.4 Application to‌ various domains

Integrating Aggregated‌ Electric Vehicle Flexibilities in‌‌ Unit Commitment Models using Submodular Optimization

The Unit‌ Commitment (UC) problem consists‌ in controlling a large‌‌ fleet of heterogeneous electricity production units in order‌ to minimize the total‌ production cost while satisfying‌‌ consumer demand. Electric Vehicles (EVs) are used as‌ a source of flexibility‌ and are often aggregated‌‌ for problem tractability. In 32, we develop‌ a new approach to‌ integrate EV flexibilities in‌‌ the UC problem and exploit the generalized polymatroid‌ structure of aggregated flexibilities‌ of a large population‌‌ of users to develop an exact optimization algorithm,‌ combining a cutting-plane approach‌ and submodular optimization. We‌‌ show in particular that‌ the UC can be solved exactly in a‌ time which scales linearly, up to a logarithmic‌ factor, in the number of EV users when‌ each production unit is subject to convex constraints.‌ We illustrate our approach by solving a real‌ instance of a long-term UC problem, combining open-source‌ data of the European grid (European Resource Adequacy‌ Assessment project) and data originating from a survey‌ of user behavior of the French EV fleet.‌

Efficient Sparse Flow Decomposition Methods for RNA Multi-Assembly‌

Decomposing a flow on a Directed Acyclic Graph‌ (DAG) into a weighted sum of a small‌ number of paths is an essential task in‌ operations research and bioinformatics. This problem, referred to‌ as Sparse Flow Decomposition (SFD), has gained significant‌ interest, in particular for its application in RNA‌ transcript multi-assembly, the identification of the multiple transcripts‌ corresponding to a given gene and their relative‌ abundance. Several recent approaches cast SFD variants as‌ integer optimization problems, motivated by the NPhardness of‌ the formulations they consider. In 12, we‌ propose an alternative formulation of SFD as a‌ data fitting problem on the conic hull of‌ the flow polytope. By reformulating the problem on‌ the flow polytope for compactness and solving it‌ using specific variants of the Frank-Wolfe algorithm, we‌ obtain a method converging rapidly to the minimizer‌ of the chosen loss function while producing a‌ parsimonious decomposition. Our approach subsumes previous formulations of‌ SFD with exact and inexact flows and can‌ model different priors on the error distributions. Computational‌ experiments show that our method outperforms recent integer‌ optimization approaches in runtime, but is also highly‌ competitive in terms of reconstruction of the underlying‌ transcripts, despite not explicitly minimizing the solution cardinality.‌

Mixed-Integer Optimization for Loopless Flux Distributions in Metabolic‌ Networks

Constraint-based metabolic models can be used to‌ investigate the intracellular physiology of microorganisms. These models‌ couple genes to reactions, and typically seek to‌ predict metabolite fluxes that optimize some biologically important‌ metric. Classical techniques, like Flux Balance Analysis (FBA),‌ formulate the metabolism of a microbe as an‌ optimization problem where growth rate is maximized. While‌ FBA has found widespread use, it often leads‌ to thermodynamically infeasible solutions that contain internal cycles‌ (loops). To address this shortcoming, Loopless-Flux Balance Analysis‌ (ll-FBA) seeks to predict flux distributions that do‌ not contain these loops. ll-FBA is a disjunctive‌ program, usually reformulated as a mixed-integer program, and‌ is challenging to solve for biological models that‌ often contain thousands of reactions and metabolites. In‌ 24, we compare various reformulations of ll-FBA‌ and different solution approaches. Overall, the combinatorial Benders'‌ decomposition is the most promising of the tested‌ approaches with which we could solve most instances.‌ However, the model size and numerical instability pose‌ a challenge to the combinatorial Benders' method.

8.4.5‌ Optimization for Machine Learning

Many learning algorithms operate‌ in centralized way, which raises many practical issues‌ in terms of scalability, privacy, hence a high interest for designing efficient‌ distributed and federated machine‌ learning algorithms. Furthermore generally,‌‌ the optimization space is also quite particular, which‌ calls for specific regularization‌ techniques and optimization algorithms.‌‌

Non-stationary Bandit Convex Optimization: A Comprehensive Study

Bandit‌ Convex Optimization is a‌ fundamental class of sequential‌‌ decision-making problems, where the learner selects actions from‌ a continuous domain and‌ observes a loss (but‌‌ not its gradient) at only one point per‌ round. In 38,‌ we study this problem‌‌ in non-stationary environments, and aim to minimize the‌ regret under three standard‌ measures of non-stationarity: the‌‌ number of switches $S$ in the comparator sequence,‌ the total variation $Δ‌$ of the loss functions,‌‌ and the path-length $P$ of the comparator sequence.‌ We propose a polynomial-time‌ algorithm, Tilted Exponentially Weighted‌‌ Average with Sleeping Experts (TEWA-SE), which adapts the‌ sleeping experts framework from‌ online convex optimization to‌‌ the bandit setting. For strongly convex losses, we‌ prove that TEWA-SE is‌ minimax-optimal with respect to‌‌ known $S$ and $Δ$ by establishing matching upper‌ and lower bounds. By‌ equipping TEWA-SE with the‌‌ Bandit-over-Bandit framework, we extend our analysis to environments‌ with unknown non-stationarity measures.‌ For general convex losses,‌‌ we introduce a second algorithm, clipped Exploration by‌ Optimization (cExO), based on‌ exponential weights over a‌‌ discretized action space. While not polynomial-time computable, this‌ method achieves minimax-optimal regret‌ with respect to known‌‌ $S$ and $Δ$ , and improves on the‌ best existing bounds with‌ respect to $P$ .‌‌

Model Predictive Control is Almost Optimal for Restless‌ Bandit

In 15,‌ we consider the discrete‌‌ time infinite horizon average reward restless markovian bandit‌ (RMAB) problem. We propose‌ a model predictive control‌‌ based non-stationary policy with a rolling computational horizon‌ $τ$ . At each‌ time-slot, this policy solves‌‌ a τ horizon linear program whose first control‌ value is kept as‌ a control for the‌‌ RMAB. Our solution requires minimal assumptions and quantifies‌ the loss in optimality‌ in terms of $τ‌‌$ and the number of arms, $N$ . We‌ show that its sub-optimality‌ gap is $O (‌‌ 1 / \sqrt{N})$ in general, and $exp‌ (- Ω (‌ N))$ under‌‌ a local-stability condition. Our proof is based on‌ a framework from dynamic‌ control known as dissipativity‌‌. Our solution easy to implement and performs‌ very well in practice‌ when compared to the‌‌ state of the art. Further, both our solution‌ and our proof methodology‌ can easily be generalized‌‌ to more general constrained MDP settings and should‌ thus, be of great‌ interest to the burgeoning‌‌ RMAB community.

Does Stochastic Gradient really succeed for‌ Bandits?

Recent works of‌ Mei et al. have‌‌ deepened the theoretical understanding of the Stochastic Gradient‌ Bandit (SGB) policy, showing‌ that using a constant‌‌ learning rate guarantees asymptotic convergence to the optimal‌ policy, and that sufficiently‌ small learning rates can‌‌ yield logarithmic regret. However, whether logarithmic regret holds‌ beyond small learning rates‌ remains unclear. In 37‌‌, we take a‌ step towards characterizing the regret regimes of SGB‌ as a function of its learning rate. For‌ two-armed bandits, we identify a sharp threshold, scaling‌ with the suboptimality gap $Δ$ , below which‌ SGB achieves logarithmic regret on all instances, and‌ above which it can incur polynomial regret on‌ some instances. This result highlights the necessity of‌ knowing (or estimating) $Δ$ to ensure logarithmic regret‌ with a constant learning rate. For general $K‌$ -armed bandits, we further show the learning rate‌ must additionally scale inversely with $K$ to avoid‌ polynomial regret. We introduce novel techniques to derive‌ regret upper bounds for SGB, laying the groundwork‌ for future advances in the theory of gradient-based‌ bandit algorithms.

8.5 Learning in games

Participants: Davide‌ Legacci, Panayotis Mertikopoulos, Bary Pradelski.‌

Learning in games naturally occurs in situations where‌ the resources or the decision is distributed among‌ several agents or even in situations where a‌ centralized decision maker has to adapt to strategic‌ users. Yet, it is considerably more difficult than‌ in classical minimization games as the resulting equilibria‌ may be attractive or not and the dynamic‌ often exhibit cyclic behaviors. Understanding and characterizing the‌ geometry of such spaces is thus the key‌ to propose efficient algorithms. This line of work‌ has led to the defense of one PhD‌ thesis in 2025.

8.5.1 Learning in stochastic games‌

Learning in stochastic games

The thesis of Romain‌ Cravic 28 addresses two learning problems in two-player‌ zero-sum stochastic games. The first problem concerns a‌ learner who embodies one of the two players‌ and learns to play efficiently against an arbitrary‌ opponent via a simulator of the game, whose‌ internal parameters are unknown to the learner at‌ the beginning of the learning process. The second‌ problem concerns learning the Nash equilibrium of the‌ game through self-play, assuming perfect knowledge of the‌ considered model. For the first problem, we consider‌ games with perfect information, meaning that players fully‌ observe the current state of the game when‌ making decisions. In contrast, we do not make‌ this assumption for the second problem and instead‌ consider games with imperfect information, where players must‌ rely on the history of their past observations‌ to infer the current state of the game‌ and the information obtained by their opponent. For‌ each of these two problems, we propose an‌ algorithm that handles infinite-horizon games by linking them‌ to the finite-horizon case through the introduction of‌ a discount factor on the rewards produced by‌ the game over time. More precisely, we propose‌ the DONQ-learning algorithm for the first problem, which‌ we refer to as black-box learning in games‌ with perfect information, and the DOS-CFR algorithm for‌ the second problem, which we refer to here‌ as the solving of games with imperfect information.‌ In both cases, the theoretical guarantees established in‌ our analysis of these algorithms translate into probabilistic,‌ sublinear regret bounds in the time horizon T as it tends to‌ infinity. Finally, the thesis‌ includes an experimental component‌‌ consisting of developing an artificial intelligence capable of‌ playing efficiently in a‌ memory-bluff game: Robin Wood.‌‌

A Quadratic Speedup in Finding Nash Equilibria of‌ Quantum Zero-Sum Games

Recent‌ developments in domains such‌‌ as non-local games, quantum interactive proofs, and quantum‌ generative adversarial networks have‌ renewed interest in quantum‌‌ game theory and, specifically, quantum zero-sum games. Central‌ to classical game theory‌ is the efficient algorithmic‌‌ computation of Nash equilibria, which represent optimal strategies‌ for both players. In‌ 2008, Jain and Watrous‌‌ proposed the first classical algorithm for computing equilibria‌ in quantum zero-sum games‌ using the Matrix Multiplicative‌‌ Weight Updates (MMWU) method to achieve a convergence‌ rate of $O (‌ d / ϵ^{2‌‌})$ iterations to $ϵ$ -Nash equilibria in the‌ $4 d$ -dimensional spectraplex.‌ In 10, we‌‌ propose a hierarchy of quantum optimization algorithms that‌ generalize MMWU via an‌ extra-gradient mechanism. Notably, within‌‌ this proposed hierarchy, we introduce the Optimistic Matrix‌ Multiplicative Weights Update (OMMWU)‌ algorithm and establish its‌‌ average-iterate convergence complexity as $O (d /‌ ϵ)$ iterations to‌ $ϵ$ -Nash equilibria. This‌‌ quadratic speed-up relative to Jain and Watrous' original‌ algorithm sets a new‌ benchmark for computing $ϵ‌‌$ -Nash equilibria in quantum zero-sum games.

8.5.2 Efficient‌ Learning in Complex Landscape‌ Games

Characterizing the Convergence‌‌ of Game Dynamics via Potentialness

Understanding the convergence‌ landscape of multi-agent learning‌ is a fundamental problem‌‌ of great practical relevance in many applications of‌ artificial intelligence and machine‌ learning. While it is‌‌ known that learning dynamics converge to Nash equilibrium‌ in potential games, the‌ behavior of dynamics in‌‌ many important classes of games that do not‌ admit a potential is‌ poorly understood. To measure‌‌ how “close” a game is to being potential,‌ we consider in 5‌ a distance function, that‌‌ we call “potentialness”, and which relies on a‌ strategic decomposition of games‌ introduced by Candogan et‌‌ al. (2011). We introduce a numerical framework enabling‌ the computation of this‌ metric, which we use‌‌ to calculate the degree of “potentialness” in generic‌ matrix games, as well‌ as (non-generic) games that‌‌ are important in economic applications, namely auctions and‌ contests. Understanding learning in‌ the latter games has‌‌ become increasingly important due to the wide-spread automation‌ of bidding and pricing‌ with no-regret learning algorithms.‌‌ We empirically show that potentialness decreases and concentrates‌ with an increasing number‌ of agents or actions;‌‌ in addition, potentialness turns out to be a‌ good predictor for the‌ existence of pure Nash‌‌ equilibria and the convergence of no-regret learning algorithms‌ in matrix games. In‌ particular, we observe that‌‌ potentialness is very low for complete-information models of‌ the all-pay auction where‌ no pure Nash equilibrium‌‌ exists, and much higher for Tullock contests, first-,‌ and second-price auctions, explaining‌ the success of learning‌‌ in the latter. In the incomplete-information version of‌ the all-pay auction, a‌ pure Bayes-Nash equilibrium exists‌‌ and it can be‌ learned with gradient-based algorithms. Potentialness nicely characterizes these‌ differences to the complete-information version.

Efficient kernelized learning‌ in polyhedral games beyond full-information: From Colonel Blotto‌ to congestion games

In 18, we examine‌ the problem of efficiently learning coarse correlated equilibria‌ (CCE) in polyhedral games, that is, normal-form games‌ with an exponentially large number of actions per‌ player and an underlying combinatorial structure. Prominent examples‌ of such games are the classical Colonel Blotto‌ and congestion games. To achieve computational efficiency, the‌ learning algorithms must exhibit regret and per-iteration complexity‌ that scale polylogarithmically in the size of the‌ players’ action sets. This challenge has recently been‌ addressed in the full-information setting, primarily through the‌ use of kernelization. However, in the case of‌ the realistic, but mathematically challenging, partial-information setting, existing‌ approaches result in suboptimal and impractical runtime complexity‌ to learn CCE. We tackle this limitation by‌ building a framework based on the kernelization paradigm.‌ We apply this framework to prominent examples of‌ polyhedral games—namely the Colonel Blotto, graphic matroid and‌ network congestion games — and provide computationally efficient‌ payoff-based learning algorithms, which significantly improve upon prior‌ works in terms of the runtime for learning‌ CCE in these settings.

Invariance and concentration properties‌ of gradient-based learning in games

In 19,‌ we study the long-run behavior of learning in‌ strongly monotone games with stochastic, gradient-based feedback. For‌ concreteness, we focus on the stochastic projected gradient‌ (SPG) algorithm, and we examine the asymptotic distribution‌ of its iterates when the method is run‌ with constant step-size updates (the de facto choice‌ for practical deployments of the algorithm). In contrast‌ to variants of the method with a vanishing‌ step-size case, SPG with a constant step-size does‌ not converge: instead, it reaches a neighborhood of‌ the game's Nash equilibrium at an exponential rate,‌ and then, due to persistent noise, it fluctuates‌ in its vicinity without converging (occasionally moving away‌ on rare occasions). We provide a theoretical quantification‌ of this behavior by analyzing the Markovian structure‌ of the process. Namely, we show that, regardless‌ of the algorithm's initialization, the distribution of its‌ iterates converges at a geometric rate to a‌ unique invariant measure which is concentrated in a‌ neighborhood of the game's Nash equilibrium. More explicitly,‌ we quantify the degree of this concentration and‌ the rate of convergence of the algorithm's empirical‌ frequency of play to the invariant measure of‌ the process in Wasserstein distance, and we provide‌ explicit bounds in terms of the method's step-size,‌ the variance of the noise entering the process,‌ and the geometric features of the game's payoff‌ landscape.

8.5.3 Uncertainty and Robustness

The impact of‌ uncertainty on regularized learning in games

In 14‌, we investigate how randomness and uncertainty influence‌ learning in games. Specifically, we examine a perturbed‌ variant of the dynamics of "follow the regularized‌ leader" (FTRL), where the players' payoff observations and‌ strategy updates are continually impacted by random shocks. Our findings reveal that,‌ in a fairly precise‌ sense, "uncertainty favors extremes":‌‌ in any game, regardless of the noise level,‌ every player's trajectory of‌ play reaches an arbitrarily‌‌ small neighborhood of a pure strategy in finite‌ time (which we estimate).‌ Moreover, even if the‌‌ player does not ultimately settle at this strategy,‌ they return arbitrarily close‌ to some (possibly different)‌‌ pure strategy infinitely often. This prompts the question‌ of which sets of‌ pure strategies emerge as‌‌ robust predictions of learning under uncertainty. We show‌ that (a) the only‌ possible limits of the‌‌ FTRL dynamics under uncertainty are pure Nash equilibria;‌ and (b) a span‌ of pure strategies is‌‌ stable and attracting if and only if it‌ is closed under better‌ replies. Finally, we turn‌‌ to games where the deterministic dynamics are recurrent-such‌ as zero-sum games with‌ interior equilibria-and show that‌‌ randomness disrupts this behavior, causing the stochastic dynamics‌ to drift toward the‌ boundary on average.

Multi-agent‌‌ learning under uncertainty: Recurrence vs. concentration

In 20‌, we examine the‌ convergence landscape of multi-agent‌‌ learning under uncertainty. Specifically, we analyze two stochastic‌ models of regularized learning‌ in continuous games-one in‌‌ continuous and one in discrete time-with the aim‌ of characterizing the long-run‌ behavior of the induced‌‌ sequence of play. In stark contrast to deterministic,‌ full-information models of learning‌ (or models with a‌‌ vanishing learning rate), we show that the resulting‌ dynamics do not converge‌ in general. In lieu‌‌ of this, we ask instead which actions are‌ played more often in‌ the long run, and‌‌ by how much. We show that, in strongly‌ monotone games, the dynamics‌ of regularized learning may‌‌ wander away from equilibrium infinitely often, but they‌ always return to its‌ vicinity in finite time‌‌ (which we estimate), and their long-run distribution is‌ sharply concentrated around a‌ neighborhood thereof. We quantify‌‌ the degree of this concentration, and we show‌ that these favorable properties‌ may all break down‌‌ if the underlying game is not strongly monotone-underscoring‌ in this way the‌ limits of regularized learning‌‌ in the presence of persistent randomness and uncertainty.‌

Robust equilibria in continuous‌ games: From strategic to‌‌ dynamic robustness

In 21, we examine the‌ robustness of Nash equilibria‌ in continuous games, under‌‌ both strategic and dynamic uncertainty. Starting with the‌ former, we introduce the‌ notion of a robust‌‌ equilibrium as those equilibria that remain invariant to‌ small-but otherwise arbitrary-perturbations to‌ the game's payoff structure,‌‌ and we provide a crisp geometric characterization thereof.‌ Subsequently, we turn to‌ the question of dynamic‌‌ robustness, and we examine which equilibria may arise‌ as stable limit points‌ of the dynamics of‌‌ "follow the regularized leader" (FTRL) in the presence‌ of randomness and uncertainty.‌ Despite their very distinct‌‌ origins, we establish a structural correspondence between these‌ two notions of robustness:‌ strategic robustness implies dynamic‌‌ robustness, and, conversely, the requirement of strategic robustness‌ cannot be relaxed if‌ dynamic robustness is to‌‌ be maintained. Finally, we‌ examine the rate of convergence to robust equilibria‌ as a function of the underlying regularizer, and‌ we show that entropically regularized learning converges at‌ a geometric rate in games with affinely constrained‌ action spaces.

On the discrete-time origins of the‌ replicator dynamics: From convergence to instability and chaos‌

In 7, we consider three distinct discrete-time‌ models of learning and evolution in games: a‌ biological model based on intra-species selective pressure, the‌ dynamics induced by pairwise proportional imitation, and the‌ exponential / multiplicative weights (EW) algorithm for online‌ learning. Even though these models share the same‌ continuous-time limit – the replicator dynamics – we‌ show that second-order effects play a crucial role‌ and may lead to drastically different behaviors in‌ each model, even in very simple, symmetric $2‌ \times 2$ games. Specifically, we study the resulting‌ discrete-time dynamics in a class of parametrized congestion‌ games, and we show that (i) in the‌ biological model of intra-species competition, the dynamics remain‌ convergent for any parameter value; (ii) the dynamics‌ of pairwise proportional imitation exhibit an entire range‌ of behaviors for larger time steps and different‌ equilibrium configurations (stability, instability, and even Li-Yorke chaos);‌ while (iii) in the EW algorithm, increasing the‌ time step (almost) inevitably leads to chaos (again,‌ in the formal, Li-Yorke sense). This divergence of‌ behaviors comes in stark contrast to the globally‌ convergent behavior of the replicator dynamics, and serves‌ to delineate the extent to which the replicator‌ dynamics provide a useful predictor for the long-run‌ behavior of their discrete-time origins.

8.6 Random matrix‌ analysis and Machine Learning

Participants: Romain Couillet,‌ Hugo Lebeau, Victor Leger, Charles SejourneLeger‌.

Random matrix theory has recently proven to‌ be a very effective tool to understand Machine‌ Learning challenges. In particular, concentration results can be‌ used to derive more efficient and frugal algorithms.‌ This line of work has led to the‌ defense of one PhD thesis in 2025.

Random‌ Matrix and Tensor Models for Large Data Processing‌

The exponential growth in computing power has enabled‌ the widespread deployment of machine learning, which has‌ in turn given rise to new challenges in‌ data processing. The sheer volume of data now‌ being generated means that the standard statistical assumption‌ of a number of samples far greater than‌ their dimension is no longer tenable. In the‌ paradigm of the Big Data era, datasets are‌ typically of very large dimension and may also‌ comprise several modes, indicating a variety of sources,‌ modalities, domains, and so on. Furthermore, the advancement‌ of technologies required to develop models capable of‌ processing vast quantities of data results in significant‌ environmental and human costs. In light of these‌ concerns, it is imperative to promote a more‌ clever and prudent use of our resources.Random matrix‌ theory provides powerful tools to precisely study the‌ statistical and computational limitations associated with the processing‌ of large and multidimensional data. Through this lens, the thesis of Hugo‌ Lebeau 29 examines several‌ learning approaches to identify‌‌ the relevant parameters influencing the success of a‌ task and thereby facilitate‌ an informed use.

Firstly,‌‌ we examine an extension of spectral clustering to‌ data streams. This approach‌ enables the clustering of‌‌ a potentially very large dataset with a controlled‌ and limited memory usage.‌ Our findings demonstrate that,‌‌ with an astute management of the available memory,‌ it is possible to‌ achieve performance levels comparable‌‌ to those obtained without resource constraints.We then turn‌ our attention to the‌ computational limits to tensor‌‌ estimation, with a particular focus on low-rank approximation.‌ The study the reconstruction‌ performance of the truncated‌‌ MLSVD (which generalizes the concept of truncated SVD‌ to tensors) as well‌ as the HOOI algorithm‌‌ precisely describes the conditions required for the reconstruction‌ of a noisy signal‌ in multidimensional data. Additionally,‌‌ we utilize a similar approach to investigate the‌ multi-view clustering problem from‌ the perspective of a‌‌ rank-one tensor approximation. Our findings shed light on‌ and precisely quantifies the‌ pivotal role of view‌‌ informativeness in the quality of the estimation.Lastly, we‌ investigate the statistical limits‌ to tensor estimation by‌‌ introducing a matrix model associated to the maximum‌ likelihood problem. This approach‌ allows us to characterize‌‌ the reconstruction performance of the corresponding estimator through‌ a spectral analysis that‌ can be performed with‌‌ the standard tools of random matrix theory. To‌ illustrate this method, we‌ examine the best rank-one‌‌ approximation of a tensor, where a given proportion‌ of entries is randomly‌ removed to reduce its‌‌ memory cost. This allows us to quantify the‌ impact of such a‌ procedure on the quality‌‌ of the resulting estimate. Finally, we propose to‌ extend the presented method‌ to a more general‌‌ tensor estimation framework, which reveals attractive new challenges‌ for the study of‌ large random tensors.

A‌‌ Random Matrix Approach to Low-Multilinear-Rank Tensor Approximation

9‌ presents a comprehensive understanding‌ of the estimation of‌‌ a planted low-rank signal from a general spiked‌ tensor model near the‌ computational threshold. Relying on‌‌ standard tools from the theory of large random‌ matrices, we characterize the‌ large-dimensional spectral behavior of‌‌ the unfoldings of the data tensor and exhibit‌ relevant signal-to-noise ratios governing‌ the detectability of the‌‌ principal directions of the signal. These results allow‌ to accurately predict the‌ reconstruction performance of truncated‌‌ multilinear SVD (MLSVD) in the non-trivial regime. This‌ is particularly important since‌ it serves as an‌‌ initialization of the higher-order orthogonal iteration (HOOI) scheme,‌ whose convergence to the‌ best low-multilinear-rank approximation depends‌‌ entirely on its initialization. We give a sufficient‌ condition for the convergence‌ of HOOI and show‌‌ that the number of iterations before convergence tends‌ to 1 in the‌ large-dimensional limit.

Performance of‌‌ Rank-One Tensor Approximation on Incomplete Data

In 26‌, we are interested‌ in the estimation of‌‌ a rank-one tensor signal when only a portion‌ $ϵ$ of its noisy‌ observation is available. We‌‌ show that the study‌ of this problem can be reduced to that‌ of a random matrix model whose spectral analysis‌ gives access to the reconstruction performance. These results‌ shed light on and specify the loss of‌ performance induced by an artificial reduction of the‌ memory cost of a tensor via the deletion‌ of a random part of its entries.

8.7‌ Fairness and equity in digital (recommendation, advertising, persistent‌ storage) systems

Participants: Rémi Castera, Nicolas Gast‌, Mathieu Molina, Bary Pradelski.

The‌ general deployment of machine-learning systems in many domains‌ ranging from security to recommendation and advertising to‌ guide strategic decisions leads to an interesting line‌ of research from a game theory perspective. In‌ this context, fairness, discrimination, and privacy are particularly‌ important issues.

Prophet Inequalities: Competing with the Top‌ $ℓ$ Items is Easy

In 23, we‌ explore a prophet inequality problem, where the values‌ of a sequence of items are drawn i.i.d.‌ from some distribution, and an online decision maker‌ must select one item irrevocably. We establish that‌ ${CR}_{ℓ}$ the worst-case competitive ratio between the‌ expected optimal performance of an online decision maker‌ compared to that of a prophet who uses‌ the average of the top $ℓ$ items is‌ exactly the solution to an integral equation. This‌ quantity ${CR}_{ℓ}$ is larger than $1 -‌ e^{- ℓ}$ . This implies that the‌ bound converges exponentially fast to 1 as $ℓ‌$ grows. In particular for $ℓ = 2$ ,‌ $C R_{2} \approx 0.966$ ,‌ which is much closer to 1 than the‌ classical bound of $0 . 745$ for $ℓ‌ = 1$ . Additionally, we prove asymptotic lower‌ bounds for the competitive ratio of a more‌ general scenario, where the decision maker is permitted‌ to select $k$ items. This subsumes the $k‌$ multi-unit i.i.d. prophet problem and provides the current‌ best asymptotic guarantees, as well as enables broader‌ understanding in the more general framework. Finally, we‌ prove a tight asymptotic competitive ratio when only‌ static threshold policies are allowed.

9 Bilateral contracts‌ and grants with industry

Participants: Nicolas Gast.‌

Nicolas Gast participates to the "Defi EDF" and‌ is currently supervising a PhD student (Hélène Arvis)‌ via a "CIFRE" contract.

10 Partnerships and cooperations‌

10.1 International initiatives

10.1.1 Inria associate team not‌ involved in an IIL or an international program‌

AIRBA

Participants: Nicolas Gast.

Title:
AI for‌ restless bandits and its application
Duration:
2023 –‌ 2025
Coordinator:
Gupta Manu Kumar
Partners:
- Indian Institute‌ of Technology Roorkee Roorkee (Inde)
Inria contact:
Nicolas‌ Gast
Summary:
Multi-armed restless bandit problems (MARBPs) are‌ Markov decision process models for optimal dynamic priority‌ allocation to a collection of stochastic binary-action (active/passive)‌ projects evolving over time. Typical applications include maintenance‌ problems, in which a collection of agents must‌ be send to various objects subject to failures,‌ or stochastic scheduling problems. If MARBPs are in‌ general intractable, there exists efficient relaxation when the problems parameters are known.‌ The goal of this‌ project is to build‌‌ on recent progress on Reinforcement Learning to create‌ new tools to solve‌ this problem when the‌‌ parameters are unknown. Problems of interests are: how‌ to define online indices‌ and how to learn‌‌ them; What is the performance of such online‌ policies; Application of these‌ to real-life examples such‌‌ as machine-repairman problem of dynamic asset allocations.

10.2‌ International research visitors

10.2.1‌ Visits of international scientists‌‌

Other international visits to the team

We have‌ hosted Deborah Hendrych (Zuse‌ Institute Berlin) from July‌‌ 1st to August 1st 2025, Mohamed Ghannan (Zuse‌ Institute Berlin) in July,‌ and Abednego Kambale (Politecnico‌‌ di Milano) from April to June.

10.2.2 Visits‌ to international teams

Research‌ stays abroad

Panayotis Mertikopoulos‌‌ taught a two-week invited course at the recently‌ inaugurated Moroccan Center for‌ Game Theory (MCGT) at‌‌ UM6P, Rabat. As a result of this course,‌ R. Laraki (CNRS/UM6P) and‌ Panayotis Mertikopoulos will jointly‌‌ co-supervise the PhD of Omar Abbadi (co-tutelle between‌ UM6P and UGA)
Panayotis‌ Mertikopoulos spent three weeks‌‌ as a research visitor at the Archimedes AI‌ research center in Athens,‌ Greece

10.3 European initiatives‌‌

10.3.1 Horizon Europe

Panayotis Mertikopoulos participated in the‌ submission of the QU-METRIC‌ proposal to the call‌‌ HORIZON-EIC-2025-PATHFINDEROPEN. The proposal was unsuccessful; unclear if there‌ will be a follow-up‌ with the same consortium.‌‌

10.4 National initiatives

Projects indicated with a $☆‌$ are projects coordinated by‌ members of the POLARIS‌‌ team.

ANR

ANR REFINO (JCJC 2020-2025) $☆$

Refined‌ Mean Field Optimization[250K€]‌

REFINO is an ANR‌‌ starting grant (JCJC) coordinated by Nicolas Gast. The‌ main objective on this‌ project is to provide‌‌ an innovative framework for optimal control of stochastic‌ distributed agents. Restless bandit‌ allocation is one particular‌‌ example where the control that can be sent‌ to each arm is‌ restricted to an on/off‌‌ signal. The originality of this framework is the‌ use of refined mean‌ field approximation to develop‌‌ control heuristics that are asymptotically optimal as the‌ number of arms goes‌ to infinity and that‌‌ also have a better performance than existing heuristics‌ for a moderate number‌ of arms. As an‌‌ example, we will use this framework in the‌ context of smart grids,‌ to develop control policies‌‌ for distributed electric appliances.
ANR FAIRPLAY (JCJC 2021-2025)‌ $☆$

Fair algorithms via‌ game theory and sequential‌‌ learning[245K€]

FAIRPLAY is an ANR starting grant‌ (JCJC) coordinated by Patrick‌ Loiseau. Machine learning algorithms‌‌ are increasingly used to optimize decision making in‌ various areas, but this‌ can result in unacceptable‌‌ discrimination. The main objective of this project is‌ to propose an innovative‌ framework for the development‌‌ of learning algorithms that respect fairness constraints. While‌ the literature mostly focuses‌ on idealized settings, the‌‌ originality of this framework and central focus of‌ this project is the‌ use of game theory‌‌ and sequential learning methods to account for constraints‌ that appear in practical‌ applications: strategic and decentralized‌‌ aspects of the decisions‌ and the data provided and absence of knowledge‌ of certain parameters key to the fairness definition.‌

10.5 Regional initiatives

Participants: Bruno Gaujal.

MIAI‌ Cluster chair:
Fundamentals of Reinforcement Learning
- PI: Pierre‌ Gaillard and Bruno Gaujal
- Members: Nicolas Gast and‌ Jean-Philippe GAYON (UCA)
- The goal of a reinforcement‌ learning algorithm is to gather information about the‌ unknown system being explored by the learner to‌ better understand its dynamical properties and exploit them‌ to optimize its behavior. Whenever the learner has‌ an a priori offline information about the system,‌ it can leverage this knowledge to be more‌ efficient in learning its optimal behavior. This approach‌ is coined by the global concept of structured‌ learning.
  
  This leads us to the research question‌ that we want to tackle with the FunRL‌ project: How to design algorithms with optimal theoretical‌ guarantees that exploit a (known or unknown) structure‌ of the problem to solve? This question will‌ be developed in three directions. First, we will‌ tackle the online control of queueing networks, which‌ raises the important issue of stability and rarely‌ visited states. The as Markov decision processes (MDPs),‌ which are stochastic dynamical systems that can be‌ controlled. The main originality of this axe with‌ respect tothe others is that these dynamical systems‌ are constrained by the structure of the problem,‌ the challenge being to efficiently use our knowledge‌ of such a structure.Third we will study parametric‌ learning, where a learner adapts its policy to‌ a problem with a known structure but whose‌ parameters are unknown. This has applications to auto-scaling‌ problems in cloud computing, resource allocation, and sequential‌ decisions.

11 Dissemination

11.1 Promoting scientific activities

11.1.1‌ Scientific events: selection

Panayotis Mertikopoulos has been senior‌ area chair at NeurIPS 2025 and area chair‌ at ICML 2025.

Chair of conference program committees‌

Nicolas Gast is TPC co-chair (ACM SIGMETRICS 2026)‌

Member of the conference program committees

Jonatha Anselmi‌ has been a PC member of MASCOTS 2025‌.
Mathieu Besancon has been a PC member‌ of JuMP-dev 2025, JuliaCon Local Paris 2025‌, European Mixed Integer Programming Workshop 2025,‌ and AAAI 2026
Bruno Gaujal has been a‌ PC member of Sigmetrics, NeurIPS

Reviewer

Mathieu Besancon‌ has been a reviewer for Mathematical Programming and‌ Journal of Global Optimization.
Jonatha Anselmi has been‌ a reviewer for IEEE transactions on parallel and‌ distributed systems, IEEE transactions on cloud computing, Journal‌ of Applied probability, IEEE transactions on networking.

11.1.2‌ Journal

Member of the editorial boards

Nicolas Gast‌ is associate editor of the journals Performance Evaluation‌ and Stochastic Models.
Panayotis Mertikopoulos is managing‌ co-editor of the Open Journal of Mathematical Optimization‌ (OJMO) and associate editor at Mathematics of Operations‌ Research (MOR), the International Journal of Game Theory‌ (IJGT), the EURO Journal on Computational Optimization (EJCO),‌ and Operations Research Letters (ORL).

11.1.3 Invited talks‌

Arnaud Legrand was invited to give lectures and‌ keynotes on Reproducible Research and Open Science on the following occasions:
- Congrès‌ de la ROADEF, Champ-sur-Marne‌ (Feb. 2025)
- École thématique‌‌ Science Ouverte pour les SHS, Oléron (June‌ 2025)
- M1 students in‌ computer science at UGA‌‌ (Dec. 2025): Reproducible Research and Computer Science
- 1st‌ year Inria PhD students‌ (Nov. 2025)
Bruno Gaujal‌‌ and Nicolas Gast were invited to give a‌ tutorial on Stochastic bandits‌ at Performance 2025 (Amsterdam).‌‌
Bruno Gaujal was invited to a roundtable on‌ the future of performance‌ evaluation at Atelier Evaluation‌‌ des Performances (Toulouse).
Panayotis Mertikopoulos was invited to‌ present his work at‌ the following events:
- Invited‌‌ conference talk: 75 years of Nash equilibrium Oxford,‌ UK
- Invited workshop talk:‌ 3rd Paris Workshop on‌‌ Game Theory and Language
- Tutorial talk at the‌ 7th Annual Conference on‌ Learning for Dynamics &‌‌ Control
- Invited seminar talks at: LSE, MCGT

11.1.4‌ Leadership within the scientific‌ community

Mathieu Besancon is‌‌ a member of the MOBIDEC PEPR in the‌ ACME project.
Panayotis Mertikopoulos‌ is the scientific coordinator‌‌ of the PEPR IA projet ciblé (acceleration grant)‌ FOUNDRY: Foundations of robustness‌ and reliability in artificial‌‌ intelligence. The project has a total budget‌ of 5M€ and involves‌ five research teams across‌‌ France (POLARIS in Grenoble, the Inria teams SCOOL‌ and FAIRPLAY, Dauphine and‌ IMT in Paris, and‌‌ ENS Lyon).
Jean-Marc Vincent is a member of‌ the scientific committee of‌ the CIST.
Jean-Marc Vincent‌‌ is vice-head of the SIF, adjunct on teaching.‌ In this context he‌ is in charge of‌‌ the organization of the annual meeting on education‌ in computer science.

11.1.5‌ Research administration

Mathieu Besancon‌‌ has been a member of the hiring committee‌ for an Assistant Professor‌ at Univ. Clermont Auvergne.‌‌
Nicolas Gast is vice-head of the Labex EnergyAlps‌ that federates the community‌ working on electrical energy‌‌ in Grenoble.
Nicolas Gast is vice-head of the‌ école doctorale MSTII (the‌ doctoral school managing PhD‌‌ students in computer science and mathemathics at Univ.‌ Grenoble Alpes).
Bruno Gaujal‌ was president of the‌‌ hiring community for and Assistant Professor at Grenoble‌ INP-GI.
Arnaud Legrand is‌ a member of the‌‌ Section 6 of the CoNRS.
Arnaud Legrand is‌ head of the SRCPR‌ axis of the LIG‌‌ and a member of LIG bureau. He has‌ been particularly involved in‌ the HCERES evaluation process‌‌ of the LIG.
Arnaud Legrand h as been‌ a member of the‌ HCERES evaluation committee of‌‌ the IRIT.
Arnaud Legrand is a member of‌ Comité Scientique of the‌ Inria Grenoble.
Florence Perronin‌‌ is a member of the QVT team of‌ the LIG.
Jean-Marc Vincent‌ was a member of‌‌ the jury for Mcf hiring in Univ. Nancy.‌

11.2 Teaching - Supervision‌ - Juries - Educational‌‌ and pedagogical outreach

Jonatha Anselmi teaches Probabilités et‌ simulation (32h) and Évaluation‌ de performances (32h) at‌‌ PolyTech Grenoble.
Mathieu Besançon teaches half of the‌ Advanced Models and Methods‌ of Operations Research at‌‌ the M2 ORCO (UGA) and in the Master‌ de Mathématiques et Applications‌ (UGA).
Nicolas Gast teaches‌‌ the Reinforcement learning part‌ of the M2 course Mathematical foundations of machine‌ learning at the M2 MOSIG (Grenoble).
Bruno Gaujal‌ teaches Optimisation under uncertainties (18h) at the M2‌ ORCO (UGA).
Bruno Gaujal and Nicolas Gast teach‌ Markov Decision Process and Reinforcement Learning (32h in‌ total) at the M2 Info (ENS Lyon).
Nicolas‌ Gast is responsible of the course "Introduction au‌ Machine Learning" that is an optional course of‌ the third year or the "licence informatique" at‌ Univ. Grenoble Alpes.
Nicolas Gast is co-responsible of‌ the courses "MDP and reinforcement learning" (M2, Ens‌ Lyon), "Mathematical Foundations of machine learning"(M2, UGA), and‌ "Optimization under uncertainties" (M2, UGA).
Arnaud Legrand and‌ Jean-Marc Vincent teach the transversal Scientific Methodology and‌ Empirical Evaluation lecture (18h) at the M2 MOSIG‌ (UGA).
The 3rd edition of the MOOC of‌ Arnaud Legrand , K. Hinsen and C. Pouzat‌ on Reproducible Research: Methodological Principles for a Transparent‌ Science is still running. Over the 3 editions‌ (Oct.-Dec. 2018, Apr.-June 2019, March 2020 - end‌ of 2026), more than 25,600 persons have followed‌ this MOOC and about 3200 certificates of achievement‌ have been delivered (about 14% for the laste‌ session, which is very high for a MOOC).‌ More than half of participants are PhD students‌ and about 10% are undergraduates.
The 2nd edition‌ of the MOOC of Arnaud Legrand , K.‌ Hinsen and C. Pouzat on Reproducible Research II:‌ Practices and tools for managing computations and data‌ has been launched from May 2025 to October‌ 2025. It has has attracted 3,000 persons.
Florence‌ Perronin teaches Programming Languages in L1.
Florence Perronin‌ is a member of the conseil de perfectionnement‌ of the Mathematics license.
Jean-Marc Vincent is in‌ charge of the coordination of the training of‌ high school teachers in computer science (NSI) in‌ Grenoble.
Jean-Marc Vincent teaches Algorithms and Probabilities at‌ the L3, UGA.
Jean-Marc Vincent teaches Statistical Models‌ and Litterate Programming at the L3 MIAGE, UGA.‌
Jean-Marc Vincent teaches Mathematics for Computer Science at‌ the M1 MOSIG, UGA.
Jean-Marc Vincent participates to‌ the Histoire de l’informatique lecture at the ENSIMAG.‌
Panayotis Mertikopoulos organized and taught a PhD seminar‌ course on Game Theory at MCGT (Moroccan Center‌ for Game Theory (MCGT) at UM6P, Rabat).

11.2.1‌ Supervision

Bruno Gaujal is member of the CSI‌ of Vittorio Puricelli (Toulouse) and Alessia Rigonat (Paris).‌

11.2.2 Juries

Nicolas Gast was president of the‌ PhD jury of Bianca Marin Moreno (Univ. Grenoble‌ Alpes) entitled Apprentissage par renforcement convexe en ligne‌ et applications aux problèmes de gestion de l'énergie‌.
Nicolas Gast was reviewer for the PhD‌ of Thomas Le Corre (ENS PSL) entitled Distributed‌ control of flexible loads in power grids.‌
Bruno Gaujal was a member of the jury‌ of the PhD defense of Lucas Weber (Paris)‌ entitled Exploiting Partial System Knowledge in Reinforcement Learning‌ for Admission Control and Electricity Storage Optimization,‌ president of the jury of the PhD of‌ Andrea Fox (Avigon) entitled Reinforcement Learning for Resource Allocation in Edge/Fog Systems‌ , and reviewer for‌ the Phd of Maria‌‌ Cherifa (Saclay) entitled Dynamics and learning in some‌ onlineallocation problems and of‌ the PhD of Chiara‌‌ Mignacco (Saclay) entitled A mathematical study of policy‌ orchestration for reinforcement learning‌.
Bruno Gaujal was‌‌ president of the HDR jury of Pierre Gaillard‌ (Grenoble)
Arnaud Legrand was‌ a reviewer for the‌‌ PhD of Léo Cosseron (ÉNS Rennes): Time-accurate network‌ simulation interconnecting virtual machines‌ with hardware virtualization towards‌‌ stealth analysis.
Arnaud Legrand was president of‌ the PhD jury of‌ Eduardo Tomasi Ribeiro (Univ.‌‌ Grenoble Alpes): Single address space for 128-bit massively‌ parallel computers.

11.3‌ Popularization

11.3.1 Specific official‌‌ responsibilities in science outreach structures

Jean-Marc Vincent is‌ the head of the‌ evaluation committee for education‌‌ in AI of the MIAI chaire.

12 Scientific‌ production

12.1 Publications of‌ the year

International journals‌‌

1 articleS.Sebastian Allmeier and N.Nicolas‌ Gast. Accuracy of‌ the Graphon Mean Field‌‌ Approximation for Interacting Particle Systems.Stochastic Systems‌154December 2025‌, 273-290HAL DOI‌‌back to text
2 articleJ.Jonatha Anselmi‌ and B.Bruno Gaujal‌. Energy-Optimal Scheduling with‌‌ Variable Processing Speed: The Role of Task Size‌ Variability.Operations Research‌ Letters64December 2025‌‌, 107379HAL DOIback to text
3‌ articleJ.Jonatha Anselmi‌, B.Bruno Gaujal‌‌ and L.-S.Louis-Sébastien Rebuffi. Non-Stationary Gradient Descent‌ for Optimal Auto-Scaling in‌ Serverless Platforms.IEEE/ACM‌‌ Transactions on Networking2025, 1-14HAL DOI‌back to text
4‌ articleM.Mathieu Besançon‌‌, S.Sébastien Designolle, J.Jannis Halbey‌, D.Deborah Hendrych‌, D.Dominik Kuzinowicz‌‌, S.Sebastian Pokutta, H.Hannah Troppens‌, D.Daniel Viladrich‌ Herrmannsdoerfer and E.Elias‌‌ Wirth. Improved algorithms and novel applications of‌ the FrankWolfe.jl library.‌ACM Transactions on Mathematical‌‌ SoftwareSeptember 2025, 1-31HAL DOI back‌ to text
5 article‌M.Martin Bichler,‌‌ D.Davide Legacci, M.Matthias Oberlechner,‌ P.Panayotis Mertikopoulos and‌ B.Bary Pradelski.‌‌ Characterizing the Convergence of Game Dynamics via Potentialness‌.Transactions on Machine‌ Learning Research JournalMarch‌‌ 2025, 1-22HALback to text
6‌ articleH.Henri Casanova‌, A.Arnaud Giersch‌‌, A.Arnaud Legrand, M.Martin Quinson‌ and F.Frédéric Suter‌. Lowering entry barriers‌‌ to developing custom simulators of distributed applications and‌ platforms with SimGrid.‌Parallel Computing123March‌‌ 2025, 103125HALDOI back to text‌
7 articleF.Fryderyk‌ Falniowski and P.Panayotis‌‌ Mertikopoulos. On the discrete-time origins of the‌ replicator dynamics: From convergence‌ to instability and chaos‌‌.International Journal of Game Theory547‌February 2025HAL DOI‌back to text
8‌‌ articleD.Deborah Hendrych, H.Hannah Troppens‌, M.Mathieu Besançon‌ and S.Sebastian Pokutta‌‌. Convex mixed-integer optimization with Frank–Wolfe methods.‌Mathematical Programming ComputationJune‌ 2025, 1-49HAL‌‌DOI back to text‌
9 articleH.Hugo Lebeau, F.Florent‌ Chatelain and R.Romain Couillet. A Random‌ Matrix Approach to Low-Multilinear-Rank Tensor Approximation.Journal‌ of Machine Learning Research2672025,‌ 1-64HAL back to text
10 articleF.‌Francisca Vasconcelos, E.-V.Emmanouil-Vasileios Vlatakis-Gkaragkounis, P.‌Panayotis Mertikopoulos, G.Georgios Piliouras and M.‌ I.Michael I Jordan. A Quadratic Speedup‌ in Finding Nash Equilibria of Quantum Zero-Sum Games‌.Quantum9May 2025, 1737HAL‌DOI back to text

International peer-reviewed conferences

11‌ inproceedingsW.Waïss Azizian, F.Franck Iutzeler‌, J.Jérôme Malick and P.Panayotis Mertikopoulos‌. The global convergence of stochastic gradient descent‌ in non-convex landscapes: Sharp estimates via large deviations‌.Proceedings of the 42nd International Conference on‌ Machine LearningICML 2025 - 42nd International Conference‌ on Machine LearningVancouver, Canada2025, 1-67‌HAL back to text
12 inproceedingsM.Mathieu‌ Besançon. Efficient Sparse Flow Decomposition Methods for‌ RNA Multi-Assembly.ECAI 2025 - 28th European‌ Conference on Artificial IntelligenceECAI 2025 - 28th‌ European Conference on Artificial IntelligenceBologna, Italy2025‌, 1-8HAL back to text
13 inproceedings‌V.Victor Boone and B.Bruno Gaujal.‌ Logarithmic regret of exploration in average reward Markov‌ decision processes.Proceedings of Thirty Eighth Conference‌ on Learning TheoryProceedings of Thirty Eighth Conference‌ on Learning Theory291291Lyon, France2025‌, 454--533HAL back to text
14 inproceedings‌P.-L.Pierre-Louis Cauvin, D.Davide Legacci and‌ P.Panayotis Mertikopoulos. The impact of uncertainty‌ on regularized learning in games.Proceedings of‌ the 42nd International Conference on Machine LearningICML‌ 2025 - 42nd International Conference on Machine Learning‌Vancouver, Canada2025, 1-50HAL back to‌ text
15 inproceedingsN.Nicolas Gast and D.‌Dheeraj Narasimha. Model Predictive Control is Almost‌ Optimal for Restless Bandit.COLT 2025 -‌ 38th Annual Conference on Learning TheoryLyon, France‌2025, 1-35HALDOI back to text‌
16 inproceedingsJ.Jannis Halbey, S.Seta‌ Rakotomandimby, M.Mathieu Besançon, S.Sébastien‌ Designolle and S.Sebastian Pokutta. Efficient Quadratic‌ Corrections for Frank-Wolfe Algorithms.NeurIPSSan Diego,‌ United StatesDecember 2025HAL back to text‌
17 inproceedingsD.Deborah Hendrych, S.Sebastian‌ Pokutta, M.Mathieu Besançon and D.David‌ Martínez-Rubio. Secant Line Search for Frank-Wolfe Algorithms‌.Proceedings of Machine Learning ResearchICML 2025‌ - Forty-Second International Conference on Machine LearningVancouver‌ (BC), CanadaJuly 2025, 1-22HAL back‌ to text
18 inproceedingsA.Andreas Kontogiannis,‌ V.Vasilis Pollatos, G.Gabriele Farina,‌ P.Panayotis Mertikopoulos and I.Ioannis Panageas.‌ Efficient kernelized learning in polyhedral games beyond full-information:‌ From Colonel Blotto to congestion games.NeurIPS‌ 2025 - Proceedings of the 39th International Conference‌ on Neural Information Processing Systems, 2025San Diego,‌ United StatesSeptember 2025HAL back to text
19 inproceedingsK.Kyriakos‌ Lotidis, P.Panayotis‌ Mertikopoulos, N.Nicholas‌‌ Bambos and J.Jose Blanchet. Invariance and‌ concentration properties of gradient-based‌ learning in games.‌‌CDC 2025 - 64th IEEE Annual Conference on‌ Decision and ControlRio‌ de Janeiro, BrazilIEEE‌‌2025, 1-8HALback to text
20‌ inproceedingsK.Kyriakos Lotidis‌, P.Panayotis Mertikopoulos‌‌, N.Nicholas Bambos and J.Jose Blanchet‌. Multi-agent learning under‌ uncertainty: Recurrence vs. concentration‌‌.NeurIPS 2025 - Proceedings of the 39th‌ International Conference on Neural‌ Information Processing Systems, 2025‌‌San Diego, United StatesDecember 2025HAL back‌ to text back to‌ text
21 inproceedingsK.‌‌Kyriakos Lotidis, P.Panayotis Mertikopoulos, N.‌Nicholas Bambos and J.‌Jose Blanchet. Robust‌‌ equilibria in continuous games: From strategic to dynamic‌ robustness.NeurIPS 2025‌ - Proceedings of the‌‌ 39th International Conference on Neural Information Processing Systems,‌ 2025San Diego, United‌ States2025, 1-33‌‌HAL back to text
22 inproceedingsI.Iosif‌ Lytras and P.Panayotis‌ Mertikopoulos. Tamed Langevin‌‌ sampling under weaker conditions.AISTATS 2025 -‌ 28th International Conference on‌ Artificial Intelligence and Statistics‌‌Mai Khao, Thailand2025, 1-32HAL back‌ to text
23 inproceedings‌M.Mathieu Molina,‌‌ N.Nicolas Gast, P.Patrick Loiseau and‌ V.Vianney Perchet.‌ Prophet Inequalities: Competing with‌‌ the Top Items is Easy.SODA 2025‌ - ACM-SIAM Symposium on‌ Discrete AlgorithmsNew Orleans,‌‌ United StatesACM2025, 1-45HAL back‌ to text
24 inproceedings‌H.Hannah Troppens,‌‌ M.Mathieu Besançon, S. E.St. Elmo‌ Wilken and S.Sebastian‌ Pokutta. Mixed-Integer Optimization‌‌ for Loopless Flux Distributions in Metabolic Networks.‌23rd International Symposium on‌ Experimental Algorithms (SEA 2025)‌‌SEA 2025 - 23rd International Symposium on Experimental‌ Algorithms338Venice, Italy‌Schloss Dagstuhl – Leibniz-Zentrum‌‌ für Informatik2025HALDOI back to text‌
25 inproceedingsE.Elias‌ Wirth, M.Mathieu‌‌ Besançon and S.Sebastian Pokutta. The Pivoting‌ Framework: Frank-Wolfe Algorithms with‌ Active Set Size Control‌‌.Proceedings of Machine Learning ResearchAISTATS 2025‌ - 28th International Conference‌ on Artificial Intelligence and‌‌ StatisticsMai Khao, ThailandMay 2025, 1-24‌HAL back to text‌

National peer-reviewed Conferences

26‌‌ inproceedingsH.Hugo Lebeau. Performance of Rank-One‌ Tensor Approximation on Incomplete‌ Data.GRETSI 2025‌‌ – XXXème Colloque Francophone de Traitement du Signal‌ et des ImagesStrasbourg,‌ France2025, 1-4‌‌HAL back to text

Conferences without proceedings

27‌ inproceedingsM.Mathieu Besançon‌. Frank-Wolfe Algorithms: Sparsity‌‌ Guarantees and an Application to Robust Optimization.‌ROADEF 2025 - 26ème‌ Congrès annuel de la‌‌ Société française de recherche opérationnelle et d'aide à‌ la décisionChamps-Sur-Marne, France‌February 2025, 1-1‌‌HAL back to text

Doctoral dissertations and habilitation‌ theses

28 thesisR.‌Romain Cravic. Learning‌‌ in stochastic games.UGA - Université Grenoble‌ AlpesNovember 2025HAL‌back to text back‌‌ to text
29 thesis‌H.Hugo Lebeau. Random Matrix and Tensor‌ Models for Large Data Processing.Université Grenoble‌ Alpes [2020-....]March 2025HAL back to text‌back to text

Reports & preprints

30 misc‌J.Jonatha ANSELMI, B.Bruno Gaujal and‌ K.Karl Gottlieb. Time-Constrained Energy Minimization for‌ Online Execution of a Stochastic DAG Task.‌2025HAL back to text
31 reportN.‌Nathalie Appel, J.Jérémie Bourdon, N.‌Nicolas Bousquet, J.Johanne Cohen, A.‌Antoine Genitrini, P.Patricia Georgeon, Y.‌Yves Grandvalet, K.Katia Jaffrès-Runser, A.‌Arnaud Legrand, D.Damian Markham, A.‌Anca Muscholl, A.Anastasia Paparrizou, L.‌Loïc Paulevé, M.Michael Poss, M.‌Maria Gradinariu Potop-Butucaru, J.-F.Jean-Florent Raymond,‌ R.Romain Rouvoy, Y.Yolande Sallent,‌ P.Pierre Senellart, T.Thomas Seiller,‌ Y.-Q.Ye-Qiong Song, A.Alain Tchana and‌ H.Hélène Waeselynck. Section 06 Sciences de‌ l’information : fondements de l’informatique, calculs, algorithmes, représentations,‌ exploitations: Rapport de conjoncture 2024.CNRS2025‌, 1-20HAL
32 miscH.Hélène Arvis‌, O.Olivier Beaude, N.Nicolas Gast‌, S.Stéphane Gaubert and B.Bruno Gaujal‌. Integrating Aggregated Electric Vehicle Flexibilities in Unit‌ Commitment Models using Submodular Optimization.November 2025‌HAL back to text
33 miscM.Mathieu‌ Bacou, D.David Beserra, E.Eugen‌ Dedu, L.Loïc Desgeorges, D.Didier‌ Donsez, A.Alexandre Guitton, B.Baptiste‌ Jonglez, A.Arnaud Legrand, G.Georgios‌ Papadopoulos, O.Olivier Richard, S.Samir‌ Si-Mohammed, N.Nina Tamdrari and F.Fabrice‌ Theoleyre. Journée thématique du GDR RSD :‌ pratiques expérimentales de la communauté systèmes et réseaux‌.January 2025HALback to text
34‌ reportC.Christopher Hojny, M.Mathieu Besançon‌, K.Ksenia Bestuzheva, S.Sander Borst‌, A.Antonia Chmiela, J.João Dionísio‌, J.Johannes Ehls, L.Leon Eifler‌, M.Mohammed Ghannam, A.Ambros Gleixner‌, A.Adrian Göß, A.Alexander Hoen‌, J.Jacob von Holly-Ponientzietz, R.Rolf‌ van der Hulst, D.Dominik Kamp,‌ T.Thorsten Koch, K.Kevin Kofler,‌ J.Jurgen Lentz, M.Marco Lübbecke,‌ S. J.Stephen J. Maher, P. M.‌Paul Matti Meinhold, G.Gioni Mexi,‌ T.Til Mohr, E.Erik Mühmer,‌ K. K.Krunal Kishor Patel, M. E.‌Marc E. Pfetsch, S.Sebastian Pokutta,‌ C. R.Chantal Reinartz Groba, F.Felipe‌ Serrano, Y.Yuji Shinano, M.Mark‌ Turner, S.Stefan Vigerske, M.Matthias‌ Walter, D.Dieter Weninger and L.Liding‌ Xu. The SCIP Optimization Suite 10.0.‌Zuse Institute BerlinNovember 2025HAL back to‌ text
35 miscA. W.Abednego Wamuhindo Kambale‌, J.Jonatha ANSELMI, D.Danilo Ardagna and B.Bruno Gaujal‌. Autoscaling in Serverless‌ Platforms via Online Learning‌‌ with Convergence Guarantees.2025HAL back to‌ text
36 miscI.‌Iosif Lytras and P.‌‌Panayotis Mertikopoulos. Contractive kinetic Langevin samplers beyond‌ global Lipschitz continuity.‌September 2025HAL back‌‌ to text

Other scientific publications

37 inproceedings D.‌Dorian Baudry, E.‌Emmeran Johnson, S.‌‌Simon Vary, C.Ciara Pike-Burke and P.‌Patrick Rebeschini. Does‌ Stochastic Gradient really succeed‌‌ for Bandits? NeurIPS 2025 - 39th Conference on‌ Neural Information Processing Systems‌ San Diego (Californie -‌‌ EU), United States December 2025 HAL back to‌ text back to text‌
38 inproceedingsX.Xiaoqi‌‌ Liu, D.Dorian Baudry, J.Julian‌ Zimmert, P.Patrick‌ Rebeschini and A.Arya‌‌ Akhavan. Non-stationary Bandit Convex Optimization: A Comprehensive‌ Study.NeurIPS 2025‌ - 39th Conference on‌‌ Neural Information Processing SystemsSan Diego (Californie -‌ EU), United StatesDecember‌ 2025HAL back to‌‌ text

Software

39 softwareH.Henri Casanova,‌ A.Augustin Degomme,‌ A.Arnaud Giersch,‌‌ A.Arnaud Legrand, M.Martin Quinson and‌ F.Frédéric Suter.‌ SimGrid.4.0April‌‌ 2025 lic: GNU Lesser General Public License v2.1‌ or later.HAL‌Software Heritage VCS back‌‌ to text

12.2 Cited publications

40 inproceedingsA.‌Athanasios Andreou, M.‌Marcio Silva, F.‌‌Fabrício Benevenuto, O.Oana Goga, P.‌Patrick Loiseau and A.‌Alan Mislove. Measuring‌‌ the Facebook Advertising Ecosystem.NDSS 2019 -‌ Proceedings of the Network‌ and Distributed System Security‌‌ SymposiumSan Diego, United StatesFebruary 2019,‌ 1-15HAL DOI back‌ to text
41 inproceedings‌‌A.Athanasios Andreou, G.Giridhari Venkatadri,‌ O.Oana Goga,‌ K. P.Krishna P‌‌ Gummadi, P.Patrick Loiseau and A.Alan‌ Mislove. Investigating Ad‌ Transparency Mechanisms in Social‌‌ Media: A Case Study of Facebook's Explanations.‌NDSS 2018 - Network‌ and Distributed System Security‌‌ SymposiumSan Diego, United StatesFebruary 2018,‌ 1-15HAL DOI back‌ to text
42 article‌‌J.Jonatha Anselmi. Combining Size-Based Load Balancing‌ with Round-Robin for Scalable‌ Low Latency.IEEE‌‌ Transactions on Parallel and Distributed Systems2019,‌ 1-3HAL DOI back‌ to text back to‌‌ text
43 articleJ.Jonatha Anselmi and J.‌Josu Doncel. Asymptotically‌ Optimal Size-Interval Task Assignments‌‌.IEEE Transactions on Parallel and Distributed Systems‌30112019,‌ 2422-2433HAL DOI back‌‌ to text back to text
44 articleJ.‌Jonatha Anselmi and F.‌François Dufour. Power-of-d-Choices‌‌ with Memory: Fluid Limit and Optimality.Mathematics‌ of Operations Research45‌32020, 862-888‌‌HAL DOI back to text back to text‌
45 inproceedingsR. M.‌Rosa M. Badia,‌‌ J.Jesús Labarta, J.Judit Giménez and‌ F.Francesc Escalé.‌ Dimemas: Predicting MPI Applications‌‌ Behaviour in Grid Environments.Proc. of the‌ Workshop on Grid Applications‌ and Programming Tools2003‌‌back to text
46‌ conferenceS.Swen Böhm and C.Christian Engelmann‌. xSim: The Extreme-Scale Simulator.HPCSIstanbul,‌ Turkey2011back to text
47 inproceedingsP.‌Pedro Bruel, S.Steven Quinito Masnada,‌ B.Brice Videau, A.Arnaud Legrand,‌ J.-M.Jean-Marc Vincent and A.Alfredo Goldman.‌ Autotuning under Tight Budget Constraints: A Transparent Design‌ of Experiments Approach.CCGrid 2019 - International‌ Symposium in Cluster, Cloud, and Grid ComputingLarcana,‌ CyprusIEEEMay 2019, 1-10HAL DOI‌back to text
48 incollectionH.Holger Brunst‌, D.Daniel Hackenberg, G.Guido Juckeland‌ and H.Heide Rohling. Comprehensive Performance Tracking‌ with VAMPIR 7.Tools for High Performance‌ Computing 2009The paper details the latest improvements‌ in the Vampir visualization tool.Springer Berlin Heidelberg‌2010DOI back to text
49 articleP.‌Pierre COUCHENEY, B.Bruno Gaujal and P.‌Panayotis Mertikopoulos. Penalty-Regulated Dynamics and Robust Learning‌ Procedures in Games.Mathematics of Operations Research‌4032015, 611-633HAL DOI back‌ to text
50 articleG.Giuliano Casale and‌ N.Nicolas Gast. Performance analysis methods for‌ list-based caches with non-uniform access.IEEE/ACM Transactions‌ on NetworkingDecember 2020, 1-18HAL DOI‌back to text
51 inproceedingsA.Abhijnan Chakraborty‌, G. K.Gourab K Patro, N.‌Niloy Ganguly, K. P.Krishna P Gummadi‌ and P.Patrick Loiseau. Equality of Voice:‌ Towards Fair Representation in Crowdsourced Top-K Recommendations.‌FAT* 2019 - ACM Conference on Fairness, Accountability,‌ and TransparencyProceedings of the ACM Conference on‌ Fairness, Accountability, and Transparency (FAT*)Atlanta, United States‌ACMJanuary 2019, 129-138HAL DOI back‌ to text
52 inproceedingsT.Tom Cornebize,‌ A.Arnaud Legrand and F. C.Franz C‌ Heinrich. Fast and Faithful Performance Prediction of‌ MPI Applications: the HPL Case Study.2019‌ IEEE International Conference on Cluster Computing (CLUSTER)Albuquerque,‌ United StatesSeptember 2019HAL DOI back to‌ text
53 articleT.Tom Cornebize and A.‌Arnaud Legrand. Simulation-based Optimization and Sensibility Analysis‌ of MPI Applications: Variability Matters.Journal of‌ Parallel and Distributed ComputingApril 2022HAL DOI‌back to text
54 articleA.Augustin Degomme‌, A.Arnaud Legrand, G.Georges Markomanolis‌, M.Martin Quinson, M. L.Mark‌ Lee Stillwell and F.Frédéric Suter. Simulating‌ MPI applications: the SMPI approach.IEEE Transactions‌ on Parallel and Distributed Systems288August‌ 2017, 14HALDOI back to text‌
55 inproceedingsB.Bruno Donassolo, I.Ilhem‌ Fajjari, A.Arnaud Legrand and P.Panayotis‌ Mertikopoulos. Load Aware Provisioning of IoT Services‌ on Fog Computing Platform.IEEE International Conference‌ on Communications (ICC)Shanghai, ChinaIEEEMay 2019‌HAL DOI back to text
56 articleB.‌Bruno Donassolo, A.Arnaud Legrand, P.‌Panayotis Mertikopoulos and I.Ilhem Fajjari. Online‌ Reconfiguration of IoT Applications in the Fog: The Information-Coordination Trade-off.IEEE‌ Transactions on Parallel and‌ Distributed Systems335‌‌2022, 1156-1172HALDOI back to text‌
57 inproceedings J.Josu‌ Doncel, N.Nicolas‌‌ Gast and B.Bruno Gaujal. Are mean-field‌ games the limits of‌ finite stochastic games? The‌‌ 18th Workshop on MAthematical performance Modeling and Analysis‌ Nice, France June 2016‌ HAL back to text‌‌
58 articleJ.Josu Doncel, N.Nicolas‌ Gast and B.Bruno‌ Gaujal. Discrete Mean‌‌ Field Games: Existence of Equilibria and Convergence.‌Journal of Dynamics and‌ Games632019‌‌, 1-19HAL DOIback to text
59‌ inproceedingsV.Vitalii Emelianov‌, G.George Arvanitakis‌‌, N.Nicolas Gast, K. P.Krishna‌ P Gummadi and P.‌Patrick Loiseau. The‌‌ Price of Local Fairness in Multistage Selection.‌IJCAI-2019 - Twenty-Eighth International‌ Joint Conference on Artificial‌‌ IntelligenceMacao, FranceInternational Joint Conferences on Artificial‌ Intelligence OrganizationAugust 2019‌, 5836-5842HAL DOI‌‌back to text
60 inproceedingsV.Vitalii Emelianov‌, N.Nicolas Gast‌, K. P.Krishna‌‌ P. Gummadi and P.Patrick Loiseau. On‌ Fair Selection in the‌ Presence of Implicit Variance‌‌.EC 2020 - Twenty-First ACM Conference on‌ Economics and ComputationBudapest,‌ HungaryACMJuly 2020‌‌, 649–675HAL DOIback to text
61‌ inproceedingsL.Lampros Flokas‌, E. V.Emmanouil‌‌ V Vlatakis-Gkaragkounis, T.Thanasis Lianeas, P.‌Panayotis Mertikopoulos and G.‌Georgios Piliouras. No-regret‌‌ learning and mixed Nash equilibria: They do not‌ mix.NeurIPS 2020‌ - 34th International Conference‌‌ on Neural Information Processing SystemsVancouver, CanadaDecember‌ 2020, 1-24HAL‌back to text
62‌‌ articleV.Vinicius Garcia Pinto, L. M.‌Lucas Mello Schnorr,‌ L.Luka Stanisic,‌‌ A.Arnaud Legrand, S.Samuel Thibault and‌ V.Vincent Danjean.‌ A Visual Performance Analysis‌‌ Framework for Task-based Parallel Applications running on Hybrid‌ Clusters.Concurrency and‌ Computation: Practice and Experience‌‌3018April 2018, 1-31HAL DOI‌back to text back‌ to text
63 article‌‌N.Nicolas Gast, L.Luca Bortolussi and‌ M.Mirco Tribastone.‌ Size Expansions of Mean‌‌ Field Approximation: Transient and Steady-State Analysis.Performance‌ Evaluation2018, 1-15‌HAL back to text‌‌
64 inproceedingsN.Nicolas Gast. Expected Values‌ Estimated via Mean-Field Approximation‌ are $1 / N‌‌$ -Accurate.ACM SIGMETRICS / International Conference on‌ Measurement and Modeling of‌ Computer Systems , SIGMETRICS‌‌ '171ACM SIGMETRICS / International Conference on‌ Measurement and Modeling of‌ Computer Systems , SIGMETRICS‌‌ '17Urbana-Champaign, United StatesJune 2017, 26‌HAL DOI back to‌ text
65 article N.‌‌Nicolas Gast, B.Bruno Gaujal and K.‌Kimang Khun. Learning‌ algorithms for Markovian Bandits:‌‌ Is Posterior Sampling more Scalable than Optimism? Transactions‌ on Machine Learning Research‌ Journal November 2022 HAL‌‌ back to text
66 unpublishedN.Nicolas Gast‌, B.Bruno Gaujal‌ and C.Chen Yan‌‌. Exponential Convergence Rate‌ for the Asymptotic Optimality of Whittle Index Policy‌.December 2020, HAL back to text‌
67 articleN.Nicolas Gast, B.Bruno‌ Gaujal and C.Chen Yan. LP-based policies‌ for restless bandits: necessary and sufficient conditions for‌ (exponentially fast) asymptotic optimality.Mathematics of Operations‌ ResearchDecember 2023HALDOI back to text‌
68 inproceedingsN.Nicolas Gast and B. V.‌Benny Van Houdt. A Refined Mean Field‌ Approximation.ACM SIGMETRICS 2018Irvine, United States‌June 2018, 1HAL back to text‌
69 articleN.Nicolas Gast, S.Stratis‌ Ioannidis, P.Patrick Loiseau and B.Benjamin‌ Roussillon. Linear Regression from Strategic Data Sources‌.ACM Transactions on Economics and Computation8‌2May 2020, 1-24HAL DOI back‌ to text
70 inproceedingsN.Nicolas Gast,‌ D.Diego Latella and M.Mieke Massink.‌ A Refined Mean Field Approximation for Synchronous Population‌ Processes.MAMA 2018Workshop on MAthematical performance Modeling‌ and AnalysisIrvine, United StatesJune 2018,‌ 1-3HAL back to text
71 inproceedingsN.‌Nicolas Gast and B.Benny Van Houdt.‌ Asymptotically Exact TTL-Approximations of the Cache Replacement Algorithms‌ LRU(m) and h-LRU.28th International Teletraffic Congress‌ (ITC 28)Würzburg, GermanySeptember 2016HAL back‌ to text
72 articleN.Nicolas Gast and‌ B.Benny Van Houdt. TTL Approximations of‌ the Cache Replacement Algorithms LRU(m) and h-LRU.‌Performance EvaluationSeptember 2017HAL DOI back to‌ text
73 inproceedingsB.Bruno Gaujal, J.‌Josu Doncel and N.Nicolas Gast. Vaccination‌ in a Large Population: Mean Field Equilibrium versus‌ Social Optimum.NETGCOOP 2020 - 10th International‌ Conference on NETwork Games, COntrol and OPtimizationCargèse,‌ FranceSeptember 2021, 1-9HAL back to‌ text
74 inproceedingsB.Bruno Gaujal, A.‌Alain Girault and S.Stéphan Plassart. A‌ Linear Time Algorithm for Computing Off-line Speed Schedules‌ Minimizing Energy Consumption.MSR 2019 - 12ème‌ Colloque sur la Modélisation des Systèmes RéactifsAngers,‌ FranceNovember 2019, 1-14HAL back to‌ text
75 inproceedingsB.Bruno Gaujal, A.‌Alain Girault and S.Stéphan Plassart. Discrete‌ and Continuous Optimal Control for Energy Minimization in‌ Real-Time Systems.EBCCSP 2020 - 6th International‌ Conference on Event-Based Control, Communication, and Signal Processing‌Krakow, PolandIEEESeptember 2020, 1-8HAL‌DOI back to text
76 articleB.Bruno‌ Gaujal, A.Alain Girault and S.Stéphan‌ Plassart. Dynamic Speed Scaling Minimizing Expected Energy‌ Consumption for Real-Time Tasks.Journal of Scheduling‌July 2020, 1-25HAL DOI back to‌ text
77 techreportB.Bruno Gaujal, A.‌Alain Girault and S.Stéphan Plassart. Exploiting‌ Job Variability to Minimize Energy Consumption under Real-Time‌ Constraints.RR-9300Inria Grenoble Rhône-Alpes, Université de‌ Grenoble ; Université Grenoble - AlpesNovember 2019‌, 23HAL back to text
78 inproceedings‌A.Angeliki Giannou, E. V.Emmanouil Vasileios Vlatakis-Gkaragkounis and P.Panayotis‌ Mertikopoulos. Survival of‌ the strictest: Stable and‌‌ unstable equilibria under regularized learning with partial information‌.COLT 2021 -‌ 34th Annual Conference on‌‌ Learning TheoryBoulder, United StatesAugust 2021,‌ 1-30HAL back to‌ text
79 articleM.‌‌MT Heath and J.JA Etheridge. Visualizing‌ the performance of parallel‌ programs.IEEE software‌‌85The paper presents Paragraph.1991back‌ to text
80 inproceedings‌F. C.Franz C.‌‌ Heinrich, T.Tom Cornebize, A.Augustin‌ Degomme, A.Arnaud‌ Legrand, A.Alexandra‌‌ Carpen-Amarie, S.Sascha Hunold, A.-C.Anne-Cécile‌ Orgerie and M.Martin‌ Quinson. Predicting the‌‌ Energy Consumption of MPI Applications at Scale Using‌ a Single Node.‌Cluster 2017IEEEHawaii,‌‌ United StatesSeptember 2017HAL back to text‌
81 inproceedingsT.Torsten‌ Hoefler, T.Timo‌‌ Schneider and A.Andrew Lumsdaine. LogGOPSim -‌ Simulating Large-Scale Applications in‌ the LogGOPS Model.‌‌ACM Workshop on Large-Scale System and Application Performance‌2010back to text‌
82 inproceedingsY.-P.Ya-Ping‌‌ Hsieh, P.Panayotis Mertikopoulos and V.Volkan‌ Cevher. The limits‌ of min-max optimization algorithms:‌‌ Convergence to spurious non-critical sets.ICML 2021‌ - 38th International Conference‌ on Machine LearningVienna,‌‌ AustriaJuly 2021HALback to text
83‌ articleL. V.Laxmikant‌ V. Kalé, G.‌‌Gengbin Zheng, C. W.Chee Wai Lee‌ and S.Sameer Kumar‌. Scaling applications to‌‌ massively parallel machines using Projections performance analysis tool‌.Future Generation Comp.‌ Syst.2232006‌‌back to text
84 inproceedingsR.Rafael Keller‌ Tesser, L.Lucas‌ Mello Schnorr, A.‌‌Arnaud Legrand, F.Fabrice Dupros and P.‌ O.Philippe O A‌ Navaux. Using Simulation‌‌ to Evaluate and Tune the Performance of Dynamic‌ Load Balancing of an‌ Over-decomposed Geophysics Application.‌‌Euro-Par 2017: 23rd International European Conference on Parallel‌ and Distributed ComputingSantiago‌ de Compostela, SpainAugust‌‌ 2017, 15HALback to text
85‌ articleR.Rafael Keller‌ Tesser, L.Lucas‌‌ Mello Schnorr, A.Arnaud Legrand, C.‌Christian Heinrich, F.‌Fabrice Dupros and P.‌‌ O.Philippe Olivier Alexandre Navaux. Performance Modeling‌ of a Geophysics Application‌ to Accelerate the Tuning‌‌ of Over-decomposition Parameters through Simulation.Concurrency and‌ Computation: Practice and Experience‌2018, 1-21HAL‌‌DOI back to text
86 inproceedingsT.-E.Takai-Eddine‌ Kennouche, F.Florent‌ Cadoux, N.Nicolas‌‌ Gast and B.Benoît Vinot. ASGriDS: Asynchronous‌ Smart-Grids Distributed Simulator.‌APPEEC 2019 - 11th‌‌ IEEE PES Asia-Pacific Power and Energy Engineering Conference‌Macao, Macau SAR China‌IEEEDecember 2019,‌‌ 1-5HAL back to text
87 inproceedingsJ.‌ M.Jon M. Kleinberg‌ and M.Manish Raghavan‌‌. Selection Problems in the Presence of Implicit‌ Bias.Proceedings of‌ the 9th Innovations in‌‌ Theoretical Computer Science Conference (ITCS)2018, 33:1--33:17‌back to text
88‌ inproceedingsA.Arnaud Legrand‌‌, D.Denis Trystram‌ and S.Salah Zrigui. Adapting Batch Scheduling‌ to Workload Characteristics: What can we expect From‌ Online Learning?IPDPS 2019 - 33rd IEEE International‌ Parallel & Distributed Processing SymposiumRio de Janeiro,‌ BrazilIEEEMay 2019, 686-695HAL DOI‌back to text
89 articleJ. D.Jacob‌ D Leshno and B. S.Bary S R‌ Pradelski. The importance of memory for price‌ discovery in decentralized markets.Games and Economic‌ Behavior125January 2021, 62-78HAL DOI‌back to text
90 inproceedingsM.Mouhcine Mendil‌, N.Nicolas Gast and H.-J.Henry-Joseph Audéoud‌. Collisions groupées lors du mécanisme d'évitement de‌ collisions de CPL-G3.CoRes 2020 - Rencontres‌ Francophones sur la Conception de Protocoles, l’Évaluation de‌ Performance et l’Expérimentation des Réseaux de CommunicationLyon,‌ FranceSeptember 2020, 1-4HAL back to‌ text
91 inproceedingsP.Panayotis Mertikopoulos, B.‌Bruno Lecouat, H.Houssam Zenati, C.-S.‌Chuan-Sheng Foo, V.Vijay Chandrasekhar and G.‌Georgios Piliouras. Optimistic Mirror Descent in Saddle-Point‌ Problems: Going the Extra (Gradient) Mile.ICLR‌ 2019 - 7th International Conference on Learning Representations‌New Orleans, United StatesMay 2019, 1-23‌HAL back to text
92 inproceedingsP.Panayotis‌ Mertikopoulos, H.Heinrich Nax and B.Bary‌ Pradelski. Quick or cheap? Breaking points in‌ dynamic markets.EC 2020 - 21st ACM‌ Conference on Economics and ComputationBudapest, HungaryJuly‌ 2020, 1-32HALback to text
93‌ inproceedingsP.Panayotis Mertikopoulos, C. H.Christos‌ Harilaos Papadimitriou and G.Georgios Piliouras. Cycles‌ in adversarial regularized learning.SODA '18 -‌ Twenty-Ninth Annual ACM-SIAM Symposium on Discrete AlgorithmsNew‌ Orleans, United StatesJanuary 2018, 2703-2717HAL‌back to text
94 articleP.Panayotis Mertikopoulos‌ and W. H.William H. Sandholm. Learning‌ in games via reinforcement learning and regularization.‌Mathematics of Operations Research414November 2016‌, 1297-1324HAL DOIback to text back‌ to text
95 articleP.Panayotis Mertikopoulos and‌ W. H.William H. Sandholm. Riemannian game‌ dynamics.Journal of Economic Theory177September‌ 2018, 315-364HALDOI back to text‌
96 articleM. C.Marcelo Cogo Miletto,‌ L. L.Lucas Leandro Nesi, L.Lucas‌ Mello Schnorr and A.Arnaud Legrand. Performance‌ Analysis of Irregular Task-Based Applications on Hybrid Platforms:‌ Structure Matters.Future Generation Computer Systems135‌October 2022HAL back to text
97 inproceedings‌M.Mohsen Minaei, M.Mainack Mondal,‌ P.Patrick Loiseau, K. P.Krishna P‌ Gummadi and A.Aniket Kate. Forgetting the‌ Forgotten with Lethe: Conceal Content Deletion from Persistent‌ Observers.PETS 2019 - 19th Privacy Enhancing‌ Technologies SymposiumStockholm, SwedenJuly 2019, 1-21‌HAL back to text
98 articleW.W.E.‌ Nagel, A.A. Arnold, M.M.‌ Weber, H.H.C. Hoppe and K.K.‌ Solchenbach. VAMPIR: Visualization and Analysis of MPI Resources.Supercomputer12‌11996back to‌ text
99 inproceedingsL.‌‌ L.Lucas Leandro Nesi, A.Arnaud Legrand‌ and L.Lucas Mello‌ Schnorr. Exploiting system‌‌ level heterogeneity to improve the performance of a‌ GeoStatistics multi-phase task-based application‌.ICPP 2021 -‌‌ 50th International Conference on Parallel ProcessingLemont, United‌ StatesAugust 2021,‌ 1-10HAL DOI back‌‌ to text
100 techreportM.Miquel Oliu-Barton and‌ B.Bary Pradelski.‌ A vaccination policy by‌‌ zones.Think tank Terra NovaOctober 2020‌HAL back to text‌
101 articleM.Miquel‌‌ Oliu-Barton, B.Bary Pradelski, P.Philippe‌ Aghion, P.Patrick‌ Artus, I.Ilona‌‌ Kickbusch, J.Jeffrey Lazarus, D.Devi‌ Sridhar and S.Samantha‌ Vanderslott. SARS-CoV-2 elimination,‌‌ not mitigation, creates best outcomes for health, the‌ economy, and civil liberties‌.The Lancet397‌‌10291June 2021, 2234-2236HAL DOI back‌ to text
102 inproceedings‌M.Miquel Oliu-Barton and‌‌ B.Bary Pradelski. Green bridges: Reconnecting Europe‌ to avoid economic disaster‌.Europe in the‌‌ Time of Covid-192020HAL back to text‌
103 inproceedingsV.V.‌ Pillet, J.J.‌‌ Labarta, T.T. Cortes and S.S.‌ Girona. PARAVER: A‌ tool to visualise and‌‌ analyze parallel code.Proceedings of Transputer and‌ Occam Developments, WOTUG-18.44‌1995back to text‌‌
104 articleB. S.Bary S R Pradelski‌ and H. H.Heinrich‌ H Nax. Market‌‌ sentiments and convergence dynamics in decentralized assignment economies‌.International Journal of‌ Game Theory491‌‌March 2020, 275-298HAL DOI back to‌ text
105 techreportB.‌Bary Pradelski and M.‌‌Miquel Oliu-Barton. Focus mass testing: How to‌ overcome low test accuracy‌.Esade Centre for‌‌ Economic PolicyDecember 2020HAL back to text‌
106 articleB.Bary‌ Pradelski and M.Miquel‌‌ Oliu-Barton. Green zoning: An effective policy tool‌ to tackle the Covid-19‌ pandemic.Health Policy‌‌1258August 2021, 981-986HAL DOI‌back to text
107‌ inproceedingsD.DA Reed‌‌, P.PC Roth, R.RA Aydt‌, K.KA Shields‌, L.LF Tavera‌‌, R.RJ Noe and B.BW Schwartz‌. Scalable performance analysis:‌ the Pablo performance analysis‌‌ environment.Scalable Parallel Libraries Conference1993back‌ to text
108 thesis‌P. H.Pedro Henrique‌‌ Rocha Bruel. Toward transparent and parsimonious methods‌ for automatic performance tuning‌.UGA (Université Grenoble‌‌ Alpes); USP (Universidade de São Paulo)July 2021‌HAL back to text‌
109 inproceedingsB.Ben‌‌ Shneiderman. The eyes have it: A task‌ by data type taxonomy‌ for information visualizations.‌‌IEEE Symposium on Visual LanguagesIEEE1996back‌ to text
110 inproceedings‌D. C.David C.‌‌ Snowdon, S.Sergio Ruocco and G.Gernot‌ Heiser. Power Management‌ and Dynamic Voltage Scaling:‌‌ Myths and Facts.Proceedings of the 2005‌ Workshop on Power Aware‌ Real-time ComputingNew Jersey,‌‌ USASeptember 2005back‌ to text
111 inproceedingsT.Till Speicher,‌ M.Muhammad Ali, G.Giridhari Venkatadri,‌ F.Filipe Ribeiro, G.George Arvanitakis,‌ F.Fabrício Benevenuto, K. P.Krishna P‌ Gummadi, P.Patrick Loiseau and A.Alan‌ Mislove. Potential for Discrimination in Online Targeted‌ Advertising.FAT 2018 - Conference on Fairness,‌ Accountability, and Transparency81New-York, United StatesFebruary‌ 2018, 1-15HALback to text
112‌ inproceedingsM.Mustafa Tikir, M.Michael Laurenzano‌, L.Laura Carrington and A.Allan Snavely‌. PSINS: An Open Source Event Tracer and‌ Execution Simulator for MPI Applications.Euro-Par2009‌back to text
113 inproceedingsG.Giridhari Venkatadri‌, A.Athanasios Andreou, Y.Yabing Liu‌, A.Alan Mislove, K. P.Krishna‌ P Gummadi, P.Patrick Loiseau and O.‌Oana Goga. Privacy Risks with Facebook’s PII-based‌ Targeting: Auditing a Data Broker’s Advertising Interface.‌39th IEEE Symposium on Security and Privacy (S&P)‌Proceedings of the 39th IEEE Symposium on Security‌ and Privacy (S&P)San Francisco, United States2018‌HAL back to text
114 inproceedingsB.Benoıt‌ Vinot, F.Florent Cadoux and N.Nicolas‌ Gast. Congestion Avoidance in Low-Voltage Networks by‌ using the Advanced Metering Infrastructure.ePerf 2018‌ - IFIP WG PERFORMANCE - 36th International Symposium‌ on Computer Performance, Modeling, Measurements and EvalutionToulouse,‌ FranceDecember 2018, 1-3HAL back to‌ text
115 inproceedingsB.Benoît Vinot, F.‌Florent Cadoux and R.Rodolphe Heliot. Decentralized‌ Optimization of Energy Exchanges in an Electricity Microgrid‌ .ACM e-Energy 2016 - 7th ACM International‌ Conference on Future Energy SystemsWaterloo, CanadaJune‌ 2016HAL DOI back to text
116 inproceedings‌B.Benoît Vinot, F.Florent Cadoux and‌ R.Rodolphe Héliot. Decentralized optimization of energy‌ exchanges in an electricity microgrid.2016 IEEE‌ PES Innovative Smart Grid Technologies Conference Europe (ISGT-Europe)‌Ljubljana, SloveniaIEEEOctober 2016, 1-6HAL‌DOI back to text
117 inproceedingsM.Mark‌ Weiser, B.Brent Welch, A.Alan‌ Demers and S.Scott Shenker. Scheduling for‌ Reduced CPU Energy.Proceedings of the 1st‌ USENIX Conference on Operating Systems Design and Implementation‌OSDI '94USAMonterey, CaliforniaUSENIX Association1994‌, 2–esback to text
118 inproceedingsJ.‌Jeremiah Wilke, K.Khachik Sargsyan, J.‌Joseph Kenny, B.Bert Debusschere, H.‌Habib Najm and G.Gilbert Hendry. Validation‌ and Uncertainty Assessment of Extreme-Scale HPC Simulation through‌ Bayesian Inference.Euro-Par2013back to text‌
119 articleO.O. Zaki, E.E.‌ Lusk, W.W. Gropp and D.D.‌ Swider. Toward Scalable Performance Visualization with Jumpshot‌.International Journal of High Performance Computing Applications‌1331999DOIback to text
120‌ inproceedingsG.Gengbin Zheng, G.Gunavardhan Kakulapati‌ and L.Laxmikant Kalé. BigSim: A Parallel‌ Simulator for Performance Prediction of Extremely Large Parallel Machines.IPDPS2004‌back to text
121‌ articleS.Salah Zrigui‌‌, R. y.Raphael y de Camargo,‌ A.Arnaud Legrand and‌ D.Denis Trystram.‌‌ Improving the Performance of Batch Schedulers Using Online‌ Job Runtime Classification.‌Journal of Parallel and‌‌ Distributed Computing164February 2022, 83-95HAL‌DOI back to text‌

POLARIS - 2025

POLARIS - 2025

2025Activity﻿​﻿﻿ reportProject-TeamPOLARIS

Keywords

Computer Science and﻿​﻿﻿ Digital Science

Other Research Topics and﻿​​﻿ Application Domains

1 Team members, visitors,﻿﻿﻿‌ external collaborators

Research Scientists﻿‌​‌

Faculty Members

Post-Doctoral Fellows​​​‌

PhD Students

Interns and​​​‌ Apprentices

Administrative​​﻿﻿ Assistants

2​‌﻿﻿ Overall objectives

2.1 Context

2.2 Objectives

2.3 Contribution to AI/Learning​‌﻿﻿

3 Research program﻿﻿﻿‌

3.1 Performance Evaluation

Project-team positioning

Scientific achievements

Predictive﻿‌​‌ Simulation

Trace​‌﻿﻿ Analysis and Visualization

Design of Experiments and﻿​​﻿ Reproducibility

3.2 Asymptotic Methods﻿‌​‌

Project-team﻿​​﻿ positioning

Scientific achievements​​﻿﻿

Refined mean field approximation​​​‌

Design and​​﻿﻿ analysis of distributed control​​​‌ algorithms

Mean field﻿‌​‌ games

3.3 Distributed Online Optimization​​​‌ and Learning in Games﻿﻿﻿‌

Project-team positioning​​​‌

Scientific achievements

3.4 Responsible Computer Science​‌﻿﻿

Project-team positioning​​​‌

Scientific achievements​​​‌

Algorithmic fairness

Privacy and transparency﻿‌​‌ in social computing system﻿​​﻿

Online markets

COVID

Energy efficiency

4﻿﻿﻿‌ Application domains

4.1 Large﻿‌​‌ Computing Infrastructures

4.2 Next-Generation Wireless﻿​﻿﻿ Networks

4.3​​﻿﻿ Energy and Transportation

4.4﻿​﻿﻿ Social Computing Systems

5 Social and﻿​﻿﻿ environmental responsibility

5.1 Footprint​‌﻿﻿ of research activities

5.2 Impact of﻿﻿﻿‌ research results

6 Highlights of the﻿‌​‌ year

6.1 Awards

6.2﻿‌​‌ PhD defense

7 Latest software﻿﻿﻿‌ developments, platforms, open data﻿‌​‌

7.1 Latest software developments﻿​​﻿

7.1.1 SimGrid

7.1.2​​﻿﻿ PSI

7.1.3 marmoteCore

7.1.4​​​‌ MarTO

7.1.5 GameSeer​​​‌

7.1.6 rmf_tool

7.1.7 SCIP

8 New results﻿​﻿﻿

8.1 Scheduling in Data​‌﻿﻿ Centers

Non-Stationary Gradient Descent for﻿﻿﻿‌ Optimal Auto-Scaling in Serverless﻿‌​‌ Platforms

Autoscaling in Serverless Platforms​​​‌ via Online Learning with﻿﻿﻿‌ Convergence Guarantees

Energy-Optimal Scheduling with​​​‌ Variable Processing Speed: The﻿﻿﻿‌ Role of Task Size﻿‌​‌ Variability

Time-Constrained Energy​​﻿﻿ Minimization for Online Execution​​​‌ of a Stochastic DAG﻿​﻿﻿ Task

8.2 Performance evaluation﻿​﻿﻿ of Large Systems

8.2.1 Experimental​​​‌ practices and Simulation

Lowering﻿​﻿﻿ entry barriers to developing​‌﻿﻿ custom simulators of distributed​​﻿﻿ applications and platforms with​​​‌ SimGrid

Journée thématique du GDR﻿​​﻿ RSD : Experimental Practices​​​‌ in the Systems and﻿﻿﻿‌ Networks Community

8.2.2﻿﻿﻿‌ Mean Field

Accuracy of the﻿‌​‌ Graphon Mean Field Approximation﻿​​﻿ for Interacting Particle Systems​​​‌

8.3​‌﻿﻿ Reinforcement Learning and MDP​​﻿﻿

Logarithmic regret of​​​‌ exploration in average reward﻿​﻿﻿ Markov decision processes

8.4 Optimization​‌﻿﻿ Techniques

8.4.1 Langevin Sampling​​​‌ and Large Deviation Techniques﻿​﻿﻿

The global convergence of​‌﻿﻿ stochastic gradient descent in​​﻿﻿ non-convex landscapes: Sharp estimates​​​‌ via large deviations

Tamed﻿​​﻿ Langevin sampling under weaker​​​‌ conditions

Contractive kinetic﻿﻿﻿‌ Langevin samplers beyond global﻿‌​‌ Lipschitz continuity

2025Activity reportProject-TeamPOLARIS

Computer Science and Digital Science

Other Research Topics and Application Domains

1 Team members, visitors,‌ external collaborators

Research Scientists‌‌

Post-Doctoral Fellows‌

Interns and‌ Apprentices

Administrative Assistants

2‌ Overall objectives

2.3 Contribution to AI/Learning‌

3 Research program‌

Predictive‌‌ Simulation

Trace‌ Analysis and Visualization

Design of Experiments and Reproducibility

3.2 Asymptotic Methods‌‌

Project-team positioning

Scientific achievements

Refined mean field approximation‌

Design and analysis of distributed control‌ algorithms

Mean field‌‌ games

3.3 Distributed Online Optimization‌ and Learning in Games‌

Project-team positioning‌

3.4 Responsible Computer Science‌

Project-team positioning‌

Scientific achievements‌

Privacy and transparency‌‌ in social computing system

4‌ Application domains

4.1 Large‌‌ Computing Infrastructures

4.2 Next-Generation Wireless Networks

4.3 Energy and Transportation

4.4 Social Computing Systems

5 Social and environmental responsibility

5.1 Footprint‌ of research activities

5.2 Impact of‌ research results

6 Highlights of the‌‌ year

6.2‌‌ PhD defense

7 Latest software‌ developments, platforms, open data‌‌

7.1 Latest software developments

7.1.2 PSI

7.1.4‌ MarTO

7.1.5 GameSeer‌

8 New results

8.1 Scheduling in Data‌ Centers

Non-Stationary Gradient Descent for‌ Optimal Auto-Scaling in Serverless‌‌ Platforms

Autoscaling in Serverless Platforms‌ via Online Learning with‌ Convergence Guarantees

Energy-Optimal Scheduling with‌ Variable Processing Speed: The‌ Role of Task Size‌‌ Variability

Time-Constrained Energy Minimization for Online Execution‌ of a Stochastic DAG Task

8.2 Performance evaluation of Large Systems

8.2.1 Experimental‌ practices and Simulation

Lowering entry barriers to developing‌ custom simulators of distributed applications and platforms with‌ SimGrid

Journée thématique du GDR RSD : Experimental Practices‌ in the Systems and‌ Networks Community

8.2.2‌ Mean Field

Accuracy of the‌‌ Graphon Mean Field Approximation for Interacting Particle Systems‌

8.3‌ Reinforcement Learning and MDP

Logarithmic regret of‌ exploration in average reward Markov decision processes

8.4 Optimization‌ Techniques

8.4.1 Langevin Sampling‌ and Large Deviation Techniques

The global convergence of‌ stochastic gradient descent in non-convex landscapes: Sharp estimates‌ via large deviations

Tamed Langevin sampling under weaker‌ conditions

Contractive kinetic‌ Langevin samplers beyond global‌‌ Lipschitz continuity

8.4.2 Frank-Wolfe‌‌

Frank-Wolfe Algorithms: Sparsity Guarantees and an Application to‌ Robust Optimization

Improved algorithms‌ and novel applications of the FrankWolfe.jl library

Efficient Quadratic‌ Corrections for Frank-Wolfe Algorithms

Secant Line Search for Frank-Wolfe‌ Algorithms

The Pivoting‌ Framework: Frank-Wolfe Algorithms with Active Set Size Control‌

8.4.3 Mixed Integer‌‌ Optimization

Convex mixed-integer optimization with Frank–Wolfe methods

The SCIP Optimization Suite 10.0‌

8.4.4 Application to‌ various domains

Integrating Aggregated‌ Electric Vehicle Flexibilities in‌‌ Unit Commitment Models using Submodular Optimization

Efficient Sparse Flow Decomposition Methods for RNA Multi-Assembly‌

Mixed-Integer Optimization for Loopless Flux Distributions in Metabolic‌ Networks

8.4.5‌ Optimization for Machine Learning

Non-stationary Bandit Convex Optimization: A Comprehensive Study

Model Predictive Control is Almost Optimal for Restless‌ Bandit

Does Stochastic Gradient really succeed for‌ Bandits?

8.5 Learning in games

8.5.1 Learning in stochastic games‌

Learning in stochastic games