Stability and Optimization of Speculative Queueing Networks

POLARIS Performance analysis and Optimization of LARge Infrastructures and Systems

Distributed and High Performance Computing

Networks, Systems and Services, Distributed Computing

http://team.inria.fr/polaris Laboratoire d'Informatique de Grenoble (LIG) Université de Grenoble Alpes, CNRS Creation of the Project-Team: 2018 January 01 Project-Team A1.2. - Networks A1.3.5. - Cloud A1.3.6. - Fog, Edge A1.6. - Green Computing A3.4. - Machine learning and statistics A3.5.2. - Recommendation systems A5.2. - Data visualization A6. - Modeling, simulation and control A6.2.3. - Probabilistic methods A6.2.4. - Statistical methods A6.2.6. - Optimization A6.2.7. - High performance computing A8.2. - Optimization A8.9. - Performance evaluation A8.11. - Game Theory A9.2. - Machine learning A9.9. - Distributed AI, Multi-agent B4.4. - Energy delivery B4.4.1. - Smart grids B4.5.1. - Green computing B6.2. - Network technologies B6.2.1. - Wired technologies B6.2.2. - Radio technology B6.4. - Internet of things B8.3. - Urbanism and urban planning B9.6.7. - Geography B9.7.2. - Open data B9.8. - Reproducibility Grenoble - Rhône-Alpes Arnaud Legrand Chercheur Team leader, CNRS, Senior Researcher oui Jonatha Anselmi Chercheur Inria, Researcher Nicolas Gast Chercheur Inria, Researcher oui Bruno Gaujal Chercheur Inria, Senior Researcher oui Patrick Loiseau Chercheur Inria, Researcher oui Panayotis Mertikopoulos Chercheur CNRS, Researcher oui Bary Pradelski Chercheur CNRS, Researcher Romain Couillet Enseignant Institut polytechnique de Grenoble, Professor, from Sep 2021 Vincent Danjean Enseignant Univ Grenoble Alpes, Associate Professor Guillaume Huard Enseignant Univ Grenoble Alpes, Associate Professor Florence Perronnin Enseignant Univ Grenoble Alpes, Associate Professor oui Jean-Marc Vincent Enseignant Univ Grenoble Alpes, Associate Professor Philippe Waille Enseignant Univ Grenoble Alpes, Associate Professor Henry Joseph Audeoud PostDoc INPG Entreprise SA Dong Quan Vu PostDoc CNRS Sebastian Allmeier PhD Inria Kimon Antonakopoulos PhD CNRS Thomas Barzola PhD Univ Grenoble Alpes Victor Boone PhD École Normale Supérieure de Lyon, from Sep 2021 Remi Castera PhD Univ Grenoble Alpes, from Oct 2021 Tom Cornebize PhD Inria, until Mar 2021 Romain Cravic PhD Inria, from Oct 2021 Vitalii Emelianov PhD Inria Yu Guan Hsieh PhD Univ Grenoble Alpes Simon Philipp Jantschgi PhD Université de Zurich Kimang Khun PhD Inria Till Kletti PhD Naver Labs, CIFRE Lucas Leandro Nesi PhD Federal University of the Rio Grande Do Sul (UFRGS), from Nov 2021 Hugo Lebeau PhD Univ Grenoble Alpes, from Oct 2021 Victor Leger PhD Univ Grenoble Alpes, from Oct 2021 Dimitrios Moustakas PhD Institut polytechnique de Grenoble Louis Sebastien Rebuffi PhD Univ Grenoble Alpes Pedro Rocha Bruel PhD Université de Sao Paulo - Brésil, until Oct 2021 Benjamin Roussillon PhD Univ Grenoble Alpes, until Sep 2021 Vera Sosnovik PhD Univ Grenoble Alpes Chen Yan PhD Univ Grenoble Alpes Bruno De Moura Donassolo Technique Inria, Engineer Eleni Gkiouzepi Technique Univ Grenoble Alpes, Engineer, until Nov 2021 Achille Baucher Stagiaire Univ Grenoble Alpes, from Oct 2021 Victor Boone Stagiaire École Normale Supérieure de Lyon, from Feb 2021 until Jul 2021 Remi Castera Stagiaire Inria, from Apr 2021 until Sep 2021 Romain Cravic Stagiaire Inria, from Feb 2021 until Jul 2021 Mael Delorme Stagiaire Inria, from May 2021 until Jun 2021 Aurelien Gauffre Stagiaire Univ Grenoble Alpes, from Feb 2021 until Jul 2021 Jeremy Guerin Stagiaire Inria, from Apr 2021 until Sep 2021 Oumaima Hajji Stagiaire Inria, from May 2021 until Jul 2021 Mathieu Molina Stagiaire Inria, from May 2021 until Oct 2021 Julie Reynier Stagiaire Inria, from May 2021 until Jul 2021 Annie Simon Assistant Inria Overall objectives Context

Large distributed infrastructures are rampant in our society. Numerical simulations form the basis of computational sciences and high performance computing infrastructures have become scientific instruments with similar roles as those of test tubes or telescopes. Cloud infrastructures are used by companies in such an intense way that even the shortest outage quickly incurs the loss of several millions of dollars. But every citizen also relies on (and interacts with) such infrastructures via complex wireless mobile embedded devices whose nature is constantly evolving. In this way, the advent of digital miniaturization and interconnection has enabled our homes, power stations, cars and bikes to evolve into smart grids and smart transportation systems that should be optimized to fulfill societal expectations.

Our dependence and intense usage of such gigantic systems obviously leads to very high expectations in terms of performance. Indeed, we strive for low-cost and energy-efficient systems that seamlessly adapt to changing environments that can only be accessed through uncertain measurements. Such digital systems also have to take into account both the users' profile and expectations to efficiently and fairly share resources in an online way. Analyzing, designing and provisioning such systems has thus become a real challenge.

Such systems are characterized by their ever-growing size, intrinsic heterogeneity and distributedness, user-driven requirements, and an unpredictable variability that renders them essentially stochastic. In such contexts, many of the former design and analysis hypotheses (homogeneity, limited hierarchy, omniscient view, optimization carried out by a single entity, open-loop optimization, user outside of the picture) have become obsolete, which calls for radically new approaches. Properly studying such systems requires a drastic rethinking of fundamental aspects regarding the system's observation (measure, trace, methodology, design of experiments), analysis (modeling, simulation, trace analysis and visualization), and optimization (distributed, online, stochastic).

Objectives

The goal of the POLARIS project is to contribute to the understanding of the performance of very large scale distributed systems by applying ideas from diverse research fields and application domains. We believe that studying all these different aspects at once without restricting to specific systems is the key to push forward our understanding of such challenges and to proposing innovative solutions. This is why we intend to investigate problems arising from application domains as varied as large computing systems, wireless networks, smart grids and transportation systems.

The members of the POLARIS project cover a very wide spectrum of expertise in performance evaluation and models, distributed optimization, and analysis of HPC middleware. Specifically, POLARIS' members have worked extensively on:

Experiment design:

Experimental methodology, measuring/monitoring/tracing tools, experiment control, design of experiments, and reproducible research, especially in the context of large computing infrastructures (such as computing grids, HPC, volunteer computing and embedded systems).

Trace Analysis:

Parallel application visualization (paje, triva/viva, framesoc/ocelotl, ...), characterization of failures in large distributed systems, visualization and analysis for geographical information systems, spatio-temporal analysis of media events in RSS flows from newspapers, and others.

Modeling and Simulation:

Emulation, discrete event simulation, perfect sampling, Markov chains, Monte Carlo methods, and others.

Optimization:

Stochastic approximation, mean field limits, game theory, discrete and continuous optimization, learning and information theory.

Contribution to AI/Learning

AI and Learning is everywhere now. Let us clarify how our research activities are positionned with respect to this trend.

A first line of research in POLARIS is devoted to the use statistical learning techniques (Bayesian inference) to model the expected performance of distributed systems, to build aggregated performance views, to feed simulators of such systems, or to detect anomalous behaviours.

In a distributed context it is also essential to design systems that can seamlessly adapt to the workload and to the evolving behaviour of its components (users, resources, network). Obtaining faithful information on the dynamic of the system can be particularly difficult, which is why it is generally more efficient to design systems that dynamically learn the best actions to play through trial and errors. A key characteristic of the work in the POLARIS project is to leverage regularly game-theoretic modeling to handle situations where the resources or the decision is distributed among several agents or even situations where a centralised decision maker has to adapt to strategic users.

An important research direction in POLARIS is thus centered on reinforcement learning (Multi-armed bandits, Q-learning, online learning) and active learning in environments with one or several of the following features:

Feedback is limited (e.g., gradient or even stochastic gradients are not available, which requires for example to resort to stochastic approximations);

Multi-agent setting where each agent learns, possibly not in a synchronised way (i.e., decisions may be taken asynchronously, which raises convergence issues);

Delayed feedback (avoid oscillations and quantify convergence degradation);

Non stochastic (e.g., adversarial) or non stationary workloads (e.g., in presence of shocks);

Systems composed of a very large number of entities, that we study through mean field approximation (mean-field games and mean field control).

As a side effect, many of the gained insights can often be used to dramatically improve the scalability and the performance of the implementation of more standard machine or deep learning techniques over supercomputers.

The POLARIS members are thus particularly interested in the design and analysis of adaptive learning algorithms for multi-agent systems, i.e. agents that seek to progressively improve their performance on a specific task (see Figure). The resulting algorithms should not only learn an efficient (Nash) equilibrium but they should also be able of doing so quickly (low regret), even when facing the difficulties associated to a distributed context (lack of coordination, uncertain world, information delay, limited feedback, …)

In the rest of this document, we describe in detail our new results in the above areas.

Research program Performance Evaluation JonathaAnselmiVincentDanjeanNicolasGastGuillaumeHuardArnaudLegrandFlorencePerronninJean-MarcVincent Project-team positioning

Evaluating the scalability, robustness, energy consumption and performance of large infrastructures such as exascale platforms and clouds raises severe methodological challenges. The complexity of such platforms mandates empirical evaluation but direct experimentation via an application deployment on a real-world testbed is often limited by the few platforms available at hand and is even sometimes impossible (cost, access, early stages of the infrastructure design, etc.). Furthermore, such experiments are costly, difficult to control and therefore difficult to reproduce. Although many of these digital systems have been built by men, they have reached such a complexity level that we are no longer able to study them like artificial systems and have to deal with the same kind of experimental issues as natural sciences. The development of a sound experimental methodology for the evaluation of resource management solutions is among the most important ways to cope with the growing complexity of computing environments. Although computing environments come with their own specific challenges, we believe such general observation problems should be addressed by borrowing good practices and techniques developed in many other domains of science, in particular (1) Predictive Simulation, (2) Trace Analysis and Visualization, and (3) the Design of Experiments.

Scientific achievements

Large computing systems are particularly complex to understand because of the interplay between their discrete nature (originating from deterministic computer programs) and their stochastic nature (emerging from the physical world, long distance interactions, and complex hardware and software stacks). A first line of research in POLARIS is devoted to the design of relatively simple statistical models of key components of distributed systems and their exploitation to feed simulators of such systems, to build aggregated performance views, and to detect anomalous behaviors.

Predictive Simulation

Unlike direct experimentation via an application deployment on a real-world testbed, simulation enables fully repeatable and configurable experiments that can often be conducted quickly for arbitrary hypothetical scenarios. In spite of these promises, current simulation practice is often not conducive to obtaining scientifically sound results. To date, most simulation results in the parallel and distributed computing literature are obtained with simulators that are ad hoc, unavailable, undocumented, and/or no longer maintained. As a result, most published simulation results build on throw-away (short-lived and non validated) simulators that are specifically designed for a particular study, which prevents other researchers from building upon it. There is thus a strong need for recognized simulation frameworks by which simulation results can be reproduced, further analyzed and improved.

Many simulators of MPI applications have been developed by renowned HPC groups (e.g., at SDSC 104, BSC 49, UIUC 112, Sandia Nat. Lab. 110, ORNL 50 or ETH Zürich 80) but most of them builds on restrictive network and application modeling assumptions that generally prevent to faithfully predict execution times, which limits the use of simulation to indication of gross trends at best.

The SimGrid simulation toolkit, whose development started more than 20 years ago in UCSD, is a renowned project which gathers more than 1,700 citations and has supported the research of at least 550 articles. The most important contribution of POLARIS to this project in the last years has been to improve the quality of SimGrid to the point where it can be used effectively on a daily basis by practitioners to accurately reproduce the dynamic of real HPC systems. In particular, SMPI57, a simulator based on SimGrid that simulates unmodified MPI applications written in C/C++ or FORTRAN, has now become a very unique tool allowing to faithfully study particularly complex scenario such as legacy a legacy Geophysics application that suffers from spatial and temporal load balancing problem 83, 82 or the HPL benchmark 5637. We have shown that the performance (both for time and energy consumption 79) predicted through our simulations was systematically within a few percents of real experiments, which allows to reliably tune the applications at very low cost. This capacity has also been leveraged to study (through StarPU-SimGrid) complex and modern task-based applications running on heterogeneous sets of hybrid (CPUs + GPUs) nodes 29. The phenomenon studied through this approach would be particularly difficult to study through real experiments but yet allow to address real problems of these applications. Finally, SimGrid is also heavily used through BatSim, a batch simulator developed in the DATAMOVE team and which leverages SimGrid, to investigate the performance of machine learning strategies in a batch scheduling context 86, 113.

Trace Analysis and Visualization

Many monolithic visualization tools have been developed by renowned HPC groups since decades (e.g., BSC 97, Jülich and TU Dresden 94, 52, UIUC 78, 100, 81 and ANL 111) but most of these tools build on the classical information visualization 101 that consists in always first presenting an overview of the data, possibly by plotting everything if computing power allows, and then to allow users to zoom and filter, providing details on demand. However in our context, the amount of data comprised in such traces is several orders of magnitude larger than the number of pixels on a screen and displaying even a small fraction of the trace leads to harmful visualization artifacts. Such traces are typically made of events that occur at very different time and space scales and originate from different sources, which hinders classical approaches, especially when the application structure departs from classical MPI programs with a BSP/SMPD structure. In particular, modern HPC applications that build on a task-based runtime and run on hybrid nodes are particularly challenging to analyze. Indeed, the underlying task-graph is dynamically scheduled to avoid spurious synchronizations, which prevents classical visualizations to exploit and reveal the application structure.

In 65, we explain how modern data analytics tools can be used to build, from heterogeneous information sources, custom, reproducible and insightful visualizations of task-based HPC applications at a very low development cost in the StarVZ framework. By specifying and validating statistical models of the performance of HPC applications/systems, we manage to identify when their behavior departs from what is expected and detect performance anomalies. This approach has first been applied to state-of-the art linear algebra libraries in 65 and more recently to a sparse direct solver 43. In both cases, we have been able to identify and fix several non-trivial anomalies that had not been noticed even by the application and runtime developpers. Finally, these models not only allow to reveal when applications depart from what is expected but also to summarize the execution by focusing on the most important features, which is particularly useful when comparing two execution

Design of Experiments and Reproducibility

Part of our work is devoted to the control of experiments on both classical (HPC) and novel (IoT/Fog in a smart home context) infrastructures. To this end, we heavily rely on experimental testbeds such as Grid5000 and FIT-IoTLab that can be well-controlled but real experiments are nonetheless quite resource-consuming. Design of experiments has been successfully applied in many fields (e.g., agriculture, chemistry, industrial processes) where experiments are considered expensive. Building on concrete use cases, we explore how Design of Experiments and Reproducible Research techniques can be used to (1) design transparent auto-tuning strategies of scientific computation kernels 5134 (2) set up systematic performance non regression tests on Grid5000 (450 nodes for 1.5 year) and detect many abnormal events (related to bios and system upgrades, cooling, faulty memory and power unstabiliy) that had a significant effect on the nodes, from subtle performance changes of 1% to much more severe degradations of more than 10%, and had yet been unnoticed by both Grid’5000 technical team and Grid’5000 users (3) design and evaluate the performance of service provisioning strategies 458 in Fog infrastructures.

Asymptotic Methods JonathaAnselmiRomainCouilletNicolasGastBrunoGaujalFlorencePerronninJean-MarcVincent Project-team positioning

Stochastic models often suffer from the curse of dimensionality: their complexity grows exponentially with the number of dimensions of the system. At the same time, very large stochastic systems are sometimes easier to analyze: it can be shown that some classes of stochastic systems simplify as their dimension goes to infinity because of averaging effects such as the law of large numbers, or the central limit theorem. This form the basis of what is called an asymptotic method, which consists in studying what happens when a system gets large in order to build an approximation that is easier to study or to simulate.

Within the team, the research that we conduct in this axis is to foster the applicability of these asymptotic methods to new application areas. This lead us to work on the application of classical methods to new problems, but also to develop new approximation methods that take into account special features of the systems we study (i.e., moderate number of dimensions, transient behavior, random matrices). Typical applications are mean field method for performance evaluation, application to distributed optimization, and more recently statistical learning. One of the originality of our work is to quantify precisely what is the error made by such approximations. This allows us to define refinement terms that lead to more accurate approximations.

Scientific achievements Refined mean field approximation

Mean field approximation is a well-known technique in statistical physics, that was originally introduced to study systems composed of a very large number of particles (say $n > 10^{20}$ ). The idea of this approximation is to assume that objects are independent and only interact between them through an average environment (the mean field). Nowadays, variants of this technique are widely applied in many domains: in game theory for instance (with the example of mean field games), but also to quantify the performance of distributed algorithms. Mean field approximation is often justified by showing a system of $n$ well-mixed interacting objects converges to its deterministic mean field approximation as $n$ goes to infinity. Yet, this does not explain why mean field approximation provides a very accurate approximation of the behavior of systems composed by a few hundreds of objects or less. Until recently, this was essentially an open question.

In 67, we give a partial answer to this question. We show that, for most of the mean field models used for performance evaluation, the error made when using a mean field approximation is a $Θ (1 / n)$ . This results greatly improved compared to previous work that showed that the error made by mean field approximation was smaller than $O (1 / \sqrt{n})$ . On the contrary, we obtain the exact rate of accuracy. This result came from the use of Stein's method that allows one to quantify precisely the distance between two stochastic processes. Subsequently, in 69, we show that the constant in the $Θ (1 / n)$ can be computed numerically by a very efficient algorithm. By using this, we define the notion of refined approximation which consists in adding the $1 / n$ -correction term. This methods can also be generalize to higher order extensions or 71, 66.

Design and analysis of distributed control algorithms

Mean field approximation is widely used in the perfromance evaluation community to analyze and design distributed control algorithms. Our contribution in this domain has covered mainly two applications: cache replacement algorithms and load balancing algorithms.

Cache replacement algorithms are widely used in content delivery networks. In 54, 73, 72, we show how mean field and refined mean field approximation can be used to evaluate the performance of list-based cache replacement algorithms. In particuler, we show that such policies can outperform the classically used LRU algorithm. A methological contribution of our work is that, when evaluating precisely the behavior of such a policy, the refined mean field approximation is both faster and more accurate than what could be obtain with a stochastic simulator.

Computing resources are often spread across many machines. An efficient use of such resources requires the design of a good load balancing strategy, to distribute the load among the available machines. In 47, 48, 46, we study two paradigms that we use to design asymptotically optimal load balancing policies where a central broker sends tasks to a set of parallel servers. We show in 47, 46 that combining the classical round-robin allocation plus an evaluation of the tasks sizes can yield a policy that has a zero delay in the large system limit. This policy is interesting because the broker does not need any feedback from the servers. At the same time, this policy needs to estimate or know job durations, which is not always possible. A different approach is used in 48 where we consider a policy that does not need to estimate job durations but that uses some feedback from the servers plus a memory of where jobs where send. We show that this paradigm can also be used to design zero-delay load balancing policies as the system size grows to infinity.

Mean field games

Various notions of mean field games have been introduced in the years 2000-2010 in theoretical economics, engineering or game theory. A mean field game is a game in which an individual tries to maximize its utility while evolving in a population of other individuals whose behavior are not directly affected by the individual. An equilibrium is a population dynamics for which a selfish individual would behave as the population. In 60, we develop the notion of discrete space mean field games, that is more amendable to study that the previously introduced notions of mean field games. This lead to two interesting contributions: mean field games are not always the limits of stochastic games as the number of players grow 59, mean field games can be used to study how much vaccination should be subsidized to encourage people to adapt an socially optimal behaviour 18.

Distributed Online Optimization and Learning in Games NicolasGastRomainCouilletBrunoGaujalArnaudLegrandPatrickLoiseauPanayotisMertikopoulosBaryPradelski Project-team positioning

Online learning concerns the study of repeated decision-making in changing environments. Of course, depending on the context, the words “learning” and “decision-making” may refer to very different things: in economics, this could mean predicting how rational agents react to market drifts; in data networks, it could mean adapting the way packets are routed based on changing traffic conditions; in machine learning and AI applications, it could mean training a neural network or the guidance system of a self-driving car; etc. In particular, the changes in the learner's environment could be either exogenous (that is, independent of the learner's decisions, such as the weather affecting the time of travel), or endogenous (i.e., they could depend on the learner's decisions, as in a game of poker), or any combination thereof. However, the goal for the learner(s) is always the same: to make more informed decisions that lead to better rewards over time.

The study of online learning models and algorithms dates back to the seminal work of Robbins, Nash and Bellman in the 50's, and it has since given rise to a vigorous research field at the interface of game theory, control and optimization, with numerous applications in operations research, machine learning, and data science. In this general context, our team focuses on the asymptotic behavior of online learning and optimization algorithms, both single- and multi-agent: whether they converge, at what speed, and/or what type of non-stationary, off-equilibrium behaviors may arise when they do not.

The focus of POLARIS on game-theoretic and Markovian models of learning covers a set of specific challenges that dovetail in a highly synergistic manner with the work of other learning-oriented teams within Inria (like SCOOL in Lille, SIERRA in Paris, and THOTH in Grenoble), and it is an important component of Inria's activities and contributions in the field (which includes major industrial stakeholders like Google / DeepMind, Facebook, Microsoft, Amazon, and many others).

Scientific achievements

Our team's work on online learning covers both single- and multi-agent models; in the sequel, we present some highlights of our work structured along these basic axes.

In the single-agent setting, an important problem in the theory of Markov decision processes – i.e., discrete-time control processes with decision-dependent randomness – is the so-called “restless bandit” problem. Here, the learner chooses an action – or “arm” – from a finite set, and the mechanism determining the action's reward changes depending on whether the action was chosen or not (in contrast to standard Markov problems where the activation of an arm does not have this effect). In this general setting, Whittle conjectured – and Weber and Weiss proved – that Whittle's eponymous index policy is asymptotically optimal. However, the result of Weber and Weiss is purely asymptotic, and the rate of this convergence remained elusive for several decades. This gap was finally settled in a series of POLARIS papers 6840, where we showed that Whittle indices (as well as other index policies) become optimal at a geometric rate under the same technical conditions used by Weber and Weiss to prove Whittle's conjecture, plus a technical requirement on the non-singularity of the fixed point of the mean-field dynamics. We also propose the first sub-cubic algorithm to compute Whittle and Gittins indexes. As for reinforcement learning in Markovian bandits, we have shown that Bayesian and optimistic approaches do not use the structure of Markovian bandits similarly: While Bayesian learning has both a regret and a computational complexity that scales linearly with the number of arms, optimistic approaches all incur an exponential computation time, at least in their current versions 39.

In the multi-agent setting, our work has focused on the following fundamental question:

Does the concurrent use of (possibly optimal) single-agent learning algorithms

ensure convergence to Nash equilibrium in multi-agent, game-theoretic environments?

Conventional wisdom might suggest a positive answer to this question because of the following “folk theorem”: under no-regret learning, the agents' empirical frequency of play converges to the game's set of coarse correlated equilibria. However, the actual implications of this result are quite weak: First, it concerns the empirical frequency of play and not the day-to-day sequence of actions employed by the players. Second, it concerns coarse correlated equilibria which may be supported on strictly dominated strategies – and are thus unacceptable in terms of rationalizability. These realizations prompted us to make a clean break with conventional wisdom on this topic, ultimately showing that the answer to the above question is, in general, “no”: specifically, 90, 88 showed that the (optimal) class of “follow-the-regularized-leader” (FTRL) learning algorithms leads to Poincaré recurrence even in simple, $2 \times 2$ min-max games, thus precluding convergence to Nash equilibrium in this context.

This negative result generated significant interest in the literature as it contributed in shifting the focus towards identifying which Nash equilibria may arise as stable limit points of FTRL algorithms and dynamics. Earlier work by POLARIS on the topic 53, 91, 92 suggested that strict Nash equilibria play an important role in this question. This suspicion was recently confirmed in a series of papers 6419 where we established a sweeping negative result to the effect that mixed Nash equilibria are incompatible with no-regret learning. Specifically, we showed that any Nash equilibrium which is not strict cannot be stable and attracting under the dynamics of FTRL, especially in the presence of randomness and uncertainty. This result has significant implications for predicting the outcome of a multi-agent learning process because, combined with 91, it establishes the following far-reaching equivalence: a state is asymptotically stable under no-regret learning if and only if it is a strict Nash equilibrium.

Going beyond finite games, this further raised the question of what type of non-convergent behaviors can be observed in continuous games – such as the class of stochastic min-max problems that are typically associated to generative adversarial networks (GANs) in machine learning. This question was one of our primary collaboration axes with EPFL, and led to a joint research project focused on the characterization of the convergence properties of zeroth-, first-, and (scalable) second-order methods in non-convex/non-concave problems. In particular, we showed in 25 that these state-of-the-art min-max optimization algorithms may converge with arbitrarily high probability to attractors that are in no way min-max optimal or even stationary – and, in fact, may not even contain a single stationary point (let alone a Nash equilibrium). Spurious convergence phenomena of this type can arise even in two-dimensional problems, a fact which corroborates the empirical evidence surrounding the formidable difficulty of training GANs.

Responsible Computer Science NicolasGastRomainCouilletBrunoGaujalArnaudLegrandPatrickLoiseauPanayotisMertikopoulosBaryPradelski Project-team positioning

The topics in this axis emerge from current social and economic questions rather than from a fixed set of mathematical methods. To this end we have identified large trends such as energy efficiency, fairness, privacy, and the growing number of new market places. In addition, COVID has posed new questions that opened new paths of research with strong links to policy making.

Throughout these works, the focus of the team is on modeling aspects of the aforementioned problems, and obtaining strong theoretical results that can give high-level guidelines on the design of markets or of decision-making procedures. Where relevant, we complement those works by measurement studies and audits of existing systems that allow identifying key issues. As this work is driven by topics, rather than methods, it allows for a wide range of collaborations, including with enterprises (e.g., Naverlabs), policy makers, and academics from various fields (economics, policy, epidemiology, etc.).

Other teams at Inria cover some of the societal challenges listed here (e.g., PRIVATICS, COMETE) but rather in isolation. The specificity of POLARIS resides in the breadth of societal topics covered and of the collaborations with non-CS researchers and non-research bodies; as well as in the application of methods such as game theory to those topics.

Scientific achievements Algorithmic fairness

As algorithmic decision-making became increasingly omnipresent in our daily lives (in domains ranging from credits to advertising, hiring, or medicine); it also became increasingly apparent that the outcome of algorithms can be discriminatory for various reasons. Since 2016, the scientific community working on the problem of algorithmic fairness has been exponentially increasing. In this context, in the early days, we worked on better understanding the extent of the problem through measurement in the case of social networks 103. In particular, in this work, we showed that in advertising platforms, discrimination can occur from multiple different internal processes that cannot be controlled, and we advocate for measuring discrimination on the outcome directly. Then we worked on proposing solutions to guarantee fair representation in online public recommendations (aka trending topics on Twitter) 55. This is an example of an application in which it was observed that recommendations are typically biased towards some demographic groups. In this work, our proposed solution draws an analogy between recommendation and voting and builds on existing works on fair representation in voting. Finally, in most recent times, we worked on better understanding the sources of discrimination, in the particular simple case of selection problems, and the consequences of fixing it. While most works attribute discrimination to implicit bias of the decision maker 85, we identified a fundamentally different source of discrimination: Even in the absence of implicit bias in a decision maker’s estimate of candidates’ quality, the estimates may differ between the different groups in their variance—that is, the decision maker’s ability to precisely estimate a candidate’s quality may depend on the candidate’s group 63. We show that this differential variance leads to discrimination for two reasonable baseline decision makers (group-oblivious and Bayesian optimal). Then we analyze the consequence on the selection utility of imposing fairness mechanisms such as demographic parity or its generalization; in particular we identify some cases for which imposing fairness can improve utility. In 62, we also study similar questions in the two-stage setting, and derive the optimal selector and the “price of local fairness’’ one pays in utility by imposing that the interim stage be fair.

Privacy and transparency in social computing system

Online services in general, and social networks in particular, collect massive amounts of data about their users (both online and offline). It is critical that (i) the users’ data is protected so that it cannot leak and (ii) users can know what data the service has about them and understand how it is used—this is the transparency requirement. In this context, we did two kinds of work. First, we studied social networks through measurement, in particular using the use case of Facebook. We showed that their advertising platform, through the PII1-based targeting option, allowed attackers to discover some personal data of users 105. We also proposed an alternative design—valid for any system that proposed PII-based targeting—and proved that it fixes the problem. We then audited the transparency mechanisms of the Facebook ad platform, specifically the “Ad Preferences’’ page that shows what interests the platform inferred about a user, and the “Why am I seeing this’’ button that gives some reasons why the user saw a particular ad. In both cases, we laid the foundation for defining the quality of explanations and we showed that the explanations given were lacking key desirable properties (they were incomplete and misleading, they have since been changed) 45. A follow-up work shed further light on the typical uses of the platform 44. In another work, we proposed an innovative protocol based on randomized withdrawal to protect public posts deletion privacy 93. Finally, in 70, we study an alternative data sharing ecosystem where users can choose the precision of the data they give. We model it as a game and show that, if users are motivated to reveal data by a public good component of the outcome’s precision, then certain basic statistical properties (the optimality of generalized least squares in particular) no longer hold.

Online markets

Market design operates at the intersection of computer science and economics and has become increasingly important as many markets are redesigned on digital platforms. Studying markets for commodities, in an ongoing project we evaluate how different fee models alter strategic incentives for both buyers and sellers. We identify two general classes of fees: for one, strategic manipulation becomes infeasible as the market grows large and agents therefore have no incentive to misreport their true valuation. On the other hand, strategic manipulation is possible and we show that in this case agents aim to maximally shade their bids. This has immediate implications for the design of such markets. By contrast, 89 considers a matching market where buyers and sellers have heterogeneous preferences over each other. Traders arrive at random to the market and the market maker, having limited information, aims to optimize when to open the market for a clearing event to take place. There is a tradeoff between thickening the market (to achieve better matches) and matching quickly (to reduce waiting time of traders in the market). The tradeoff is made explicit for a wide range of underlying preferences. These works are adding to an ongoing effort to better understand and design markets 988.

COVID

The COVID-19 pandemic has put humanity to one of the defining challenges of its generation and as such naturally trans-disciplinary efforts have been necessary to support decision making. In a series of articles 1096 we proposed Green Zoning. `Green zones’–areas where the virus is under control based on a uniform set of conditions–can progressively return to normal economic and social activity levels, and mobility between them is permitted. By contrast, stricter public health measures are in place in ‘red zones’, and mobility between red and green zones is restricted. France and Spain were among the first countries to introduce green zoning in April 2020. The initial success of this proposal opened up the way to a large amount of follow-up work analyzing and proposing various tools to effectively deploy different tools to combat the pandemic (e.g., focus-mass testing 99 and a vaccination policy 95). In a joint work with a group of leading economists, public health researchers and sociologists it was found that countries that opted to aim to eliminate the virus fared better not only for public health, but also for the economy and civil liberties 9. Overall this work has been characterized by close interactions with policy makers in France, Spain and the European Commission as well as substantial activity in public discourse (via TV, newspapers and radio).

Energy efficiency

Our work on energy efficiency spanned multiple different areas and applications such as embedded systems and smart grids. Minimizing the energy consumption of embedded systems with real-time constraints is becoming more important for ecological as well as practical reasons since batteries are becoming standard power supplies. Dynamically changing the speed of the processor is the most common and efficient way to reduce energy consumption 102. In fact, this is the reason why modern processors are equipped with Dynamic Voltage and Frequency Scaling (DVFS) technology 109. In a stochastic environment, with random job sizes and arrival times, combining hard deadlines and energy minimization via DVFS-based techniques is difficult because forcing hard deadlines requires considering the worst cases, hardly compatible with random dynamics. Nevertheless, progress have been made to solve these types of problems in a series of papers using constrained Markov decision processes, both on the theoretical side (proving existence of optimal policies and showing their structure 76, 74, 75) as well as on the experimental side (showing the gains of optimal policies over classical solutions 77).

In the context of a collaboration with Enedis and Schneider Electric (via the Smart Grid chair of Grenoble-INP), we also study the problem of using smart meters to optimize the behavior of electrical distribution networks. We made three kinds of contributions on this subject: (1) how to design efficient control strategies in such a system 106, 108, 107, (2) how to co-simulate an electrical network and a communication network 84, and (3) what is the performance of the communication protocol (PLC G3) used by the Linky smart meters 87.

Application domains Large Computing Infrastructures

Supercomputers typically comprise thousands to millions of multi-core CPUs with GPU accelerators interconnected by complex interconnection networks that are typically structured as an intricate hierarchy of network switches. Capacity planning and management of such systems not only raises challenges in term of computing efficiency but also in term of energy consumption. Most legacy (SPMD) applications struggle to benefit from such infrastructure since the slightest failure or load imbalance immediately causes the whole program to stop or at best to waste resources. To scale and handle the stochastic nature of resources, these applications have to rely on dynamic runtimes that schedule computations and communications in an opportunistic way. Such evolution raises challenges not only in terms of programming but also in terms of observation (complexity and dynamicity prevents experiment reproducibility, intrusiveness hinders large scale data collection, ...) and analysis (dynamic and flexible application structures make classical visualization and simulation techniques totally ineffective and require to build on ad hoc information on the application structure).

Next-Generation Wireless Networks

Considerable interest has arisen from the seminal prediction that the use of multiple-input, multiple-output (MIMO) technologies can lead to substantial gains in information throughput in wireless communications, especially when used at a massive level. In particular, by employing multiple inexpensive service antennas, it is possible to exploit spatial multiplexing in the transmission and reception of radio signals, the only physical limit being the number of antennas that can be deployed on a portable device. As a result, the wireless medium can accommodate greater volumes of data traffic without requiring the reallocation (and subsequent re-regulation) of additional frequency bands. In this context, throughput maximization in the presence of interference by neighboring transmitters leads to games with convex action sets (covariance matrices with trace constraints) and individually concave utility functions (each user's Shannon throughput); developing efficient and distributed optimization protocols for such systems is one of the core objectives of Theme 5.

Another major challenge that occurs here is due to the fact that the efficient physical layer optimization of wireless networks relies on perfect (or close to perfect) channel state information (CSI), on both the uplink and the downlink. Due to the vastly increased computational overhead of this feedback – especially in decentralized, small-cell environments – the ongoing transition to fifth generation (5G) wireless networks is expected to go hand-in-hand with distributed learning and optimization methods that can operate reliably in feedback-starved environments. Accordingly, one of POLARIS' application-driven goals will be to leverage the algorithmic output of Theme 5 into a highly adaptive resource allocation framework for next-géneration wireless systems that can effectively "learn in the dark", without requiring crippling amounts of feedback.

Energy and Transportation

Smart urban transport systems and smart grids are two examples of collective adaptive systems. They consist of a large number of heterogeneous entities with decentralised control and varying degrees of complex autonomous behaviour. We develop an analysis tools to help to reason about such systems. Our work relies on tools from fluid and mean-field approximation to build decentralized algorithms that solve complex optimization problems. We focus on two problems: decentralized control of electric grids and capacity planning in vehicle-sharing systems to improve load balancing.

Social Computing Systems

Social computing systems are online digital systems that use personal data of their users at their core to deliver personalized services directly to the users. They are omnipresent and include for instance recommendation systems, social networks, online medias, daily apps, etc. Despite their interest and utility for users, these systems pose critical challenges of privacy, security, transparency, and respect of certain ethical constraints such as fairness. Solving these challenges involves a mix of measurement and/or audit to understand and assess issues, and modeling and optimization to propose and calibrate solutions.

Social and environmental responsibility Footprint of research activities

The carbon footprint of the team has been quite minimal in 2021 since there has been no travel allowed with most of us working from home. Our team does not train heavy ML models requiring important processing power although some of us perform computer science experiments, mostly using the Grid5000 platforms. We keep this usage very reasonable and rely on cheaper alternatives (e.g., simulations) as much as possible.

Along this line, an interesting initiative lies in the PhD work of Tom Cornebize 33 who evaluated the carbon footprint of this thesis. The total amount of greenhouse gas emissions of this thesis due to airplane transportation is 7.6 t of CO2eq and he estimates that the total amount of greenhouse gas emissions of this thesis due to computing is 10.6 t of CO2eq (a total of 2,112,014 core hours on Grid5000). The initial goal of the thesis was to evaluate (and possibly improve) the reliability of SimGrid to predict the performance of an MPI application like HPL. About 10% have been devoted to running SMPI simulations (to obtain predictions), 30% to run MPI applications (so as to compare the predictions with real executions), and 60% for a systematic performance measurement of a dozen of Grid5000 clusters over a long period of time (over two years). We are still unsure about whether this last part was worth the cost but this long and large-scale measurement was motivated by the need to characterize the variability of modern platforms, which is documented but largely underestimated by observational studies. Although we made our best to decrease the cost of this experiment from the start and proceeded gradually, the overall cost remains high. This study raises the question of the responsibility of the verification of the platform state (user vs. administrator) and we expect the lessons learned will be useful to the community.

Sens Workshop

On November 19th, a Sens Workshop was held, organized within the Inria DataMove and Polaris teams. We were ten participants. All participants were permanent members of one of the two teams. Participation in the workshop was on a voluntary basis. The day's proceedings followed four main axes: (1) Why do you do research? (2) Construction of a map of the expectations of everyone in the team. (3) Selection of two texts to be read and exchange around questions about the goal of research; (3) Prospective.

The first axe took place mainly in small groups. We were interested in why we work in the academic world and why this subject. It emerged that the motivations are very varied (ranging from intellectual curiosity to the desire to change the world), as well as the desire of several participants to change their object of study. The second session aimed to map the goals and constraints that bind us to our profession as researchers and to organize these different themes into a mural. Rigorous scientific production and education seem to be at the center of our priorities, while a question remains about the lack of group emulation and the too great part of individualism (and competition) in the current academic world. The last axes were the occasion to exchange around several texts on the impact of our profession of researcher in the current digital world, in particular linked to the fact that digital technology deeply modifies human activities and relationships, with a strong societal and environmental impact.

Without taking a concrete direction, this day was rich in learning. We think that it will be followed by other days of this type in the future.

Raising awareness on the climate crisis

Romain Couillet has organized several introductory seminars on the Anthropocene, which he has presented to students at the UGA and Grenoble-INP, as well as to associations in Grenoble (FNE, AgirAlternatif). He is also co-responsible of the Digital Transformation DU. He has published three articles on the issue of "usability" of artificial intelligence, and is the organizer of a special session on "Signal processing and resilience" for the GRETSI 2022 conference. He is also co-creator of the sustainable AI transversal axis of the MIAI project in Grenoble. Finally, he is a trainer for the "Fresque du Climat" and a member of Adrastia and FNE Isère.

Impact of research results

The efforts of B. Pradelski on COVID policy has received lots of media coverage in Le Monde and other major newspapers. See Section 11.3 for more details.

Highlights of the year Awards

R. Couillet has received the IEEE/SEE Prix Glavieux 2021 for his work on large dimension statistics for artificial intelligence.

Y.-P. Hsieh, P. Mertikopoulos, and V. Cevher have been accepted for a long talk to present their work on The limits of min-max optimization algorithms: Convergence to spurious non-critical sets at ICML 2021 25.

L. Nesi, A. Legrand, and L. Schnorr have received a best paper paward from the ICPP'21 conference for their work on Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application29.

New software and platforms New software SimGrid Keywords:

Large-scale Emulators, Grid Computing, Distributed Applications

Scientific Description:

SimGrid is a toolkit that provides core functionalities for the simulation of distributed applications in heterogeneous distributed environments. The simulation engine uses algorithmic and implementation techniques toward the fast simulation of large systems on a single machine. The models are theoretically grounded and experimentally validated. The results are reproducible, enabling better scientific practices.

Its models of networks, cpus and disks are adapted to (Data)Grids, P2P, Clouds, Clusters and HPC, allowing multi-domain studies. It can be used either to simulate algorithms and prototypes of applications, or to emulate real MPI applications through the virtualization of their communication, or to formally assess algorithms and applications that can run in the framework.

The formal verification module explores all possible message interleavings in the application, searching for states violating the provided properties. We recently added the ability to assess liveness properties over arbitrary and legacy codes, thanks to a system-level introspection tool that provides a finely detailed view of the running application to the model checker. This can for example be leveraged to verify both safety or liveness properties, on arbitrary MPI code written in C/C++/Fortran.

Functional Description:

News of the Year:

There were 3 major releases in 2021. A new API was introduced to create the platform descriptions directly from the source code instead of XML, providing much more expressiveness to the experimenters. SMPI now reports memory leaks and correctly diagnoses API misuses, which makes it even more adapted to teaching settings. The documentation was thoroughly overhauled to ease the use of the framework. We also pursued our efforts to improve the overall framework, through bug fixes, code refactoring and other software quality improvement.

URL:

https://simgrid.org/

Contact:

Martin Quinson

Participants:

Adrien Lebre, Anne-Cécile Orgerie, Arnaud Legrand, Augustin Degomme, Emmanuelle Saillard, Frédéric Suter, Jean-Marc Vincent, Jonathan Pastor, Luka Stanisic, Martin Quinson, Samuel Thibault

Partners:

CNRS, ENS Rennes

PSI Name:

Perfect Simulator

Keywords:

Markov model, Simulation

Functional Description:

Perfect simulator is a simulation software of markovian models. It is able to simulate discrete and continuous time models to provide a perfect sampling of the stationary distribution or directly a sampling of functional of this distribution by using coupling from the past. The simulation kernel is based on the CFTP algorithm, and the internal simulation of transitions on the Aliasing method.

News of the Year:

No active development. Maintenance is ensured by the POLARIS team. The next generation of PSI lies in the MARTO project.

URL:

https://gitlab.inria.fr/PSI/psi3/

Contact:

Jean-Marc Vincent

marmoteCore Name:

Markov Modeling Tools and Environments - the Core

Keywords:

Modeling, Stochastic models, Markov model

Functional Description:

marmoteCore is a C++ environment for modeling with Markov chains. It consists in a reduced set of high-level abstractions for constructing state spaces, transition structures and Markov chains (discrete-time and continuous-time). It provides the ability of constructing hierarchies of Markov models, from the most general to the particular, and equip each level with specifically optimized solution methods.

This software was started within the ANR MARMOTE project: ANR-12-MONU-00019.

News of the Year:

No active development. Current development lies now in the MARTO project (next generations of PSI and marmoteCore) and in the forthcoming Marmote project.

URL:

https://gitlab.inria.fr/PSI/marmotecore/

Publications:

hal-01651940, hal-01276456

Contact:

Alain Jean-Marie

Participants:

Alain Jean-Marie, Hlib Mykhailenko, Benjamin Briot, Franck Quessette, Issam Rabhi, Jean-Marc Vincent, Jean-Michel Fourneau

Partners:

Université de Versailles St-Quentin-en-Yvelines, Université Paris Nanterre

MarTO Name:

Markov Toolkit for Markov models simulation: perfect sampling and Monte Carlo simulation

Keywords:

Perfect sampling, Markov model

Functional Description:

MarTO is a simulation software of markovian models. It is able to simulate discrete and continuous time models to provide a perfect sampling of the stationary distribution or directly a sampling of functional of this distribution by using coupling from the past. The simulation kernel is based on the CFTP algorithm, and the internal simulation of transitions on the Aliasing method. This software is a rewrite, more efficient and flexible, of PSI

News of the Year:

No official release yet. The code development is in progress.

URL:

https://gitlab.inria.fr/MarTo/marto

Contact:

Vincent Danjean

GameSeer Keyword:

Game theory

Functional Description:

GameSeer is a tool for students and researchers in game theory that uses Mathematica to generate phase portraits for normal form games under a variety of (user-customizable) evolutionary dynamics. The whole point behind GameSeer is to provide a dynamic graphical interface that allows the user to employ Mathematica's vast numerical capabilities from a simple and intuitive front-end. So, even if you've never used Mathematica before, you should be able to generate fully editable and customizable portraits quickly and painlessly.

News of the Year:

No new release but the development is still active.

URL:

http://polaris.imag.fr/panayotis.mertikopoulos/publications/s01-gameseer/

Contact:

Panayotis Mertikopoulos

rmf_tool Name:

A library to Compute (Refined) Mean Field Approximations

Keyword:

Mean Field

Functional Description:

The tool accepts three model types:

- homogeneous population processes (HomPP) - density dependent population processes (DDPPs) - heterogeneous population models (HetPP)

In particular, it provides a numerical algorithm to compute the constant of the "refined mean field approximation" provided in the paper A Refined Mean Field Approximation by N. Gast and B. Van Houdt, accepted at SIGMETRICS 2018. And a framework to compute heterogeneous mean field approximations Mean Field and Refined Mean Field Approximations for Heterogeneous Systems: It Works! by N. Gast and S. Allmeier, accepted at SIGMETRICS 2022.

URL:

https://github.com/ngast/rmf_tool

Publications:

hal-01622054, tel-02509756, hal-03485044

Contact:

Nicolas Gast

New results

The new results produced by the team in 2020 can be grouped into the following categories.

HPC application analysis and optimization TomCornebizeVincentDanjeanArnaudLegrandLucasLeandro NesiJean-MarcVincent

Finely tuning applications and understanding the influence of key parameters (number of processes, granularity, collective operation algorithms, virtual topology, and process placement) is critical to obtain good performance on supercomputers. With the high consumption of running applications at scale, doing so solely to optimize their performance is particularly costly. We have shown in

that SimGrid and SMPI could be used to obtain inexpensive but faithful predictions of expected performance. The methodology we propose decouples the complexity of the platform, which is captured through statistical models of the performance of its main components (MPI communications, BLAS operations), from the complexity of adaptive applications by emulating the application and skipping regular non-MPI parts of the code. We demonstrate the capability of our method with High-Performance Linpack (HPL), the benchmark used to rank supercomputers in the TOP500, which requires careful tuning. This work presents an extensive (in)validation study that compares simulation with real experiments and demonstrates our ability to predict the performance of HPL within a few percent consistently. This study allows us to identify the main modeling pitfalls (e.g., spatial and temporal node variability or network heterogeneity and irregular behavior) that need to be considered. Our “surrogate” also allows studying several subtle HPL parameter optimization problems while accounting for uncertainty on the platform. This work is part of the PhD work of Tom Cornebize

and the spatial and temporal node variability has also been investigated and quantified through a systematically measurement of the performance of more than 450 nodes from a dozen of clusters of the Grid’5000 testbed for two years using a rigorous experimental discipline. Using a simple statistical test, we managed to detect many performance changes, from subtle ones of 1% to much more severe degradation of more than 10%, but which could significantly impact the outcome of experiments. The root cause behind detected performance changes ranges from BIOS and system upgrades to cooling issues, faulty memory, and power instability. These events went unnoticed by both Grid’5000 technical team and Grid’5000 users, yet they could greatly harm the reproducibility of experiments and lead to wrong scientific conclusions. All this work heavily builds on reproducible research methodology: the data and metadata collected for this work are permanently and publicly archived under an open license

and presented at through a collection of Jupyter notebooks at

cornebize.net/g5k_test/

Overall, over the last few years, the quality of SimGrid predictions for HPC applications has reach an unprecedented quality, which allows investigating and optimizing the performance of complex applications in a very controled and reproducible yet realistic way. In 29, we study ExaGeoStat, a task-based machine learning framework specifically designed for geostatistics data. Every iteration of this application comprises several phases that do not scale in the same way, which makes the load particularly challenging to balance. In this work, we show how such applications with multiple phases with distinct resource necessities can take advantage of inter-node heterogeneity to improve performance and reduce resource idleness. We first show how to improve application phase overlap by optimizing runtime and scheduling decisions and then how to compute the optimal distribution for all the phases using a linear program leveraging node heterogeneity while limiting communication overhead. The performance gains of our phase overlap improvements are between 36% and 50% compared to the original base synchronous and homogeneous execution. We show that by adding some slow nodes to a homogeneous set of fast nodes, we can improve the performance by another 25% compared to a standard block-cyclic distribution, thereby harnessing any machine. Most of these algorithmic and scheduling improvements have been investigated in simulation with StarPU-SimGrid as it allows for controled tracing and debugging on specific platform configurations before being confirmed through real experiments on real testbeds such as Grid 5000 and Santos Dumont.

Finally, we have shown in 43 how the structure of complex applications such as a multifrontal sparse linear solvers could be exploited to detect and correct non-trivial performance problems. Efficiently exploiting computational resources in heterogeneous platforms is a real challenge which has motivated the adoption of the task-based programming paradigm where resource usage is dynamic and adaptive. Unfortunately, classical performance visualization techniques used in routine performance analysis often fail to provide any insight in this new context, especially when the application structure is irregular. We propose and implement in StarVZ several performance visualization techniques tailored for the analysis of task-based multifrontal sparse linear solvers and show that by building on both a performance model of irregular tasks and on structure of the application (in particular the elimination tree), we can detect and highlight anomalies and understand resource utilization from the application point-of-view in a very insightful way. We validate these novel performance analysis techniques with the QR_mumps sparse parallel solver by describing a series of case studies where we identify and address non trivial performance issues thanks to our visualization methodology.

Large system analysis and optimization JonathaAnselmiNicolasGastOlivierBilenneSebastianAllmeierJean-MarcVincent

Large systems can be (1) particularly difficult to analyze because of inherent state-space explosion and (2) require robust and scalable scheduling techniques. In this series of work, we contribute to a better understanding of both aspects.

In 6, we study the impact of communication latency on the classical Work Stealing load balancing algorithm by extending the reference model. By using a theoretical analysis and simulation, we study the overall impact of latency on the Makespan (maximum completion time) and we derive a new expression of the expected running time of a bag of independent tasks scheduled by Work Stealing. This expression enables us to predict under which conditions a given run will yield acceptable performance. For instance, we can easily calibrate the maximal number of processors to use for a given work/platform combination. All our results are validated through simulation on a wide range of parameters.

A complementary approach to work stealing in such system is replication but it is a double-edged weapon that must be handled with caution as the resource overhead may be detrimental when used too aggressively. In 1, we provide a queueing-theoretic framework for job replication schemes based on the principle "replicate a job as soon as the system detects it as a straggler". This is called job speculation. Recent works have analyzed replication on arrival, which we refer to as replication. Replication is motivated by its implementation in Google's BigTable. However, systems such as Apache Spark and Hadoop MapReduce implement speculative job execution. The performance and optimization of speculative job execution is not well understood. To this end, we propose a queueing network model for load balancing where each server can speculate on the execution time of a job. Specifically, each job is initially assigned to a single server by a frontend dispatcher. Then, when its execution begins, the server sets a timeout. If the job completes before the timeout, it leaves the network, otherwise the job is terminated and relaunched or resumed at another server where it will complete. We provide a necessary and sufficient condition for the stability of speculative queueing networks with heterogeneous servers, general job sizes and scheduling disciplines. We find that speculation can increase the stability region of the network when compared with standard load balancing models and replication schemes. We provide general conditions under which timeouts increase the size of the stability region and derive a formula for the optimal speculation time, i.e., the timeout that minimizes the load induced through speculation. We compare speculation with redundant-d and redundant-to-idle-queued rules under an S&X model. For light loaded systems, redundancy schemes provide better response times. However, for moderate to heavy loadings, redundancy schemes can lose capacity and have markedly worse response times when compared with the proposed speculative scheme.

A key challenge in such systems comes from the structure and from the high variability of the execution time distribution. We also study the dispatching to parallel servers problem in 2 where we seek to minimize the average cost experienced by the system over an infinite time horizon. A standard approach for solving this problem is through policy iteration techniques, which relies on the computation of value functions. In this context, we consider the continuous-space $M / G / 1 - F C F S$ queue endowed with an arbitrarily-designed cost function for the waiting times of the incoming jobs. The associated relative value function is a solution of Poisson's equation for Markov chains, which in this work we solve in the Laplace transform domain by considering an ancillary, underlying stochastic process extended to (imaginary) negative backlog states. This construction enables us to issue closed-form relative value functions for polynomial and exponential cost functions and for piecewise compositions of the latter, in turn permitting the derivation of interval bounds for the relative value function in the form of power series or trigonometric sums. We review various cost approximation schemes and assess the convergence of the interval bounds these induce on the relative value function. Namely: Taylor expansions (divergent, except for a narrow class of entire functions with low orders of growth), and uniform approximation schemes (polynomials, trigonometric), which achieve optimal convergence rates over finite intervals. This study addresses all the steps to implementing dispatching policies for systems of parallel servers, from the specification of general cost functions towards the computation of interval bounds for the relative value functions and the exact implementation of the first-policy improvement step.

Finally, when the number of entities is large, most computations are untractable but mean field approximation is a powerful technique to study the performance of very large stochastic systems represented as systems of interacting objects. Applications include load balancing models, epidemic spreading, cache replacement policies, or large-scale data centers, for which mean field approximation gives very accurate estimates of the transient or steady-state behaviors. In a series of recent papers, a new and more accurate approximation, called the refined mean field approximation has been presented. A key strength of this technique lies in its applicability to not-so-large systems. Yet, computing this new approximation can be cumbersome. In 13, we present a tool, called rmf tool and available at github.com/ngast/rmf_tool, that takes the description of a mean field model, and can numerically compute its mean field approximations and refinement.

Energy optimization JonathaAnselmiBrunoGaujalPanayotisMertikopoulosStéphanPlassartLouis-SébastienRebuffi

Energy consumption is a major concern in modern architectures. In 7, we consider the classical problem of minimizing off-line the total energy consumption required to execute a set of n real-time jobs on a single processor with a finite number of available speeds. Each real-time job is defined by its release time, size, and deadline (all bounded integers). The goal is to find a processor speed schedule, such that no job misses its deadline and the energy consumption is minimal. We propose a pseudo-linear time algorithm that checks the schedulability of the given set of n jobs and computes an optimal speed schedule. The time complexity of our algorithm is in $O (n)$ , to be compared with $O (n log (n))$ for the best known solution. Besides the complexity gain, the main interest of our algorithm is that it is based on a completely different idea: instead of computing the critical intervals, it sweeps the set of jobs and uses a dynamic programming approach to compute an optimal speed schedule. Our linear time algorithm is still valid (with some changes) when arbitrary (non-convex) power functions and when switching costs are taken into account.

In 36, we consider this problem in a dynamic setting where a Dynamic Voltage and Frequency Scaling (DVFS) processor should execute jobs with obsolescence deadlines: A job becomes obsolete and is removed from the system if it is not completed before its deadline. The objective is to design a dynamic speed policy for the processor that minimizes its average energy consumption plus an obsolescence cost per deadline miss. Under Poisson arrivals and exponentially distributed deadlines and job sizes, we show that this problem can be modeled as a continuous time Markov decision process (MDP) with unbounded state space and unbounded rates. While this MDP admits a continuous time optimality equation for its average cost, the standard uniformization approach is not applicable. Inspired by the scaling method introduced by Blok and Spieksma, we first define a family of truncated MDPs and we then show that the optimal speed profiles are increasing in the number of jobs in the system and are uniformly bounded by a constant. Finally, we show that these properties are inherited from the original (infinite) system. The proposed upper bound on the optimal speed profile is tight and is used to develop an extremely simple policy that accurately approximates the optimal average cost in heavy traffic conditions.

Finally, we study power management in a distributed and online context in 12 through learning and game design. We consider the target-rate power management problem for wireless networks and we propose two simple, distributed power management schemes that regulate power in a provably robust manner by efficiently leveraging past information. Both schemes are obtained via a combined approach of learning and “game design” where we (1) design a game with suitable payoff functions such that the optimal joint power profile in the original power management problem is the unique Nash equilibrium of the designed game; and (2) derive distributed power managment algorithms by directing the networks' users to employ a no-regret learning algorithm to maximize their individual utility over time. To establish convergence, we focus on the well-known online eager gradient descent learning algorithm in the class of weighted strongly monotone games. In this class of games, we show that when players only have access to imperfect stochastic feedback, multi-agent online eager gradient descent converges to the unique Nash equilibrium in mean square at an $O (1 / T)$ rate. In the context of power management in static networks, we show that the designed games are weighted strongly monotone if the network is feasible (i.e. when all users can concurrently attain their target rates). This allows us to derive geometric convergence rate to the joint optimal transmission power. More importantly, in stochastic networks where channel quality fluctuates over time, the designed games are also weighted strongly monotone and the proposed algorithms converge in mean square to the joint optimal transmission power at a $O (1 / T)$ rate, even when the network is only feasible on average (i.e. users may be unable to meet their requirements with positive probability). This comes in stark contrast to existing algorithms (like the seminal Foschini–Miljanic algorithm and its variants) that may fail to converge altogether.

Learning in time varying systems Yu GuanHsiehPanayotisMertikopoulosBarryPradelskiPatrickLoiseau

Some form of stationarity and time synchronism is generally required to guarantee the efficiency of distributed algorithms in large systems. In this series of work, we work on releasing some of these properties.

One of the most widely used methods for solving large-scale stochastic optimization problems is distributed asynchronous stochastic gradient descent (DASGD), a family of algorithms that result from parallelizing stochastic gradient descent on distributed computing architectures (possibly) asychronously. However, a key obstacle in the efficient implementation of DASGD is the issue of delays: when a computing node contributes a gradient update, the global model parameter may have already been updated by other nodes several times over, thereby rendering this gradient information stale. These delays can quickly add up if the computational throughput of a node is saturated, so the convergence of DASGD may be compromised in the presence of large delays. In 11, we show that, by carefully tuning the algorithm's step-size, convergence to the critical set is still achieved in mean square, even if the delays grow unbounded at a polynomial rate. We also establish finer results in a broad class of structured optimization problems (called variationally coherent), where we show that DASGD converges to a global optimum with probability 1 under the same delay assumptions. Together, these results contribute to the broad landscape of large-scale non-convex stochastic optimization by offering state-of-the-art theoretical guarantees and providing insights for algorithm design.

In 42, we provide a general framework for studying multi-agent online learning problems in the presence of delays and asynchronicities. Specifically, we propose and analyze a class of adaptive dual averaging schemes in which agents only need to accumulate gradient feedback received from the whole system, without requiring any between-agent coordination. In the single-agent case, the adaptivity of the proposed method allows us to extend a range of existing results to problems with potentially unbounded delays between playing an action and receiving the corresponding feedback. In the multi-agent case, the situation is significantly more complicated because agents may not have access to a global clock to use as a reference point; to overcome this, we focus on the information that is available for producing each prediction rather than the actual delay associated with each feedback. This allows us to derive adaptive learning strategies with optimal regret bounds, even in a fully decentralized, asynchronous environment. Finally, we also analyze an "optimistic" variant of the proposed algorithm which is capable of exploiting the predictability of problems with a slower variation and leads to improved regret bounds.

We also use the dual averaging technique in 24, where we address an open network (agents can join and leave the network at any time) context. In networks of autonomous agents (e.g., fleets of vehicles, scattered sensors), the problem of minimizing the sum of the agents' local functions has received a lot of interest. Leveraging recent online optimization techniques, we propose and analyze the convergence of a decentralized asynchronous optimization method for open networks.

Finally, we examine in 38 and 27 the long-run behavior of multi-agent online learning in games that evolve over time. Specifically, we examine the equilibrium tracking and convergence properties of no-regret learning algorithms in continuous games that evolve over time. We focus on learning via "mirror descent", a widely used class of noregret learning schemes where players take small steps along their individual payoff gradients and then "mirror" the output back to their action sets, and we show that the induced sequence of play (a) converges to Nash equilibrium in time-varying games that stabilize in the long run to a strictly monotone limit; and (b) it stays asymptotically close to the evolving equilibrium of the sequence of stage games (assuming they are strongly monotone). Our results apply to both gradient-based and payoff-based feedback, i.e., the "bandit" case where players only observe the payoffs of their chosen actions.

Advanced learning methods KimonAntonakopoulosYu GuanHsiehPanayotisMertikopoulosBaryPradelski

Variational inequalities – and, in particular, stochastic variational inequalities – have recently attracted considerable attention in machine learning and learning theory as a flexible paradigm for "optimization beyond minimization", i.e., for problems where finding an optimal solution does not necessarily involve minimizing a loss function.

In 17, we analyze the convergence rate of optimistic mirror descent methods in stochastic variational inequalities. Our analysis reveals an intricate relation between the algorithm's rate of convergence and the local geometry induced by the method's underlying Bregman function. We quantify this relation by means of the Legendre exponent, a notion that we introduce to measure the growth rate of the Bregman divergence relative to the ambient norm near a solution. We show that this exponent determines both the optimal step-size policy of the algorithm and the optimal rates attained, explaining in this way the differences observed for some popular Bregman functions (Euclidean projection, negative entropy, fractional power, etc.).

3, we develop a new stochastic algorithm for solving pseudo-monotone stochastic variational inequalities. Our method builds on Tseng’s forward-backward- forward (FBF) algorithm, which is known in the deterministic literature to be a valuable alternative to Korpelevich’s extragradient method when solving variational inequalities over a convex and closed set governed by pseudo-monotone, Lipschitz continuous operators. The main computational advantage of Tseng’s algorithm is that it relies only on a single projection step and two independent queries of a stochastic oracle. Our algorithm incorporates a mini-batch sampling mechanism and leads to almost sure (a.s.) convergence to an optimal solution. To the best of our knowledge, this is the first stochastic look-ahead algorithm achieving this by using only a single projection at each iteration.

In 16, we examine a flexible algorithmic framework for solving monotone variational inequalities in the presence of randomness and uncertainty. The proposed template encompasses a wide range of popular first-order methods, including dual averaging, dual extrapolation and optimistic gradient algorithms – both adaptive and non-adaptive. Our first result is that the algorithm achieves the optimal rates of convergence for cocoercive problems when the profile of the randomness is known to the optimizer: $O (1 / \sqrt{T})$ for absolute noise profiles, and $O (1 / T)$ for relative ones. Subsequently, we drop all prior knowledge requirements (the absolute/relative variance of the randomness affecting the problem, the operator's cocoercivity constant, etc.), and we analyze an adaptive instance of the method that gracefully interpolates between the above rates, i.e., it achieves $O (1 / \sqrt{T})$ and $O (1 / T)$ in the absolute and relative cases, respectively. To our knowledge, this is the first universality result of its kind in the literature and, somewhat surprisingly, it shows that an extra-gradient proxy step is not required to achieve optimal rates.

An other challenging and promising problem motivated by applications in machine learning and operations research is stochastic regret minimization in non-convex problems:

In 21, we study regret regret minimization with stochastic first-order oracle feedback in online constrained, and possibly non-smooth, non-convex problems. In this setting, the minimization of external regret is beyond reach, so we focus on a local regret measure defined via a proximal-gradient mapping. To achieve no (local) regret in this setting, we develop a prox-grad method based on stochastic first-order feedback, and a simpler method for when access to a perfect first-order oracle is possible. Both methods are min-max order-optimal, and we also establish a bound on the number of prox-grad queries these methods require. As an important application of our results, we also obtain a link between online and offline non-convex stochastic optimization manifested as a new prox-grad scheme with complexity guarantees matching those obtained via variance reduction techniques.

In 22, we propose a hierarchical version of dual averaging for zeroth-order online non-convex optimization-i.e., learning processes where, at each stage, the optimizer is facing an unknown non-convex loss function and only receives the incurred loss as feedback. The proposed class of policies relies on the construction of an online model that aggregates loss information as it arrives, and it consists of two principal components: (a) a regularizer adapted to the Fisher information metric (as opposed to the metric norm of the ambient space); and (b) a principled exploration of the problem's state space based on an adapted hierarchical schedule. This construction enables sharper control of the model's bias and variance, and allows us to derive tight bounds for both the learner's static and dynamic regret-i.e., the regret incurred against the best dynamic policy in hindsight over the horizon of play.

Adaptive algorithms KimonAntonakopoulosYu GuanHsiehPanayotisMertikopoulosDong QuanVu

Designing algorithms that perform well in a variety of regime is particularly challenging. In a series of work, we study how to get the best of both worlds in a variety of contexts:

In 30, we examine an adaptive learning framework for nonatomic congestion games where the players' cost functions may be subject to exogenous fluctuations (e.g., due to disturbances in the network, variations in the traffic going through a link). In this setting, the popular multiplicative/ exponential weights algorithm enjoys an $O (1 / \sqrt{T})$ equilibrium convergence rate. However, this rate is suboptimal in static environments, i.e., when the network is not subject to randomness. In this static regime, accelerated algorithms achieve an $O (1 / T^{2})$ convergence speed, but they fail to converge altogether in stochastic problems. To fill this gap, we propose a novel, adaptive exponential weights method that seamlessly interpolates between the $O (1 / T^{2})$ and $O (1 / \sqrt{T})$ in the static and stochastic regimes respectively. Importantly, this "best-of-both-worlds" guarantee does not require any prior knowledge of the problem's parameters or tuning by the optimizer. In addition, the method's convergence speed depends subquadratically on the size of the network (number of vertices and edges), so it scales gracefully to large, real-life urban networks.

In 14, we present a new family of min-max optimization algorithms that automatically exploit the geometry of the gradient data observed at earlier iterations to perform more informative extra-gradient steps in later ones. Thanks to this adaptation mechanism, the proposed method automatically detects whether the problem is smooth or not, without requiring any prior tuning by the optimizer. As a result, the algorithm simultaneously achieves order-optimal convergence rates, i.e., it converges to an ε-optimal solution within O(1/ε) iterations in smooth problems, and within O(1/ε 2) iterations in non-smooth ones. Importantly, these guarantees do not require any of the standard boundedness or Lipschitz continuity conditions that are typically assumed in the literature; in particular, they apply even to problems with singularities (such as resource allocation problems and the like). This adaptation is achieved through the use of a geometric apparatus based on Finsler metrics and a suitably chosen mirror-prox template that allows us to derive sharp convergence rates for the methods at hand.

In 15, we propose a new family of adaptive first-order methods for a class of convex minimization problems that may fail to be Lipschitz continuous or smooth in the standard sense. Specifically, motivated by a recent flurry of activity on non-Lipschitz (NoLips) optimization, we consider problems that are continuous or smooth relative to a reference Bregman function-as opposed to a global, ambient norm (Euclidean or otherwise). These conditions encompass a wide range of problems with singular objective, such as Fisher markets, Poisson tomography, D-design, and the like. In this setting, the application of existing order-optimal adaptive methods-like UnixGrad or AcceleGrad-is not possible, especially in the presence of randomness and uncertainty. The proposed method, adaptive mirror descent (AdaMir), aims to close this gap by concurrently achieving min-max optimal rates in problems that are relatively continuous or smooth, including stochastic ones.

Finally, we study how such no-regret strategies fare in a multi-agent context. In game-theoretic learning, several agents are simultaneously following their individual interests, so the environment is non-stationary from each player's perspective. In this context, the performance of a learning algorithm is often measured by its regret. However, no-regret algorithms are not created equal in terms of game-theoretic guarantees: depending on how they are tuned, some of them may drive the system to an equilibrium, while others could produce cyclic, chaotic, or otherwise divergent trajectories. To account for this, we propose in 23 a range of no-regret policies based on optimistic mirror descent, with the following desirable properties: i) they do not require any prior tuning or knowledge of the game; ii) they all achieve $O (\sqrt{T})$ regret against arbitrary, adversarial opponents; and iii) they converge to the best response against convergent opponents. Also, if employed by all players, then iv) they guarantee O(1) social regret; while v) the induced sequence of play converges to Nash equilibrium with O(1) individual regret in all variationally stable games (a class of games that includes all monotone and convex-concave zero-sum games).

Exploiting past information is thus essential. In 8, we study the dynamics of price discovery in decentralized two-sided markets. We show that there exist memoryless dynamics that converge to the core of the underlying assignment game in which agents' actions depend only on their current payoff. However, we show that for any such dynamic the convergence time can grow exponentially in relation to the population size. We present a natural dynamic in which a player's reservation value provides a summary of his past information and show that this dynamic converges to the core in polynomial time in homogeneous markets.

Learning in Games KimonAntonakopoulosYu GuanHsiehPanayotisMertikopoulosBaryPradelskiPatrickLoiseau

Learning in Games is considerably more difficult than in classical minimization games as the resulting equilibria may be attractive or not and the dynamic often exhibit cyclic behaviors.

In this context, we examine in 19 the Nash equilibrium convergence properties of no-regret learning in general N-player games. For concreteness, we focus on the archetypal "follow the regularized leader" (FTRL) family of algorithms, and we consider the full spectrum of uncertainty that the players may encounter-from noisy, oracle-based feedback, to bandit, payoff-based information. In this general context, we establish a comprehensive equivalence between the stability of a Nash equilibrium and its support: a Nash equilibrium is stable and attracting with arbitrarily high probability if and only if it is strict (i.e., each equilibrium strategy has a unique best response). This equivalence extends existing continuous-time versions of the "folk theorem" of evolutionary game theory to a bona fide algorithmic learning setting, and it provides a clear refinement criterion for the prediction of the day-today behavior of no-regret learning in games.

We study convergence rate of a wide range of regularized methods for learning in games in 20. To that end, we propose a unified algorithmic template that we call "follow the generalized leader" (FTGL), and which includes as special cases the canonical "follow the regularized leader" algorithm, its optimistic variants, extra-gradient schemes, and many others. The proposed framework is also sufficiently flexible to account for several different feedback models – from full information to bandit feedback. In this general setting, we show that FTGL algorithms converge locally to strict Nash equilibria at a rate which does not depend on the level of uncertainty faced by the players, but only on the geometry of the regularizer near the equilibrium. In particular, we show that algorithms based on entropic regularization-like the exponential weights algorithm-enjoy a linear convergence rate, while Euclidean projection methods converge to equilibrium in a finite number of iterations, even with bandit feedback.

In 41, we examine the long-run behavior of a wide range of dynamics for learning in nonatomic games with finite action spaces and population games, in both discrete and continuous time. The class of dynamics under consideration includes fictitious play and its regularized variants, the best reply dynamics (again, possibly regularized), as well as the dynamics of dual averaging / "follow the regularized leader" (which themselves include as special cases the replicator dynamics and Friedman's projection dynamics). Our analysis concerns both the actual trajectory of play and its time-average, and we cover potential and monotone games, as well as games with an evolutionarily stable state (global or otherwise). Nonatomic games with continuous action spaces will be treated in detail in a second part.

Finally, we study the limits of min-max optimization algorithms in 25. Compared to minimization problems, the min-max landscape in machine learning applications is considerably more convoluted because of the existence of cycles and similar phenomena. Such oscillatory behaviors are well-understood in the convexconcave regime, and many algorithms are known to overcome them. In this paper, we go beyond the convex-concave setting and we characterize the convergence properties of a wide class of zeroth-, first-, and (scalable) second-order methods in non-convex/nonconcave problems. In particular, we show that these state-of-the-art min-max optimization algorithms may converge with arbitrarily high probability to attractors that are in no way min-max optimal or even stationary. Spurious convergence phenomena of this type can arise even in two-dimensional problems, a fact which corroborates the empirical evidence surrounding the formidable difficulty of training GANs.

Bandits YanChenBrunoDonassoloKimangKhunNicolasGastBrunoGaujalArnaudLegrandPanayotisMertikopoulos

The Multi-armed Stochastic Bandit framework is a classic reinforcement learning problem to study the exploration exploitation trade-off dilemma and for which several optimal algorithms like UCB 3 and Thompson sampling4, whose optimality has only recently been proved by Kaufmann et al.5, have been proposed. Although the first strategy is an optimistic strategy which systematically chooses the "most promising" arm, the second ones build on a Bayesian perspective and samples the posterior to decide which arm to select. The Markovian Bandit allows to model situations where the reward distribution is modeled as a Markov chain and may thus exhibit temporal changes. A key challenge in this context is the curse of dimensionality, which basically says that the state size of the Markov process is exponential in the number of the system components so that the complexity of computing an optimal policy and its value are exponential.

In 39, we study learning algorithms for the classical restful Markovian bandit (in which the state of each arm evolves only when it is chosen) problem with discount and compare posterior sampling strategies with optimistic strategies in terms of scalability. We explain how to adapt PSRL and UCRL2 to exploit the problem structure (MB-PSRL and MB-UCRL2). While the regret bound and runtime of vanilla implementations of PSRL and UCRL2 are exponential in the number of bandits, we show that the episodic regret of MB-PSRL and MB-UCRL2 is $O (S \sqrt{n K})$ , where $K$ is the number of episodes, $n$ is the number of bandits, and $S$ is the number of states of each bandit. Up to a factor $\sqrt{S}$ , this matches the lower bound of $Ω (\sqrt{S n K})$ that we also derive in the paper. MB-PSRL is also computationally efficient: its runtime is linear in the number of bandits. We further show that this linear runtime cannot be achieved by adapting classical non-Bayesian algorithms such as UCRL2 or UCBVI to Markovian bandit problems. Finally, we perform numerical experiments that confirm that MB-PSRL outperforms other existing algorithms in practice, both in terms of regret and of computation time.

In 40, we further develop complexity results for finite horizon restless bandits (in which the state of each arm evolves according to a Markov process independently of the learner's actions). Most restless Markovian bandits problems in infinite horizon can be solved quasioptimally: the famous Whittle index policy is known to become asymptotically optimal exponentially fast as the number of arms grows, at least under certain conditions (including having a so-called indexable problem). For restless Markovian bandits problems in finite horizons no such optimal policy is known. In this paper, we define a new policy, based on a linear program relaxation of the finite horizon problem (called the LP-filling policy), that is asymptotically optimal under no condition. Furthermore we show that for regular problems (defined in the paper) the LP-filling policy becomes an index policy (called the LP-regular policy) and becomes optimal exponentially fast in the number of arms. We also introduce the LP-update policy that significantly improves the performance compared to the LP-filling policy for large time horizons. We provide numerical studies that show the prevalence of our LP-policies over previous solutions.

Finally, in 4, we evaluate the relevance of bandit-like strategies for the Fog computing context and explore the information-coordination trade-off. Fog computing emerges as a potential solution to handle the growth of traffic and processing demands, providing nearby resources to run IoT applications. In this paper, we consider the reconfiguration problem, i.e., how to dynamically adapt the placement of IoT applications running in the Fog, depending on application needs and evolution of resource usage. We propose and evaluate a series of reconfiguration algorithms, based on both online scheduling (dynamic packing) and online learning (bandit) approaches. Through an extensive set of experiments in a realistic testbed built on Grid5000 and FIT-IoT lab, we demonstrate that the performance strongly and mainly depends on the quality and availability of information from both Fog infrastructure and IoT applications. We show that a reactive and greedy strategy can overcome the performance of state-of-the-art online learning algorithms, as long as the strategy has access to a little extra information.

Fairness and equity in digital (recommendation, advertising, persistant storage) systems Dong QuanVuVitaliiEmelianovNicolasGastPatrickLoiseauBenjaminRoussillon

The general deployment of machine-learning systems in many domains ranging from security to recommendation and advertising to guide strategic decisions leads to an interesting line of research from a game theory perspective.

A first line of work in this context is related to fairness and adversarial classification. Discrimination in selection problems such as hiring or college admission is often explained by implicit bias from the decision maker against disadvantaged demographic groups. In 5, we consider a model where the decision maker receives a noisy estimate of each candidate's quality, whose variance depends on the candidate's group-we argue that such differential variance is a key feature of many selection problems. We analyze two notable settings: in the first, the noise variances are unknown to the decision maker who simply picks the candidates with the highest estimated quality independently of their group; in the second, the variances are known and the decision maker picks candidates having the highest expected quality given the noisy estimate. We show that both baseline decision makers yield discrimination, although in opposite directions: the first leads to under-representation of the low-variance group while the second leads to under-representation of the high-variance group. We study the effect on the selection utility of imposing a fairness mechanism that we term the $ $γ$ $-rule (it is an extension of the classical four-fifths rule and it also includes demographic parity). In the first setting (with unknown variances), we prove that under mild conditions, imposing the $ $γ$ $-rule increases the selection utility-here there is no trade-off between fairness and utility. In the second setting (with known variances), imposing the $ $γ$ $-rule decreases the utility but we prove a bound on the utility loss due to the fairness mechanism.

We also consider classification in adversarial context. In 31, we consider the problem of finding optimal classifiers in an adversarial setting where the class-1 data is generated by an attacker whose objective is not known to the defender – an aspect that is key to realistic applications but has so far been overlooked in the literature. To model this situation, we propose a Bayesian game framework where the defender chooses a classifier with no a priori restriction on the set of possible classifiers. The key difficulty in the proposed framework is that the set of possible classifiers is exponential in the set of possible data, which is itself exponential in the number of features used for classification. To counter this, we first show that Bayesian Nash equilibria can be characterized completely via functional threshold classifiers with a small number of parameters. We then show that this low-dimensional characterization enables us to develop a training method to compute provably approximately optimal classifiers in a scalable manner; and to develop a learning algorithm for the online setting with low regret (both independent of the dimension of the set of possible data). We illustrate our results through simulations.

The Colonel Blotto game is a well-known resource allocation games introduced by Borel (1921) that finds application in many domains like politics (where political parties distribute their budgets to compete over voters), cybersecurity (where effort is distributed to attack/defend targets), online advertising (where marketing campaigns allocate the time to broadcast ads to attract web users), or telecommunication (where network service providers distribute and lease their spectrum to the users). In 32, we introduce the Colonel Blotto game with favoritism, an extension where the winner-determination rule is generalized to include pre-allocations and asymmetry of the players' resources effectiveness on each battlefield. Such favoritism is found in many classical applications of the Colonel Blotto game. We focus on the Nash equilibrium. First, we consider the closely related model of all-pay auctions with favoritism and completely characterize its equilibrium. Based on this result, we prove the existence of a set of optimal univariate distributions-which serve as candidate marginals for an equilibrium-of the Colonel Blotto game with favoritism and show an explicit construction thereof. In several particular cases, this directly leads to an equilibrium of the Colonel Blotto game with favoritism. In other cases, we use these optimal univariate distributions to derive an approximate equilibrium with well-controlled approximation error. Finally, we propose an algorithm-based on the notion of winding number in parametric curves-to efficiently compute an approximation of the proposed optimal univariate distributions with arbitrarily small error.

Finally, we propose in 28, a game-theoretic analysis of the transaction ordering protocol in the Bitcoin blockchain. Most public blockchain protocols, including the popular Bitcoin and Ethereum blockchains, do not formally specify the order in which miners should select transactions from the pool of pending (or uncommitted) transactions for inclusion in the blockchain. Over the years, informal conventions or "norms" for transaction ordering have, however, emerged via the use of shared software by miners, e.g., the GetBlockTemplate (GBT) mining protocol in Bitcoin Core. Today, a widely held view is that Bitcoin miners prioritize transactions based on their offered "transaction fee-per-byte. " Bitcoin users are, consequently, encouraged to increase the fees to accelerate the commitment of their transactions, particularly during periods of congestion. In this paper, we audit the Bitcoin blockchain and present statistically significant evidence of mining pools deviating from the norms to accelerate the commitment of transactions for which they have (i) a selfish or vested interest, or (ii) received dark-fee payments via opaque (non-public) side-channels. As blockchains are increasingly being used as a record-keeping substrate for a variety of decentralized (financial technology) systems, our findings call for an urgent discussion on defining neutrality norms that miners must adhere to when ordering transactions in the chains.

Policy responses to Covid-19 NicolasGastBrunoGaujalBaryPradelski

The Covid-19 pandemic has deeply impacted our lives and caused more than 5.49 million deaths worldwide, making it one of the deadliest pandemic in history. Several policies have been proposed to respond this illness and mitigate both its spread and its impact on both populations' health and economy. We have been studying and supporting some of policies through our expertise in large system analysis (game theory, mean field, etc.)

Bary Pradelski has been among the first researchers to actively promote Green zoning which has emerged as a widely used policy response to tackle the Covid-19 pandemic 10. ‘Green zones’ – areas where the virus is under control based on a uniform set of conditions – can progressively return to normal economic and social activity levels, and mobility between them is permitted. By contrast, stricter public health measures are in place in ‘red zones’, and mobility between red and green zones is restricted. France and Spain were among the first countries to introduce green zoning in April 2020. Subsequently, more and more countries followed suit and the European Commission advocated for the implementation of a European green zoning strategy, which has been supported by the EU member states. While there remain coordination problems, green zoning has proven to be an effective strategy for containing the spread of the virus and limiting its negative economic and social impact. This strategy should provide important lessons and prove useful in future outbreaks. Research in epidemiology indicates that thoroughly implemented and operationalised green zoning can prevent the spread of a transmittable disease that is poorly understood, highly virulent, and potentially highly lethal. Finally, there is strong evidence that green zoning can reduce economic and societal damage as it avoids worst-in-class measures.

Unfortunately, locking down entire regions is not satisfactory and has dramatic consequences on health, the economy, and civil liberties, and several countries have responded in very different ways. Some countries have consistently aimed for elimination – i.e., maximum action to control SARS-CoV-2 and stop community transmission as quickly as possible – while others have opted for mitigation – i.e., action increased in a step-wise, targeted way to reduce cases so as not to overwhelm health-care systems. In 9, we show that the former ones have generally fared much better than the latter ones by comparing deaths, gross domestic product (GDP) growth, and strictness of lock-down measures during the first 12 months of the pandemic. Furthermore, the mitigation has favored the proliferation of new SARS-CoV-2 variants and countries that opt to live with the virus will likely pose a threat to other countries, notably those that have less access to COVID-19 vaccines. Although many scientists have called for a coordinated international strategy to eliminate SARS-CoV-2, it has unfortunately not been heard yet.

Finally, we analyze in 18 a virus propagation dynamics in a large population of agents (or nodes) with three possible states (Susceptible, Infected, Recovered) where agents may choose to vaccinate. We show that this system admits a unique symmetric equilibrium when the number of agents goes to infinity. We also show that the vaccination strategy that minimizes the social cost has the same threshold structure as the mean field equilibrium, but with a shorter threshold. This implies that, to encourage optimal vaccination behavior, vaccination should always be subsidized.

Bilateral contracts and grants with industry TillKlettiPatrickLoiseau

Patrick Loiseau has a Cifre contract with Naver labs (2020-2023) on "Fairness in multi-stakeholder recommendation platforms”, which supports the PhD student Till Kletti.

Partnerships and cooperations NicolasGastGuillaumeHuardPatrickLoiseauPanayotisMertikopoulosBaryPradelskiBenjaminRoussillon International initiatives Associate Teams in the framework of an Inria International Lab or in the framework of an Inria International Program ReDaS Title:

Reproducible Data Science

Duration:

2019 – 2021

Coordinator:

Guillaume Huard and Lucas Mello Schnorr

Partners:

Universidade Federal do Rio Grande do Sul

Inria contact:

Guillaume Huard

Summary:

The main scientific context of this project is to develop novel analysis techniques and workflow methodologies to support reproducible data science. We focus our efforts along three axes:

Analysis Techniques: large volumes of data are hard to summarize using simple statistics that hides important behavior in the data. Therefore, raw information visualization plays a key role to explore such data, in particular when curating data and trying to develop intuition about the mathematical models underlying data. Yet, such visualizations require data aggregation, which may lead to significant information loss. It is thus essential to investigate adaptive data aggregation schemes that enable the reduction of the data while controlling the information loss.

Workflow Methodologies: the analysis process often involves a mix of tools to produce the end result. The data has to be filtered before it can be passed to some standard statistical tool to, eventually, produce some projection of the transformed data that can be visualized and studied by the analyst. Furthermore, the process is interactive: when the analyst is unsatisfied with the end result, a part of the analysis has to be changed to produce a new visualization. These adaptations of the whole analysis typically start from intermediate data and only a part of the analysis has to be rerun. The issue comes with the increasing size of these analysis, the disparity of the analysis tools and the large space of analysis parameters.

Evaluation: In the previous work packages we will propose both a theoretical and practical methodology whose relevance should be evaluated with real case studies. We will build our evaluation on well identified and quite different datasets originating from the following three areas, on which we already have some past experience: Performance analysis of HPC applications

These applications and their underlying runtimes tend to be increasingly complex and dynamic. As a consequence, their execution traces become too large and impossible to analyze with classical tools.

Long-term phenology behavior analysis and correlation with climate change

The phenology is the study of plant grow through the use of digital cameras attached to towers installed in the middle of the natural environments. These cameras take photos every a certain number of minutes and enable the researcher to verify how certain species grow, including their relation with the climate.

General public datasets from governement transparency reports

All public Brazilian institutions are obliged by law to provide datasets about any publicly-financed data measurements. The city of Porto Alegre has long-term weather datasets that contain temperature, pressure and other indicators from different parts of the city. The goal in this case study is very exploratory, for example to envision a way to represent such data in a geographical manner to verify if certain parts of the city may suffer from flash flood more than others.

National initiatives ANR ANR ORACLESS (JCJC 2016-2021)

☆

Online Resource Allocation, unprediCtable wireLESs Systems[207K€]

ORACLESS is an ANR starting grant (JCJC) coordinated by Panayotis Mertikopoulos. The goal of the project is to develop highly adaptive resource allocation methods for wireless communication networks that are provably capable of adapting to unpredictable changes in the network. In particular, the project will focus on the application of online optimization and online learning methodologies to multi-antenna systems and cognitive radio networks.

ANR ALIAS (PRCI 2020-2021)

☆

Adaptive Learning for Interactive Agents and Systems[284K€]

Partners: Singapore University of Technology and Design (SUTD).

ALIAS is a bilateral PRCI (collaboration internationale) project joint with Singapore University of Technology and Design (SUTD), coordinated by Bary Pradelski (PI) and involving P. Mertikopoulos and P. Loiseau. The Singapore team consists of G. Piliouras and G. Panageas. The goal of the project is to provide a unified answer to the question of stability in multi-agent systems: for systems that can be controlled (such as programmable machine learning models), prescriptive learning algorithms can steer the system towards an optimum configuration; for systems that cannot (e.g., online assignment markets), a predictive learning analysis can determine whether stability can arise in the long run. We aim at identifying the fundamental limits of learning in multi-agent systems and design novel, robust algorithms that achieve convergence in cases where conventional online learning methods fail.

ANR REFINO (JCJC 2020-2024)

☆

Refined Mean Field Optimization[250K€]

REFINO is an ANR starting grant (JCJC) coordinated by Nicolas Gast. The main objective on this project is to provide an innovative framework for optimal control of stochastic distributed agents. Restless bandit allocation is one particular example where the control that can be sent to each arm is restricted to an on/off signal. The originality of this framework is the use refined mean field approximation to develop control heuristics that are asymptotically optimal as the number of arms goes to infinity and that also have a better performance than existing heuristics for a moderate number of arms. As an example, we will use this framework in the context of smart grids, to develop control policies for distributed electric appliances.

ANR FAIRPLAY (JCJC 2021-2025)

☆

Fair algorithms via game theory and sequential learning[245K€]

FAIRPLAY is an ANR starting grant (JCJC) coordinated by Patrick Loiseau. Machine learning algorithms are increasingly used to optimize decision making in various areas, but this can result in unacceptable discrimination. The main objective of this project is to propose an innovative framework for the development of learning algorithms that respect fairness constraints. While the literature mostly focuses on idealized settings, the originality of this framework and central focus of this project is the use of game theory and sequential learning methods to account for constraints that appear in practical applications: strategic and decentralized aspects of the decisions and the data provided and absence of knowledge of certain parameters key to the fairness definition.

DGA Grants

Patrick Loiseau and Panayotis Mertikopoulos have a grant from DGA (2018-2021) that complements the funding of PhD student (Benjamin Roussillon) to work on game theoretic models for adversarial classification.

IRS/UGA

Projet DISCMAN (projet IRS de l'UGA). DISCMAN (Distributed Control for Multi-Agent systems and Networks) is a joint IRS project funded by IDEX Université Grenoble-Alpes. Its main objectives is to develop distributed equilibrium convergence algorithms for large-scale control and optimization problems, both offline and online. It is being coordinated by P. Mertikopoulos (POLARIS), and it involves a joint team of researchers from the LIG and LJK laboratories in Grenoble.

Dissemination JonathaAnselmiRomainCouilletVincentDanjeanNicolasGastBrunoGaujalGuillaumeHuardArnaudLegrandPatrickLoiseauPanayotisMertikopoulosBaryPradelski Promoting scientific activities Scientific events: organisation General chair, scientific chair

P. Loiseau has been chair of the steering committee of NetEcon;

Scientific events: selection Chair of conference program committees

P. Mertikopoulos has been a TPC co-chair for NetGCoOp 2020 (online, postponed from 2020), area chair for NeurIPS 2021, and area chair for ICLR 2021

Member of the conference program committees

P. Loiseau has been a PC member of AAAI 2021, ECML-PKDD 2021 (Area Chair), FAccT 2021 (Area Chair), and NetEcon;

J. Anselmi has been a PC member of IFIP Performance 2021 and IEEE MASCOTS 2021

N. Gast has been a PC member of Sigmetrics 2021 and NeurIPS 2021.

B. Pradelski has been a PC member of FAT* 2021.

Reviewer

B. Gaujal has been a reviewer for ACM Sigmetrics

Journal Member of the editorial boards

N. Gast is associate editor for Performance Evaluation and Stochastic Models.

R. Couillet is associate editor for Random Matrix Theory and Applications.

P. Mertikopoulos is an associate editor for Operations Research Letters, RAIRO Operations Research, EURO Journal on Computational Optimization, Journal on Dynamics and Games, Methodology and Computing in Applied Probability, Reviewer for:, Games and Economic Behavior, Journal of Economic Theory, Journal of Optimization Theory and Applications, Mathematics of Operations Research, Mathematical Programming, Operations Research, SIAM Journal on Control and Optimization, and SIAM Journal on Optimization

Reviewer - reviewing activities

P. Loiseau has been a reviewer for Games and Economics Behavior

B. Gaujal has been a reviewer for CHAOS and Games and Economic Behavior.

B. Pradelski has been a reviewer for Games and Economic Behavior, Operations Research, and International Journal of Game Theory

Invited talks

B. Gaujal has been invited to give a talk on Discrete Mean Field Games to the monthly seminar of the Gipsa Lab, Grenoble, in October 2021;

J. Anselmi has been invited to give a talk on Stability and Optimization of Speculative Queueing Networks at the 2021 INFORMS Annual Meeting;

R. Couillet has been invited several time to expose his work on sustainable AI and the anthropocene (ENS Paris, Ecole Polytechnique, TU Berlin, etc.).

B. Pradelsky has been invited to present his work on The effect of COVID certificates on vaccine uptake, health outcomes, and the economy to the Brussels-based economic think tank Bruegel.

P. Mertikopoulos has been invited to present his work on:

Adaptive Routing in Large-scale Networks: Optimality Under Uncertainty at NYU Operations Management Series

Online learning in games at NTUA (Computation and Reasoning Laboratory)

Generalized Robbins-Monro algorithms for min-min and min-max optimization at RWTH Aachen (Mathematics and Information Processing Seminar)

Online optimization: A unified view through the lens of stochastic approximation at Télécom ParisTech (Signal, Statistics & Learning Seminar)

Dynamics, (min-max) optimization, and games at TSE (MAD-Stat Seminar)

Spurious attractors in min-max optimization at Montréal Machine Learning and Optimization Seminar

Optimization, games, and dynamical systems at TOUTELIA 2021, Toulouse

A. Legrand has been invited to give lectures on Open Science and Reproducible Research at:

Laboratory Notebooks and Reproducible Research at the DKM department of the IRISA laboratory, Feb 2021.

Reproducibility Crisis and Open Science at the UGA Reproducible Research seminar, Apr 2021.

Obtaining Faithful/Reproducible Measurements on Modern CPUs at the Journée SIF on research reproducibility, May 2021.

Generating a Controled Software Environment with Debian Snapshot Archive at the GUIX workhop on software environment reproducibility, May 2021.

Reproducibility Crisis and Open Science at the reproducible research workshop of the Montpellier Bio-Stats network

Reproducibility Crisis and Open Science at the Sciences de l'information géographique reproductibles thematic school of the CIST-CNRS.

Research Administration

P. Loiseau is an expert for the FRS-FNRS

A. Legrand has participated to the Electronic Laboratory Notebook working group of the French CoSO (Comité pour la Science Ouverte), whose report has recently been produced.

A. Legrand is a member of the Inria Grenoble COS (conseil scientifique);

A. Legrand is a member of the CoSO UGA board (commission consultative science ouverte);

A. Legrand is in charge of the Distributed Systems, Parallel Computing, and Networks research area theme of the LIG;

N. Gast is co-responsible of the PhD program in Computer Science at the University Grenoble-Alpes (vice director of ED MSTII).

Teaching - Supervision - Juries

B. Gaujal has been a member of the jury for CR2 selection in Inria Grenoble

A. Legrand is a member of the section 6 of the CoNRS (Comité National de la Recherche Scientifique)

N. Gast served in the « Comité de spécialistes » (COS) for a MCF position in the G-SCOP laboratory (Grenoble)

P. Mertikopoulos served in the « Comité de spécialistes » (COS) for a MCF position in the Université Gustave Eiffel.

A. Legrand served in the « Comité de spécialistes » (COS) for a Professor position in the ENS Rennes.

A. Legrand was a reviewer for the PhD thesis of Mrs Yiqin Gao, from ENS Lyon: Scheduling independent tasks under budget and time constraints

P. Mertikopoulos was a member of the PhD thesis committee of Mr. Yassine Laguel from UGA: Optimisation convexe pour l’apprentissage robuste au risque

A. Legrand is a member of the mid-term PhD committee of Adeyemi Adetula (UGA)

P. Loiseau is a member of the mid-term PhD committee of O. Boufous (U. Avignon) and C. Pinzon (Ecole Polytechnique)

B. Gaujal is a member of the mid-term PhD committee of Michel Davydov (ENS Paris)

Teaching

J. Anselmi teaches the Probability and Simulation and the Performance Evaluation lectures in M1, Polytech Grenoble.

P. Loiseau teaches the M1 course at Ecole Polytechnique on Advanced Machine Learning and Autonomous Agents

A. Legrand and J.-M. Vincent teach the transversal Scientific Methodology and Empirical Evaluation lecture (36h) at the M2 MOSIG.

B. Gaujal teaches the M2 course on Optimization under uncertainties in M2 ORCO Grenoble

N. Gast is responsible of the course Reinforcement Learning in Master MOSIG/MSIAM (Grenoble) and of the course « Introduction to machine learning » (License 3, Grenoble).

G. Huard is responsible of the courses UNIX & C programming in the L1 and L3 INFO, of Object oriented and event-driven programming in the L3 INFO, and of the Objet oriented design in M1 INFO.

During the COVID crisis, teaching as been particularly difficult and has required major involvements and adaptations of teachers. G. Huard has implemented the following developments related to teaching activities:

Moodle VPL backend & engine for local execution

VSCode extensions to use VPLs and help with C programming (here and here)

Several VPL activities for the course in L1 INFO on Moodle

V. Danjean teaches the Operating Systems, Programming Languages, Algorithms, Computer Science and Mediation lectures in L3, M1 and Polytech Grenoble.

V. Danjean organized with J.M. Vincent the end of the DIU-EIL that have been setup for training high school professors to teach computer science.

R. Couillet is the initiator of a new lecture on the introduction to artificial intelligence in the L1 INFO.

The 3rd edition of the MOOC of A. Legrand, K. Hinsen and C. Pouzat on Reproducible research: Methodological principles for a transparent science is still running. Over the 3 editions (Oct.-Dec. 2018, Apr.-June 2019, March 2020 - end of 2022), about 17,800 persons have followed this MOOC and 1620 certificates of achievement have been delivered. 54% of participants are PhD students and 12% are undergraduates.

A. Legrand has organized with F. Theoleyre, B. Dnnassolo, M. Simonin, and G. Schreiner the 16th edition of the RSD GDR research school on Reproducibility and experimental research in networks and systems. This school has attrated 40 PhD students and has offered them to experiment with both the Grid5000 and FIT-IoTLab platforms throughout the whole week using reproducible research technologies (Jupyter notebooks, docker containers, git, the EnOSLib experiment engine, R for data analysis and experiment design, etc.). See here and here for more details.

Popularization

V. Danjean organized a Linux Install Party for all the students of INFO and IESE3 department of Polytech'Grenoble

R. Couillet has been interviewed by France Bleu Isère in the Retour sur Terre show (Dec. 9th 2021)

The efforts of B. Pradelski on COVID policy has received lots of media coverage in Le Monde and other major newspapers. See for example:

Le Monde: Couverture vaccinale: nombre de décès et PIB, le très fort impact du passe sanitaire

Les Echos: CoViD: le passe sanitaire a sauvé 4000 vies en France entre mi-juillet et fin 2021

Financial Times: Covid passes boosted economies and vaccine uptake, study shows

The Lancet article SARS-CoV-2 elimination, not mitigation, creates best outcomes for health, the economy, and civil liberties9 has been covered in the New York Times, Financial Times, Daily Mail, Süddeutsche Zeitung, Le Monde, Libération, Le Temps, Euronews, Le Devoir, The Hill, 20 minutos, Die Welt, The Guardian (five times), New Scientist, The Atlantic, Exame, L’Express, Die Zeit, Science, Politico

Several opinion articles have also been published in media:

‘Covid-19 : « Le passe sanitaire doit servir à accélérer les doses de rappel »’ with Miquel Oliu-Barton, Le Monde (24 November 2021).

‘Covid-19 : le faux dilemme entre santé, économie et libertés’ with Philippe Aghion, Patrick Artus, and Miquel Oliu-Barton, Commentaire (June 2021).

‘Le recours temporaire à un passe sanitaire est nécessaire si nous voulons une sortie de crise durable’ with Philippe Aghion, Philippe Martin, and Miquel Oliu-Barton, Le Monde (2 June 2021).

‘Covid-19 : « Nous avons besoin d’un “passe sanitaire” fiable, temporaire et accessible pour tout le monde »’ with Anne Bucher and Miquel Oliu-Barton, Le Monde (22 March 2021).

‘Aiming for zero Covid-19: Europe needs to take action’ with a group of international scientists, published by Bruegel, de Volkskrant, El Pais, la Repubblica, Le Monde, Rzeczpospolita, Süddeutsche Zeitung (15 February 2021).

Stability and Optimization of Speculative Queueing Networks J. Jonatha Anselmi N. Neil Walton IEEE/ACM Transactions on Networking 2021 1-12 Dispatching to parallel servers: Solutions of Poisson's equation for first-policy improvement O. Olivier Bilenne Queueing Systems October 2021 99 3 199–230 Minibatch forward-backward-forward methods for solving stochastic variational inequalities R. I. Radu Ioan Bot P. Panayotis Mertikopoulos M. Mathias Staudigl P. T. Phan Tu Vuong Stochastic Systems June 2021 11 2 112-139 Online Reconfiguration of IoT Applications in the Fog: The Information-Coordination Trade-off B. Bruno Donassolo A. Arnaud Legrand P. Panayotis Mertikopoulos I. Ilhem Fajjari IEEE Transactions on Parallel and Distributed Systems 2022 33 5 1156-1172 On fair selection in the presence of implicit and differential variance V. Vitalii Emelianov N. Nicolas Gast K. P. Krishna P Gummadi P. Patrick Loiseau Artificial Intelligence October 2021 302 1-20 Analysis of Work Stealing with latency N. Nicolas Gast M. Mohammed Khatiri D. Denis Trystram F. Frédéric Wagner Journal of Parallel and Distributed Computing July 2021 153 119-129 A Pseudo-Linear Time Algorithm for the Optimal Discrete Speed Minimizing Energy Consumption B. Bruno Gaujal A. Alain Girault S. Stéphan Plassart Discrete Event Dynamic Systems 2021 The importance of memory for price discovery in decentralized markets J. D. Jacob D Leshno B. S. Bary S R Pradelski Games and Economic Behavior January 2021 125 62-78 SARS-CoV-2 elimination, not mitigation, creates best outcomes for health, the economy, and civil liberties M. Miquel Oliu-Barton B. Bary Pradelski P. Philippe Aghion P. Patrick Artus I. Ilona Kickbusch J. Jeffrey Lazarus D. Devi Sridhar S. Samantha Vanderslott The Lancet June 2021 397 10291 2234-2236 Green zoning: An effective policy tool to tackle the Covid-19 pandemic B. Bary Pradelski M. Miquel Oliu-Barton Health Policy August 2021 125 8 981-986 Distributed stochastic optimization with large delays Z. Zhengyuan Zhou P. Panayotis Mertikopoulos N. Nicholas Bambos P. W. Peter W Glynn Y. Yinyu Ye Mathematics of Operations Research 2021 1-41 Robust power management via learning and game design Z. Zhengyuan Zhou P. Panayotis Mertikopoulos A. L. Aris L. Moustakas N. Nicholas Bambos P. W. Peter W. Glynn Operations Research January 2021 69 1 331-345 rmftool - A library to Compute (Refined) Mean Field Approximation(s) S. Sebastian Allmeier N. Nicolas Gast TOSME 2021 Online conference, France November 2021 Adaptive extra-gradient methods for min-max optimization and games K. Kimon Antonakopoulos V. E. Veronica E Belmega P. Panayotis Mertikopoulos ICLR 2021 - 9th International Conference on Learning Representations Virtual, Unknown Region May 2021 1-28 Adaptive first-order methods revisited: Convex optimization without Lipschitz requirements K. Kimon Antonakopoulos P. Panayotis Mertikopoulos NeurIPS 2021 - 35th International Conference on Neural Information Processing Systems Virtual, Unknown Region 2021 Sifting through the noise: Universal first-order methods for stochastic variational inequalities K. Kimon Antonakopoulos T. Thomas Pethick A. Ali Kavis P. Panayotis Mertikopoulos V. Volkan Cevher NeurIPS 2021 - 35th International Conference on Neural Information Processing Systems Virtual, Unknown Region December 2021 1-39 The last-iterate convergence rate of optimistic mirror descent in stochastic variational inequalities W. Waïss Azizian F. Franck Iutzeler J. Jérôme Malick P. Panayotis Mertikopoulos COLT 2021 - 34th Annual Conference on Learning Theory Boulder, United States August 2021 1-32 Vaccination in a Large Population: Mean Field Equilibrium versus Social Optimum B. Bruno Gaujal J. Josu Doncel N. Nicolas Gast NETGCOOP 2020 - 10th International Conference on NETwork Games, COntrol and OPtimization Cargèse, France September 2021 1-9 Survival of the strictest: Stable and unstable equilibria under regularized learning with partial information A. Angeliki Giannou E. V. Emmanouil Vasileios Vlatakis-Gkaragkounis P. Panayotis Mertikopoulos COLT 2021 - 34th Annual Conference on Learning Theory Boulder, United States August 2021 1-30 The convergence rate of regularized learning in games: From bandits and uncertainty to optimism and beyond A. Angeliki Giannou E. V. Emmanouil Vasileios Vlatakis-Gkaragkounis P. Panayotis Mertikopoulos NeurIPS 2021 - 35th International Conference on Neural Information Processing Systems Virtual, Unknown Region December 2021 1-28 Regret minimization in stochastic non-convex learning via a proximal-gradient approach N. Nadav Hallak P. Panayotis Mertikopoulos V. Volkan Cevher ICML 2021 - 38th International Conference on Machine Learning Vienna, Austria July 2021 Zeroth-order non-convex learning via hierarchical dual averaging A. Amélie Héliou M. Matthieu Martin T. Thibaud Rahier P. Panayotis Mertikopoulos ICML 2021 - 38th International Conference on Machine Learning Vienna, Austria July 2021 1-34 Adaptive learning in continuous games: Optimal regret bounds and convergence to Nash equilibrium Y.-G. Yu-Guan Hsieh K. Kimon Antonakopoulos P. Panayotis Mertikopoulos COLT 2021 - the 34th Annual Conference on Learning Theory Boulder, United States August 2021 1-34 Optimization in open networks via dual averaging Y.-G. Yu-Guan Hsieh F. Franck Iutzeler J. Jérôme Malick P. Panayotis Mertikopoulos CDC 2021 - 60th IEEE Annual Conference on Decision and Control Austin, United States December 2021 IEEE 1-7 The limits of min-max optimization algorithms: Convergence to spurious non-critical sets Y.-P. Ya-Ping Hsieh P. Panayotis Mertikopoulos V. Volkan Cevher ICML 2021 - 38th International Conference on Machine Learning Vienna, Austria July 2021 Introducing the Expohedron for Efficient Pareto-optimal Fairness-Utility Amortizations in Repeated Rankings T. Till Kletti J.-M. Jean-Michel Renders P. Patrick Loiseau WSDM 2022 - 15th ACM International Conference on Web Search and Data Mining Phoenix (virtual), United States February 2022 ACM 1-10 Equilibrium tracking and convergence in dynamic games P. Panayotis Mertikopoulos M. Mathias Staudigl CDC 2021 - 60th IEEE Annual Conference on Decision and Control Austin, United States December 2021 1-8 Selfish & Opaque Transaction Ordering in the Bitcoin Blockchain: The Case for Chain Neutrality J. Johnnatan Messias M. Mohamed Alzayat B. Balakrishnan Chandrasekaran K. P. Krishna P Gummadi P. Patrick Loiseau A. Alan Mislove IMC 2021 - ACM Internet Measurement Conference Virtual Event, France November 2021 1-16 Exploiting system level heterogeneity to improve the performance of a GeoStatistics multi-phase task-based application L. L. Lucas Leandro Nesi A. Arnaud Legrand L. Lucas Mello Schnorr ICPP 2021 - 50th International Conference on Parallel Processing Lemont, United States August 2021 1-10 Fast routing under uncertainty: Adaptive learning in congestion games with exponential weights D. Dong Quan Vu K. Kimon Antonakopoulos P. Panayotis Mertikopoulos NeurIPS 2021 - 35th International Conference on Neural Information Processing Systems Virtual, Unknown Region December 2021 1-36 Scalable Optimal Classifiers for Adversarial Settings under Uncertainty B. Benjamin Roussillon P. Patrick Loiseau GameSec 2021 - 12th Conference on Decision and Game Theory for Security Prague, Czech Republic 2021 1-20 Colonel Blotto Games with Favoritism: Competitions with Pre-allocations and Asymmetric Effectiveness D. Q. Dong Quan Vu P. Patrick Loiseau Proceedings of the 22nd ACM Conference on Economics and Computation (ACM EC '21) Budapest, Hungary July 2021 862-863 High Performance Computing : towards better Performance Predictions and Experiments T. Tom Cornebize June 2021 Toward transparent and parsimonious methods for automatic performance tuning P. H. Pedro Henrique Rocha Bruel July 2021 Learning in the Presence of Strategic Data Sources: Models and Solutions B. Benjamin Roussillon September 2021 Optimal Speed Profile of a DVFS Processor under Soft Deadlines J. Jonatha Anselmi B. Bruno Gaujal L. Louis Sébastien Rebuffi October 2021 Simulation-based Optimization and Sensibility Analysis of MPI Applications: Variability Matters T. Tom Cornebize A. Arnaud Legrand January 2022 Multi-agent online learning in time-varying games B. Benoît Duvocelle P. Panayotis Mertikopoulos M. Mathias Staudigl D. Dries Vermeulen 2021 Reinforcement Learning for Markovian Bandits: Is Posterior Sampling more Scalable than Optimism? N. Nicolas Gast B. Bruno Gaujal K. Kimang Khun June 2021 (Close to) Optimal Policies for Finite Horizon Restless Bandits N. Nicolas Gast B. Bruno Gaujal C. Chen Yan June 2021 Learning in nonatomic games, Part I: Finite action spaces and population games S. Saeed Hadikhanloo R. Rida Laraki P. Panayotis Mertikopoulos S. Sylvain Sorin September 2021 Multi-agent online optimization with delays: Asynchronicity, adaptivity, and optimism Y.-G. Yu-Guan Hsieh F. Franck Iutzeler J. Jérôme Malick P. Panayotis Mertikopoulos 2021 Performance Analysis of Irregular Task-Based Applications on Hybrid Platforms: Structure Matters M. C. Marcelo Cogo Miletto L. L. Lucas Leandro Nesi L. Lucas Mello Schnorr A. Arnaud Legrand July 2021 Measuring the Facebook Advertising Ecosystem A. Athanasios Andreou M. Marcio Silva F. Fabrício Benevenuto O. Oana Goga P. Patrick Loiseau A. Alan Mislove February 2019 1-15 Investigating Ad Transparency Mechanisms in Social Media: A Case Study of Facebook's Explanations A. Athanasios Andreou G. Giridhari Venkatadri O. Oana Goga K. P. Krishna P Gummadi P. Patrick Loiseau A. Alan Mislove February 2018 1-15 Combining Size-Based Load Balancing with Round-Robin for Scalable Low Latency J. Jonatha Anselmi IEEE Transactions on Parallel and Distributed Systems 2019 1-3 Asymptotically Optimal Size-Interval Task Assignments J. Jonatha Anselmi J. Josu Doncel IEEE Transactions on Parallel and Distributed Systems 2019 30 11 2422-2433 Power-of-d-Choices with Memory: Fluid Limit and Optimality J. Jonatha Anselmi F. François Dufour Mathematics of Operations Research 2020 45 3 862-888 Dimemas: Predicting MPI Applications Behaviour in Grid Environments R. M. Rosa M. Badia J. Jesús Labarta J. Judit Giménez F. Francesc Escalé 2003 xSim: The Extreme-Scale Simulator S. Swen Böhm C. Christian Engelmann 2011 Autotuning under Tight Budget Constraints: A Transparent Design of Experiments Approach P. Pedro Bruel S. Steven Quinito Masnada B. Brice Videau A. Arnaud Legrand J.-M. Jean-Marc Vincent A. Alfredo Goldman May 2019 IEEE 1-10 Comprehensive Performance Tracking with VAMPIR 7 H. Holger Brunst D. Daniel Hackenberg G. Guido Juckeland H. Heide Rohling 2010 Springer Berlin Heidelberg Penalty-Regulated Dynamics and Robust Learning Procedures in Games P. Pierre COUCHENEY B. Bruno Gaujal P. Panayotis Mertikopoulos Mathematics of Operations Research 2015 40 3 611-633 Performance analysis methods for list-based caches with non-uniform access G. Giuliano Casale N. Nicolas Gast IEEE/ACM Transactions on Networking December 2020 1-18 Equality of Voice: Towards Fair Representation in Crowdsourced Top-K Recommendations A. Abhijnan Chakraborty G. K. Gourab K Patro N. Niloy Ganguly K. P. Krishna P Gummadi P. Patrick Loiseau Proceedings of the ACM Conference on Fairness, Accountability, and Transparency (FAT*) January 2019 ACM 129-138 Fast and Faithful Performance Prediction of MPI Applications: the HPL Case Study T. Tom Cornebize A. Arnaud Legrand F. C. Franz C Heinrich September 2019 Simulating MPI applications: the SMPI approach A. Augustin Degomme A. Arnaud Legrand G. Georges Markomanolis M. Martin Quinson M. L. Mark Lee Stillwell F. Frédéric Suter IEEE Transactions on Parallel and Distributed Systems August 2017 28 8 14 Load Aware Provisioning of IoT Services on Fog Computing Platform B. Bruno Donassolo I. Ilhem Fajjari A. Arnaud Legrand P. Panayotis Mertikopoulos May 2019 IEEE Are mean-field games the limits of finite stochastic games? J. Josu Doncel N. Nicolas Gast B. Bruno Gaujal June 2016 Discrete Mean Field Games: Existence of Equilibria and Convergence J. Josu Doncel N. Nicolas Gast B. Bruno Gaujal Journal of Dynamics and Games 2019 6 3 1-19 Learning in time-varying games B. Benoît Duvocelle P. Panayotis Mertikopoulos M. Mathias Staudigl D. Dries Vermeulen October 2018 The Price of Local Fairness in Multistage Selection V. Vitalii Emelianov G. George Arvanitakis N. Nicolas Gast K. P. Krishna P Gummadi P. Patrick Loiseau August 2019 International Joint Conferences on Artificial Intelligence Organization 5836-5842 On Fair Selection in the Presence of Implicit Variance V. Vitalii Emelianov N. Nicolas Gast K. P. Krishna P. Gummadi P. Patrick Loiseau July 2020 ACM 649–675 No-regret learning and mixed Nash equilibria: They do not mix L. Lampros Flokas E. V. Emmanouil V Vlatakis-Gkaragkounis T. Thanasis Lianeas P. Panayotis Mertikopoulos G. Georgios Piliouras December 2020 1-24 A Visual Performance Analysis Framework for Task-based Parallel Applications running on Hybrid Clusters V. Vinicius Garcia Pinto L. M. Lucas Mello Schnorr L. Luka Stanisic A. Arnaud Legrand S. Samuel Thibault V. Vincent Danjean Concurrency and Computation: Practice and Experience April 2018 30 18 1-31 Size Expansions of Mean Field Approximation: Transient and Steady-State Analysis N. Nicolas Gast L. Luca Bortolussi M. Mirco Tribastone Performance Evaluation 2018 1-15 Expected Values Estimated via Mean-Field Approximation are 1/N-Accurate N. Nicolas Gast ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems , SIGMETRICS '17 June 2017 1 26 Exponential Convergence Rate for the Asymptotic Optimality of Whittle Index Policy N. Nicolas Gast B. Bruno Gaujal C. Chen Yan December 2020 A Refined Mean Field Approximation N. Nicolas Gast B. V. Benny Van Houdt June 2018 1 Linear Regression from Strategic Data Sources N. Nicolas Gast S. Stratis Ioannidis P. Patrick Loiseau B. Benjamin Roussillon ACM Transactions on Economics and Computation May 2020 8 2 1-24 A Refined Mean Field Approximation for Synchronous Population Processes N. Nicolas Gast D. Diego Latella M. Mieke Massink June 2018 1-3 Asymptotically Exact TTL-Approximations of the Cache Replacement Algorithms LRU(m) and h-LRU N. Nicolas Gast B. Benny Van Houdt September 2016 TTL Approximations of the Cache Replacement Algorithms LRU(m) and h-LRU N. Nicolas Gast B. Benny Van Houdt Performance Evaluation September 2017 A Linear Time Algorithm for Computing Off-line Speed Schedules Minimizing Energy Consumption B. Bruno Gaujal A. Alain Girault S. Stéphan Plassart November 2019 1-14 Discrete and Continuous Optimal Control for Energy Minimization in Real-Time Systems B. Bruno Gaujal A. Alain Girault S. Stéphan Plassart September 2020 IEEE 1-8 Dynamic Speed Scaling Minimizing Expected Energy Consumption for Real-Time Tasks B. Bruno Gaujal A. Alain Girault S. Stéphan Plassart Journal of Scheduling July 2020 1-25 Exploiting Job Variability to Minimize Energy Consumption under Real-Time Constraints B. Bruno Gaujal A. Alain Girault S. Stéphan Plassart November 2019 RR-9300 23 Visualizing the performance of parallel programs M. MT Heath J. JA Etheridge IEEE software 1991 8 5 Predicting the Energy Consumption of MPI Applications at Scale Using a Single Node F. C. Franz C. Heinrich T. Tom Cornebize A. Augustin Degomme A. Arnaud Legrand A. Alexandra Carpen-Amarie S. Sascha Hunold A.-C. Anne-Cécile Orgerie M. Martin Quinson September 2017 LogGOPSim - Simulating Large-Scale Applications in the LogGOPS Model T. Torsten Hoefler T. Timo Schneider A. Andrew Lumsdaine 2010 Scaling applications to massively parallel machines using Projections performance analysis tool L. V. Laxmikant V. Kalé G. Gengbin Zheng C. W. Chee Wai Lee S. Sameer Kumar Future Generation Comp. Syst. 2006 22 3 Using Simulation to Evaluate and Tune the Performance of Dynamic Load Balancing of an Over-decomposed Geophysics Application R. Rafael Keller Tesser L. Lucas Mello Schnorr A. Arnaud Legrand F. Fabrice Dupros P. O. Philippe O A Navaux August 2017 15 Performance Modeling of a Geophysics Application to Accelerate the Tuning of Over-decomposition Parameters through Simulation R. Rafael Keller Tesser L. Lucas Mello Schnorr A. Arnaud Legrand C. Christian Heinrich F. Fabrice Dupros P. O. Philippe Olivier Alexandre Navaux Concurrency and Computation: Practice and Experience 2018 1-21 ASGriDS: Asynchronous Smart-Grids Distributed Simulator T.-E. Takai-Eddine Kennouche F. Florent Cadoux N. Nicolas Gast B. Benoît Vinot December 2019 IEEE 1-5 Selection Problems in the Presence of Implicit Bias J. M. Jon M. Kleinberg M. Manish Raghavan 2018 33:1--33:17 Adapting Batch Scheduling to Workload Characteristics: What can we expect From Online Learning? A. Arnaud Legrand D. Denis Trystram S. Salah Zrigui May 2019 IEEE 686-695 Collisions groupées lors du mécanisme d'évitement de collisions de CPL-G3 M. Mouhcine Mendil N. Nicolas Gast H.-J. Henry-Joseph Audéoud September 2020 1-4 Optimistic Mirror Descent in Saddle-Point Problems: Going the Extra (Gradient) Mile P. Panayotis Mertikopoulos B. Bruno Lecouat H. Houssam Zenati C.-S. Chuan-Sheng Foo V. Vijay Chandrasekhar G. Georgios Piliouras May 2019 1-23 Quick or cheap? Breaking points in dynamic markets P. Panayotis Mertikopoulos H. Heinrich Nax B. Bary Pradelski July 2020 1-32 Cycles in adversarial regularized learning P. Panayotis Mertikopoulos C. H. Christos Harilaos Papadimitriou G. Georgios Piliouras January 2018 2703-2717 Learning in games via reinforcement learning and regularization P. Panayotis Mertikopoulos W. H. William H. Sandholm Mathematics of Operations Research November 2016 41 4 1297-1324 Riemannian game dynamics P. Panayotis Mertikopoulos W. H. William H. Sandholm Journal of Economic Theory September 2018 177 315-364 Forgetting the Forgotten with Lethe: Conceal Content Deletion from Persistent Observers M. Mohsen Minaei M. Mainack Mondal P. Patrick Loiseau K. P. Krishna P Gummadi A. Aniket Kate July 2019 1-21 VAMPIR: Visualization and Analysis of MPI Resources W. W.E. Nagel A. A. Arnold M. M. Weber H. H.C. Hoppe K. K. Solchenbach Supercomputer 1996 12 1 A vaccination policy by zones M. Miquel Oliu-Barton B. Bary Pradelski October 2020 Green bridges: Reconnecting Europe to avoid economic disaster M. Miquel Oliu-Barton B. Bary Pradelski 2020 PARAVER: A tool to visualise and analyze parallel code V. V. Pillet J. J. Labarta T. T. Cortes S. S. Girona 1995 44 Market sentiments and convergence dynamics in decentralized assignment economies B. S. Bary S R Pradelski H. H. Heinrich H Nax International Journal of Game Theory March 2020 49 1 275-298 Focus mass testing: How to overcome low test accuracy B. Bary Pradelski M. Miquel Oliu-Barton December 2020 Scalable performance analysis: the Pablo performance analysis environment D. DA Reed P. PC Roth R. RA Aydt K. KA Shields L. LF Tavera R. RJ Noe B. BW Schwartz 1993 The eyes have it: A task by data type taxonomy for information visualizations B. Ben Shneiderman 1996 Power Management and Dynamic Voltage Scaling: Myths and Facts D. C. David C. Snowdon S. Sergio Ruocco G. Gernot Heiser September 2005 Potential for Discrimination in Online Targeted Advertising T. Till Speicher M. Muhammad Ali G. Giridhari Venkatadri F. Filipe Ribeiro G. George Arvanitakis F. Fabrício Benevenuto K. P. Krishna P Gummadi P. Patrick Loiseau A. Alan Mislove February 2018 81 1-15 PSINS: An Open Source Event Tracer and Execution Simulator for MPI Applications M. Mustafa Tikir M. Michael Laurenzano L. Laura Carrington A. Allan Snavely 2009 Privacy Risks with Facebook’s PII-based Targeting: Auditing a Data Broker’s Advertising Interface G. Giridhari Venkatadri A. Athanasios Andreou Y. Yabing Liu A. Alan Mislove K. P. Krishna P Gummadi P. Patrick Loiseau O. Oana Goga Proceedings of the 39th IEEE Symposium on Security and Privacy (S&P) 2018 Congestion Avoidance in Low-Voltage Networks by using the Advanced Metering Infrastructure B. Benoıt Vinot F. Florent Cadoux N. Nicolas Gast December 2018 1-3 Decentralized Optimization of Energy Exchanges in an Electricity Microgrid B. Benoît Vinot F. Florent Cadoux R. Rodolphe Heliot June 2016 Decentralized optimization of energy exchanges in an electricity microgrid B. Benoît Vinot F. Florent Cadoux R. Rodolphe Héliot October 2016 IEEE 1-6 Scheduling for Reduced CPU Energy M. Mark Weiser B. Brent Welch A. Alan Demers S. Scott Shenker OSDI '94 Monterey, California 1994 USENIX Association 2–es Validation and Uncertainty Assessment of Extreme-Scale HPC Simulation through Bayesian Inference J. Jeremiah Wilke K. Khachik Sargsyan J. Joseph Kenny B. Bert Debusschere H. Habib Najm G. Gilbert Hendry 2013 Toward Scalable Performance Visualization with Jumpshot O. O. Zaki E. E. Lusk W. W. Gropp D. D. Swider International Journal of High Performance Computing Applications 1999 13 3 BigSim: A Parallel Simulator for Performance Prediction of Extremely Large Parallel Machines G. Gengbin Zheng G. Gunavardhan Kakulapati L. Laxmikant Kalé 2004 Improving the Performance of Batch Schedulers Using Online Job Runtime Classification S. Salah Zrigui R. Y. Raphael Y De Camargo A. Arnaud Legrand D. Denis Trystram October 2020