CONCACE

CONCACE - 2025

2025Activity reportProject-Team‌CONCACE

RNSR: 202224319T

Research‌‌ center Inria Centre at the University of Bordeaux‌
In partnership with:Airbus‌ Central Research & Technology,‌‌ Centre Européen de Recherche et de Formation Avancée‌ en Calcul Scientifique
Team‌ name: Numerical and Parallel‌‌ Composability for High Performance‌ Computing

Creation of the Project-Team: 2022 September 01‌

Each year, Inria research teams publish an Activity‌ Report presenting their work and results over the‌ reporting period. These reports follow a common structure,‌ with some optional sections depending on the specific‌ team. They typically begin by outlining the overall‌ objectives and research programme, including the main research‌ themes, goals, and methodological approaches. They also describe‌ the application domains targeted by the team, highlighting‌ the scientific or societal contexts in which their‌ work is situated.

The reports then present the‌ highlights of the year, covering major scientific achievements,‌ software developments, or teaching contributions. When relevant, they‌ include sections on software, platforms, and open data,‌ detailing the tools developed and how they are‌ shared. A substantial part is dedicated to new‌ results, where scientific contributions are described in detail,‌ often with subsections specifying participants and associated keywords.‌

Finally, the Activity Report addresses funding, contracts, partnerships,‌ and collaborations at various levels, from industrial agreements‌ to international cooperations. It also covers dissemination and‌ teaching activities, such as participation in scientific events,‌ outreach, and supervision. The document concludes with a‌ presentation of scientific production, including major publications and‌ those produced during the year.

Keywords

Computer Science‌ and Digital Science

A1.1.4. High performance computing
A1.1.5.‌ Exascale
A1.1.9. Fault tolerant systems
A6.2.5. Numerical Linear‌ Algebra
A6.2.7. HPC for machine learning
A6.3. Computation-data‌ interaction
A7.1. Algorithms
A8.2. Optimization
A8.10. Computer arithmetic‌
A9.2. Machine learning
A9.7. AI algorithmics
A9.10. Hybrid‌ approaches for AI

1 Team members,‌ visitors, external collaborators

Research Scientists

Luc Giraud [‌Team leader, INRIA, Senior Researcher,‌ HDR]
Carola Kruse [Team leader,‌ CERFACS, Senior Researcher]
Jean-Marie Couteyen [‌Team leader, AIRBUS, Senior Researcher,‌ from Feb 2025]
Emmanuel Agullo [INRIA‌, Researcher]
Pierre Benjamin [AIRBUS,‌ Senior Researcher]
Olivier Coulaud [INRIA,‌ Senior Researcher, HDR]
Clement Guillet [‌INRIA, ISFP, from Oct 2025]‌
Sofiane Haddad [AIRBUS, Senior Researcher]‌
Paul Mycek [CERFACS, Senior Researcher]‌
Guillaume Sylvand [AIRBUS, Senior Researcher]‌

Post-Doctoral Fellows

Joanna Bisch [INRIA, from‌ Sep 2025, Post-Doctoral Fellow]
Théo Boivin‌ [CERFACS, from Apr 2025, Post-Doctoral‌ Fellow]
Augustin Leclerc [INRIA, Post-Doctoral‌ Fellow]
Stojche Nakov [INRIA, Post-Doctoral‌ Fellow]

PhD Students

Theo Briquet [INRIA‌]
Sai Aakash Dasari [CERFACS]
Hugo‌ Dodelin [INRIA]
El Mehdi Ettaouchi [‌EDF, CIFRE]
Antoine Gicquel [INRIA‌]
Alexandre Malhene [INRIA]
Clément Peaucelle‌ [AIRBUS, CIFRE]
Amine El Mehdi‌ Zekri [TOULOUSE INP]

Technical Staff

Ludovic‌ Courtes [INRIA, Engineer, (SED, 20 %)]
Arthur Gouinguenet‌ [INRIA, Engineer‌, from Oct 2025‌‌]
Esragul Korkmaz [INRIA, Engineer]‌
Gilles Marait [INRIA‌, Engineer, (SED,‌‌ 90%)]
Florent Pruvost [INRIA, Engineer‌, (SED, 40%)]‌

Interns and Apprentices

Mathis‌‌ Foussac [INRIA, Apprentice, from Nov‌ 2025]
Mathis Foussac‌ [INRIA, Intern‌‌, from Jun 2025 until Aug 2025]‌
Vivien Nebout [INRIA‌, Intern, from‌‌ Jun 2025 until Aug 2025]
Juliette Petit‌ [INRIA, Intern‌, from Jun 2025‌‌ until Aug 2025]

Administrative Assistants

Catherine Cattaert‌ Megrat [INRIA]‌
Marie-Melissandre Roy [INRIA‌‌]

External Collaborators

Luciano Drozda [CERFACS]‌
Marek Felsoci [UVSQ‌, from Dec 2025‌‌]
Jean-Rene Poirier [TOULOUSE INP, HDR‌]

2 Overall objectives‌

Over the past few‌‌ decades, there have been innumerable science, engineering and‌ societal breakthroughs enabled by‌ the development of high‌‌ performance computing (HPC) applications, algorithms and architectures. These‌ powerful tools have enabled‌ researchers to find computationally‌‌ efficient solutions to some of the most challenging‌ scientific questions and problems‌ in medicine and biology,‌‌ climate science, nanotechnology, energy, and environment – to‌ name a few –‌ in the field of‌‌ model-driven computing. Meanwhile the advent of network‌ capabilities and IoT, next‌ generation sequencing, ... tend‌‌ to generate a huge amount of data that‌ deserves to be processed‌ to extract knowledge and‌‌ possible forecasts. These calculations are often referred to‌ as data-driven calculations.‌ These two classes of‌‌ challenges have a common ground in terms of‌ numerical techniques that lies‌ in the field of‌‌ linear and multi-linear algebra. They do also share‌ common bottlenecks related to‌ the size of the‌‌ mathematical objects that we have to represent and‌ work on; those challenges‌ retain a growing attention‌‌ from the computational science community.

In this context,‌ the purpose of the‌ concace project, is to‌‌ contribute to the design of novel numerical tools‌ for model-driven and data-driven‌ calculations arising from challenging‌‌ academic and industrial applications. The solution of these‌ challenging problems requires a‌ multidisciplinary approach involving applied‌‌ mathematics, computational and computer sciences. In applied mathematics,‌ it essentially involves advanced‌ numerical schemes both in‌‌ terms of numerical techniques and data representation of‌ the mathematical objects (e.g.,‌ compressed data, low-rank tensor‌‌ 85, 92, 81 low-rank hierarchical matrices‌ 83, 69).‌ In computational science, it‌‌ involves large scale parallel heterogeneous computing and the‌ design of highly composable‌ algorithms. Through this‌‌ approach, concace intends to contribute to all the‌ steps that go from‌ the design of new‌‌ robust and accurate numerical schemes to the flexible‌ implementations of the associated‌ algorithms on large computers.‌‌ To address these research challenges, researchers from Inria,‌ Airbus Central R&T and‌ Cerfacs have decided to‌‌ combine their skills and research efforts to create‌ the Inria concace project‌ team, which will allow‌‌ them to cover the‌ entire spectrum, from fundamental methodological concerns to full‌ validations on challenging industrial test cases. Such a‌ joint project will enable a real synergy between‌ basic and applied research with complementary benefits to‌ all the partners. The main benefits for each‌ partner are given below:

Airbus Central R&T
- Push‌ our specific needs and use-cases towards the academic‌ world to stimulate research in particular directions;
- Remain‌ at the level of the scientific state of‌ the art, this collaboration allows us to facilitate‌ the feedback by exposing directly our challenges and‌ industrial applications to eventually facilitate the transfer of‌ research in our design tools;
- The Inria research‌ model will naturally be extended to Airbus, allowing‌ for the multiplication of ambitious, very upstream and‌ long-term research, while at the same time directly‌ applying to the needs expressed by Airbus;
- Benefit‌ from the very high-level international network of the‌ Inria team (e.g., Univ. of Tennessee Knoxville, Barcelona‌ supercomputing center, Julich supercomputing center, Lawrence Berkeley National‌ Lab, Sandia National Lab, etc.).
Cerfacs
- Join forces,‌ in terms of skills and expertise, with Inria‌ and Airbus to make faster and more effective‌ progress on the research areas addressed by the‌ team;
- Bring scientific challenges from industrial applications through‌ our privileged relationship with our industrial partners;
- Reciprocally,‌ promote the developed methodologies and the obtained results‌ towards our industrial partners;
- Naturally interact with the‌ national and european HPC ecosystems, as a member‌ of the EuroHPC national competence center on HPC,‌ to promote the research activities and tools of‌ the team and to meet novel scientific challenges‌ where our methodologies or tools apply.
Inria
- Reinforce‌ the impact of our research through a direct‌ contact and close interactions with real scientific and‌ technical challenges;
- Feed the virtuous feedback cycle between‌ academic research and industrially-relevant applications enabling the emergence‌ of new research avenues;
- Create a privileged space‌ for an open scientific dialogue enabling the fostering‌ of existing synergies and to create new ones,‌ in particular when one of the industrial partners‌ is a large group whose spectrum of scientific‌ problems is very broad.

In addition to the‌ members of these entities, two other external collaborators‌ will be strongly associated: Jean-René Poirier, from Laplace‌ Laboratory at University of Toulouse) and Oguz Kaya,‌ from LISN (Laboratoire Interdisciplinaire des Sciences du Numérique)‌ at University of Saclay.

The scientific objectives described‌ in Section 4 contain two main topics which‌ cover numerical and computational methodologies. Each of the‌ topic is composed of a methodological component and‌ its validation counterpart to fully assess the relevance,‌ robustness and effectiveness of the proposed solutions. First,‌ we address numerical linear and multilinear algebra methodologies‌ for model- and data-driven scientific computing. Second, because‌ there is no universal single solution but rather‌ a large panel of alternatives combining many of‌ the various building boxes, we also consider research‌ activities in the field of composition of parallel‌ algorithms and data distributions to ease the investigation of this combinatorial problem‌ toward the best algorithm‌ for the targeted problem.‌‌

To illustrate on a single but representative example‌ of model-driven problems that‌ the joint team will‌‌ address we can mention one encountered at Airbus‌ that is related to‌ large aero-acoustic calculations. The‌‌ reduction of noise produced by aircraft during take-off‌ and landing has a‌ direct societal and environmental‌‌ impact on the populations (including citizen health) located‌ around airports. To comply‌ with new noise regulation‌‌ rules, novel developments must be undertaken to preserve‌ the competitiveness of the‌ European aerospace industry. In‌‌ order to design and optimize new absorbing materials‌ for acoustics and reduce‌ the perceived sound, one‌‌ must be able to simulate the propagation of‌ an acoustic wave in‌ an aerodynamic flow: The‌‌ physical phenomenon at stake is aero-acoustics. The‌ complex and chaotic nature‌ of fluid mechanics requires‌‌ simplifications in the models used. Today, we consider‌ the flow as non-uniform‌ only in a small‌‌ part of the space (in the jet flow‌ of the reactors mainly)‌ which will be meshed‌‌ in volume finite elements, and everywhere else the‌ flow will be considered‌ as uniform, and the‌‌ acoustic propagation will be treated with surface finite‌ elements. This brings us‌ back to the solution‌‌ of a linear system with dense and sparse‌ parts, an atypical form‌ for which there is‌‌ no "classical" solver available. We therefore have to‌ work on the coupling‌ of methods (direct or‌‌ iterative, dense or sparse, compressed or not, etc.),‌ and to compose different‌ algorithms in order to‌‌ be able to handle very large industrial cases.‌ While there are effective‌ techniques to solve each‌‌ part independently from one another, there is no‌ canonical, efficient solution for‌ the coupled problem, which‌‌ has been much less studied by the community.‌ Among the possible improvements‌ to tackle such a‌‌ problem, hybridizing simulation and learning represents an alternative‌ which allows one to‌ reduce the complexity by‌‌ avoiding as much as possible local refinements and‌ therefore reduce the size‌ of the problem.

Regarding‌‌ data-driven calculation, climate data analysis is one‌ of the application domains‌ that generate huge amounts‌‌ of data, either in the form of measurements‌ or computation results. The‌ ongoing effort between the‌‌ climate modeling and weather forecasting community to mutualize‌ digital environement, including codes‌ and models, leads the‌‌ climate community to use finer models and discretization‌ generating an ever growing‌ amount of data. The‌‌ analysis of these data, mainly based on classical‌ numerical tools with a‌ strong involvement of linear‌‌ algebra ingredients, is facing new scalability challenges due‌ to this growing amount‌ of data. Computed and‌‌ measured data have intrinsic structures that could be‌ naturally exploited by low‌ rank tensor representations to‌‌ best reveal the hidden structure of the data‌ while addressing the scalability‌ problem. The close link‌‌ with the CECI team at Cerfacs will provide‌ us with the opportunity‌ to study novel numerical‌‌ methodologies based on tensor‌ calculation. Contributing to a better understanding of the‌ mechanisms governing the climate change would obviously have‌ significant societal and economical impacts on the population‌. This is just an illustration of a‌ possible usage of our work, we could also‌ have possibly mentioned an on-going collaboration where our‌ tools will be used in the context of‌ a steel company to reduce the data volume‌ generated by IoT to be transferred on the‌ cloud for the analysis. The methodological part described‌ in Section 4 covers mostly two complementary topics:‌ the first in the field of numerical scientific‌ computing and the second in the core of‌ computational sciences.

To sum-up, for each of the‌ methodological contributions, we aim to find at least‌ one dimensioning application, preferably from a societal challenge,‌ which will allow us to validate these methods‌ and their implementations at full-scale. The search for‌ these applications will initially be carried out among‌ those available at Airbus or Cerfacs, but the‌ option of seeking them through collaborations outside the‌ project will remain open. The ambition remains to‌ develop generic tools whose implementations will be made‌ accessible via their deposit in the public domain.‌

3 Research program

The methodological component of our‌ proposal concerns the expertise for the design as‌ well as the efficient and scalable implementation of‌ highly parallel numerical algorithms. We intend to go‌ from numerical methodology studies to design novel numerical‌ schemes up to the full assessment at scale‌ in real case academic and industrial applications thanks‌ to advanced HPC implementations.

Our view of the‌ research activity to be developed in Concace is‌ to systematically assess the methodological and theoretical developments‌ in real scale calculations mostly through applications under‌ investigations by the industrial partners (namely Airbus Central‌ R&T and Cerfacs).

We first consider in Section‌ 4.1 topics concerning parallel linear and multi-linear algebra‌ techniques that currently appear as promising approaches to‌ tackle huge problems both in size and in‌ dimension on large numbers of cores. We highlight‌ the linear problems (linear systems or eigenproblems) because‌ they are in many large scale applications the‌ main bottleneck and the most computational intensive numerical‌ kernels. The second research axis, presented in Section‌ 4.2, is related to the challenge faced when‌ advanced parallel numerical toolboxes need to be composed‌ to easily find the best suited solution both‌ from a numerical but also parallel performance point‌ of view.

In short the research activity will‌ rely on two scientific pillars, the first dedicated‌ to the development of new mathematical methods for‌ linear and mutilinear algebra (both for model-driven and‌ data-driven calculations). The second pillar will be on‌ parallel computational methods enabling to easily compose in‌ a parallel framework the packages associated with the‌ methods developed as outcome of the first pillar.‌ The mathematical methods from the first pillar can‌ mathematically be composed, the challenge will be to‌ do on large parallel computers thank to the outcome of the second‌ pillar. We will still‌ validate on real applications‌‌ and at scale (problem and platform) in close‌ collaborations with application experts.‌

3.1 Numerical algebra methodologies‌‌ in model and data-driven scientific computing

At the‌ core of many simulations,‌ one has to solve‌‌ a linear algebra problem that is defined in‌ a vector space and‌ that involves linear operators,‌‌ vectors and scalars, the unknowns being usually vectors‌ or scalars, e.g. for‌ the solution of a‌‌ linear system or an eigenvalue problem. For many‌ years, in particular in‌ model-driven simulations, the problems‌‌ have been reformulated in classical matrix formalism possibly‌ unfolding the spaces where‌ the vectors naturally live‌‌ (typically 3D PDEs) to end up with classical‌ vectors in $R^{n‌}$ or $C^{n}$ .‌‌ For some problems, defined in higher dimension (‌e.g., time dependent‌ 3D PDE), the other‌‌ dimensions are dealt in a problem specific fashion‌ as unfolding those dimensions‌ would lead to too‌‌ large matrices/vectors. The concace research program on numerical‌ methodology intends to address‌ the study of novel‌‌ numerical algorithms to continue addressing the mainstream approaches‌ relying on classical matrix‌ formalism but also to‌‌ investigate alternatives where the structure of the underlying‌ problem is kept preserved‌ and all dimensions are‌‌ dealt with equally. This latter research activity mostly‌ concerns linear algebra in‌ tensor spaces. In‌‌ terms of algorithmic principles, we will lay an‌ emphasis on hierarchy as‌ a unifying principle for‌‌ the numerical algorithms, the data representation and processing‌ (including the current hierarchy‌ of arithmetic) and the‌‌ parallel implementation towards scalability.

3.1.1 Scientific computing in‌ large size linear algebra‌

As an extension of‌‌ our past and on-going research activities, we will‌ continue our works on‌ numerical linear algebra for‌‌ model-driven applications that rely on classical vectorial spaces‌ defined on $R^{n‌}$ and $C^{n}$ ,‌‌ where vectors and matrices are classical sparse or‌ dense objects encountered in‌ regular numerical linear algebra‌‌ computations.

The main numerical algorithms we are interested‌ in are:

Matrix decompositions‌ including classical ones such‌‌ as the $Q R$ factorization that plays a‌ central role in block‌ Krylov solvers 65,‌‌ 80, randomized range finder algorithms 68,‌ 67, to name‌ a few, as building‌‌ orthonormal basis of subspaces guarantees numerical robustness. But‌ also other factorizations, not‌ used in classical linear‌‌ algebra for model-driven calculation, such as non-negative factorization‌ encountered in data-science for‌ multi-variable analysis 79,‌‌ 74.
Iterative solvers both for linear system‌ solutions and for eigenproblems.‌ Regarding linear systems, we‌‌ will pay a particular attention to advanced numerical‌ techniques such as multi-level‌ preconditioning, hybrid direct-iterative (both‌‌ algebraic and PDE driven interface boundary conditions) and‌ the solution of augmented‌ systems (e.g., Karush-Kuhn-Tucker or‌‌ KKT) 86, 87. We will investigate‌ variants of nested subspace‌ methods, possibly with subspace‌‌ augmentation or deflation. In the multiple right-hand sides‌ or left-hand sides cases,‌ we will further study‌‌ the possible orthogonalization variants‌ and the trade-off between the associated parallel scalabilty‌ and robustness. A particular attention will be paid‌ to the communication hiding approaches and the investigation‌ of their block extensions. For eigenproblem solutions, we‌ will consider novel nested subspace techniques to further‌ extend the numerical capabilities of the recently proposed‌ AVCI 91, 88 technique as well as‌ countour based integral equations (that intensively use linear‌ systems techniques mentioned above).

In that context, we‌ will consider the benefit of using hybridization between‌ simulation and learning in order to reduce the‌ complexity of classical approaches by diminishing the problem‌ size or improving preconditioning techniques. In a longer‌ term perspective, we will also conduct an active‌ technological watch activity with respect to quantum computing‌ to better understand how such a advanced computing‌ technology can be synergized with classical scientific computing.‌

3.1.2 Scientific computing in large dimension multi-linear algebra‌

This work will mostly address linear algebra problems‌ defined in large dimensional spaces as they might‌ appear either in model-driven simulations or data-driven calculations.‌ In particular we will be interested in tensor‌ vectorial spaces where the intrinsic mathematical structures of‌ the objects have to be exploited to design‌ efficient and effective numerical techniques.

The main numerical‌ algorithms we are interested in are:

Low-rank tensor‌ decompositions for model- and data-driven, some of them‌ rely on some numerical techniques considered in the‌ previous section 76, 78;
Extension of‌ iterative numerical linear solvers (linear systems and eigensolvers)‌ to tensor vectorial spaces to handle problems that‌ were previously vectorized to be amenable to solution‌ by classical linear algebra techniques;
Study preconditioning and‌ domain decomposition techniques suited for the solution of‌ stochastic PDEs (encountered in some Uncertainty Quantification context)‌ 96 leading to large dimension or preconditioning based‌ on a low-rank approximation of the tensorization of‌ the dense matrix in Boundary Element Method solver‌ 63, 66, 93.

3.1.3 Scientific‌ continuum between large size and large dimension

Novel‌ techniques for large size and large dimension problems‌ tend to reduce the memory footprint and CPU‌ consumption through data compression such as low-rank approximations‌ (hierarchical matrices for dense and sparse calculation, tensor‌ decomposition 77, 94, 89) or‌ speed up the algorithm (fast multipole method, randomized‌ algorithm 84, 9095, 67 to‌ reduce the time and energy to solution. Because‌ of the compression, the genuine data are represented‌ with lower accuracy possibly in a hierarchical manner.‌ Understanding the impact of this lower precision data‌ representation through the entire algorithm is an important‌ issue for developing robust, “accurate” and efficient numerical‌ schemes for current and emerging computing platforms from‌ laptop commodity to supercomputers. Mastering the trade-off between‌ performance and accuracy will be part of our‌ research agenda 72, 75.

Because the‌ low precision data representation can have diverse origins,‌ this research activity will naturally cover the multi-precision‌ arithmetic calculation in which the data perturbation comes entirely from the data‌ encoding, representation and calculation‌ in IEEE (or more‌‌ exotic Nvidia GPU or Google TPU) floating point‌ numbers. This will result‌ in variable accuracy calculations.‌‌ This general framework will also enable us to‌ address soft error detection‌ 62 and study possible‌‌ mitigation schemes to design resilient algorithms.

3.2 Composition‌ of parallel numerical algorithms‌ from a sequential expression‌‌

A major breakthrough for exploiting multicore machine 71‌ is based on a‌ data format and computational‌‌ technique originally used in an out-of-core context 82‌. This is itself‌ a refinement of a‌‌ broader class of numerical algorithms – namely, “updating‌ techniques” – that were‌ not originally developed with‌‌ specific hardware considerations in mind. This historical anecdote‌ perfectly illustrates the need‌ to separate data representation,‌‌ algorithmic and architectural concerns when developing numerical methodologies.‌ In the recent past,‌ we have contributed to‌‌ the study of the sequential task flow (STF)‌ programming paradigm, that enabled‌ us to abstract the‌‌ complexity of the underlying computer architecture 60,‌ 61, 59.‌ In the concace project,‌‌ we intend to go further by abstracting the‌ numerical algorithms and their‌ dedicated data structures. We‌‌ strongly believe that combining these two abstractions will‌ allow us to easily‌ compose toolbox algorithms and‌‌ data representations in order to study combinatorial alternatives‌ towards numerical and parallel‌ computational efficiency. We‌‌ have demonstrated this potential on domain decomposition methods‌ for solving sparse linear‌ systems arising from the‌‌ discretisation of PEDs, that has been implemented in‌ the maphys++ parallel package.‌

Regarding the abstraction of‌‌ the target architecture in the design of numerical‌ algorithms, the STF paradigm‌ has been shown to‌‌ significantly reduce the difficulty of programming these complex‌ machines while ensuring high‌ computational efficiency. However, some‌‌ challenges remain. The first major difficulty is related‌ to the scalability of‌ the model at large‌‌ scale where handling the full task graph associated‌ with the STF model‌ becomes a severe bottleneck.‌‌ Another major difficulty is the inability (at a‌ reasonable runtime cost) to‌ efficiently handle fine-grained dynamic‌‌ parallelism, such as numerical pivoting in the Gaussian‌ elimination where the decision‌ to be made depends‌‌ on the outcome of the current calculation and‌ cannot be known in‌ advance or described in‌‌ a task graph. These two challenges are the‌ ones we intend to‌ study first.

With respect‌‌ to the second ingredient, namely the abstraction of‌ the algorithms and data‌ representation, we will also‌‌ explore whether we can provide additional separation of‌ concerns beyond that offered‌ by a task-based design.‌‌ As a seemingly simple example, we will investigate‌ the possibility of abstracting‌ the matrix-vector product, basic‌‌ kernel at the core of many numerical linear‌ algebra methods, to cover‌ the case of the‌‌ fast multipole method (FMM, at the core of‌ the ScalFMM library). FMM‌ is mathematically a block‌‌ matrix-vector product where some of the operations involving‌ the extra-diagonal blocks with‌ hierachical structure would be‌‌ compressed analytically. Such a‌ methodological step forward will consequently allow the factorisation‌ of a significant part of codes (so far‌ completely independent because no bridge has been made‌ upstream) including in particular the ones dealing with‌ $ℋ -matrices$ . The easy composition of these‌ different algorithms will make it possible to explore‌ the combinatorial nature of the possible options in‌ order to best adapt them to the size‌ of the problem to be treated and the‌ characteristics of the target computer. *Offering such a‌ continuum of numerical methods rather than a discrete‌ set of tools is part of the team's‌ objectives* It is a very demanding effort in‌ terms of HPC software engineering expertise to coordinate‌ the overall technical effort.

We intend to strengthen‌ our engagement in reproducible and open science. Consequently,‌ we will continue our joint effort to ensure‌ consistent deployment of our parallel software; this will‌ contribute to improve its impact on academic and‌ industrial users. The software engineering challenge is related‌ to the increasing number of software dependencies induced‌ by the desired capability of combining the functionality‌ of different numerical building boxes, e.g., a domain‌ decomposition solver (such as maphys++) that requires‌ advanced iterative schemes (such as those provided by‌ fabulous) as well as state-of-the-art direct methods (such‌ as pastix, mumps, or qr_mumps),‌ deploying the resulting software stack can become tedious‌ 64.

In that context, we will consider‌ the benefit of using hybridization between simulation and‌ learning in order to reduce the complexity of‌ classical approaches by diminishing the problem size or‌ improving preconditioning techniques. In a longer term perspective,‌ we will also conduct an active technological watch‌ activity with respect to quantum computing to better‌ understand how such a advanced computing technology can‌ be synergized with classical scientific computing.

4 Application‌ domains

Participants: Emmanuel Agullo, Pierre Benjamin,‌ Théo Briquet, Olivier Coulaud, Antoine Gicquel‌, Luc Giraud, Sofiane Haddad, Esragul‌ Korkmaz, Carola Kruse, Paul Mycek,‌ Gilles Marait, Clément Peaucelle, Guillaume Sylvand‌.

We have a major application domain in‌ acoustic simulations that is provided by Airbus CR‌ & T and a few more through collaborations‌ in the context of ongoing projects, that include:‌ plasma simulation (ESA contract and ANR Maturation), Electric‌ device design (ANR TensorVim) and nanoscale simulation platform‌ (ANR Diwina).

4.1 Aeroacoustics Simulation

This domains is‌ in the context of a long term collaboration‌ with Airbus Research Centers. Wave propagation phenomena intervene‌ in many different aspects of systems design at‌ Airbus. They drive the level of acoustic vibrations‌ that mechanical components have to sustain, a level‌ that one may want to diminish for comfort‌ reason (in the case of aircraft passengers, for‌ instance) or for safety reason (to avoid damage‌ in the case of a payload in a‌ rocket fairing at take-off). Numerical simulations of these‌ phenomena plays a central part in the upstream design phase of any‌ such project 73.‌ Airbus Central R &‌‌ T has developed over the last decades an‌ in-depth knowledge in the‌ field of Boundary Element‌‌ Method (BEM) for the simulation of wave propagation‌ in homogeneous media and‌ in frequency domain. To‌‌ tackle heterogeneous media (such as the jet engine‌ flows, in the case‌ of acoustic simulation), these‌‌ BEM approaches are coupled with volumic finite elements‌ (FEM). We end up‌ with the need to‌‌ solve large (several millions unknowns) linear systems of‌ equations composed of a‌ dense part (coming for‌‌ the BEM domain) and a sparse part (coming‌ from the FEM domain).‌ Various parallel solution techniques‌‌ are available today, mixing tools created by the‌ academic world (such as‌ the Mumps and Pastix‌‌ sparse solvers) as well as parallel software tools‌ developed in-house at Airbus‌ (dense solver SPIDO, multipole‌‌ solver, $ℋ$ -matrix solver with an open sequential‌ version available online). In‌ the current state of‌‌ knowledge and technologies, these methods do not permit‌ to tackle the simulation‌ of aeroacoustics problems at‌‌ the highest acoustic frequencies (between 5 and 20‌ kHz, upper limits of‌ human audition) while considering‌‌ the whole complexity of geometries and phenomena involved‌ (higher acoustic frequency implies‌ smaller mesh sizes that‌‌ lead to larger unknowns number, a number that‌ grows like $f^{2‌}$ for BEM and ${f‌‌}^{3}$ for FEM, where f is the studied‌ frequency). The purpose of‌ the study in this‌‌ domain is to develop advanced solvers able to‌ tackle this kind of‌ mixed dense/sparse linear systems‌‌ efficiently on parallel architectures.

5 Highlights of the‌ year

5.1 Awards

We‌ are delighted to welcome‌‌ a new member to the team. Clément Guillet‌ has joined us as‌ an ISFP for the‌‌ 2025 campaign.
A new release of Composyx has‌ been released to the‌ public.
The Concace Steering‌‌ Committee approved the continuation of the team, subject‌ to the completion of‌ the ongoing Inria scientific‌‌ evaluation process.

6 Latest software developments, platforms, open‌ data

6.1 Latest software‌ developments

6.1.1 composyx

Name:‌‌
Numerical and parallel composability for high performance computing‌
Keywords:
Numerical algorithm, Parallel‌ computing, Linear algebra, Task-based‌‌ algorithm, Dense matrix, Sparse matrix, Hierarchical matrix, FMM,‌ C++
Functional Description:
Composable‌ numerical and parallel linear‌‌ algebra library
URL:
https://gitlab.inria.fr/composyx/composyx
Contact:
Emmanuel Agullo

6.1.2‌ ScalFMM

Name:
Scalable Fast‌ Multipole Method
Keywords:
N-body,‌‌ Fast multipole method, Parallelism, MPI, OpenMP
Scientific Description:‌

ScalFMM is a software‌ library to simulate N-body‌‌ interactions using the Fast Multipole Method. The library‌ offers two methods to‌ compute interactions between bodies‌‌ when the potential decays like 1/r. The first‌ method is the classical‌ FMM based on spherical‌‌ harmonic expansions and the second is the Black-Box‌ method which is an‌ independent kernel formulation (introduced‌‌ by E. Darve @ Stanford). With this method,‌ we can now easily‌ add new non oscillatory‌‌ kernels in our library. For the classical method,‌ two approaches are used‌ to decrease the complexity‌‌ of the operators. We‌ consider either matrix formulation that allows us to‌ use BLAS routines or rotation matrix to speed‌ up the M2L operator.

ScalFMM intends to offer‌ all the functionalities needed to perform large parallel‌ simulations while enabling an easy customization of the‌ simulation components: kernels, particles and cells. It works‌ in parallel in a shared/distributed memory model using‌ OpenMP and MPI. The software architecture has been‌ designed with two major objectives: being easy to‌ maintain and easy to understand. There is two‌ main parts: the management of the octree and‌ the parallelization of the method the kernels. This‌ new architecture allow us to easily add new‌ FMM algorithm or kernels and new paradigm of‌ parallelization.

The version 3.0 of the library is‌ a partial rewriting of the version 2.0 in‌ modern C++ ( C++17) to increase the genericity‌ of the approach. This version is also the‌ basic framework for studying numerical and parallel composability‌ within Concace.
Functional Description:
Compute N-body interactions using‌ the Fast Multipole Method for large number of‌ objects
Release Contributions:
ScalFmm is a high performance‌ library for solving n-body problems in astrophysics and‌ electrostatics. It is based on the fast nultipole‌ method (FMM) and is highly parallel
News of‌ the Year:
Performance improvements in version 3.0. For‌ the moment, this version only considers the interpolation‌ approach. New features - the target particles can‌ be different from the source particles - possibility‌ to consider a non-mutual approach in the direct‌ field - the low rank approximation of the‌ transfer operator is taken into account.
URL:
https://gitlab.inria.fr/solverstack/ScalFMM‌
Publications:
hal-00911856, hal-01359458, hal-00746089, hal-01517153‌, hal-01474556, hal-01517153
Contact:
Olivier Coulaud
Participants:‌
Olivier Coulaud, Pierre Estérie

6.1.3 CPPDiodon

Name:
Parallel‌ C++ library for Multivariate Data Analysis of large‌ datasets.
Keywords:
SVD, PCA, Classification
Scientific Description:
Diodon‌ provides executables and functions to compute multivariate data‌ Analysis such as: Singular Value Decomposition (SVD), Principal‌ Component Analysis (PCA) and variants (with different pre-treatments),‌ Multidimensional Scaling (MDS), Correspondence Analysis (CoA), Canonical Correlation‌ Analysis (CCA, future work), Multiple Correspondence Analysis (MCoA,‌ future work). All these methods rely on a‌ Singular Value Decomposition (SVD) of a 2D matrix.‌ For small size matrices the SVD can be‌ directly computed using a sequential or multi-threaded LAPACK‌ solver such as OpenBlas or Intel MKL. For‌ large matrices the SVD becomes time consuming and‌ we use a Randomized Singular Value Decomposition method‌ (rSVD) instead of the exact SVD which implementation‌ is given by the FMR library. FMR can‌ perform computations of the rSVD on parallel shared‌ and distributed memory machines using adequate parallel dense‌ linear algebra routines internally such as OpenBlas or‌ Intel MKL on a shared memory node and‌ Chameleon for distributed memory nodes (MPI).
Functional Description:‌
Dimension reduction by multivariate data analysis. Diodon is‌ a list of functions and drivers that implement‌ in C++ and Python (i) pre-processing, SVD and‌ post-processing with a wide variety of methods, (ii) random projection methods for‌ SVD execution which allows‌ to circumvent the time‌‌ limitation in the calculation of the SVD, and‌ (iii) a C++ implementation‌ of the SVD with‌‌ random projection to an imposed range or precision,‌ connected to the MDS,‌ PCA, CoA.
Release Contributions:‌‌
Initial release of cppdiodon : a parallel C++‌ library for Multivariate Data‌ Analysis of large datasets.‌‌ Contains methods to compute Singular Value Decomposition (SVD),‌ Randomized SVD, Principal Component‌ Analysis (PCA), Multidimensional Scaling‌‌ (MDS) and Correspondence Analysis (CoA). Handles text and‌ hdf5 files. Parallel (mpi,‌ threads, cuda) randomized SVD‌‌ and EVD (for symmetric matrices) provided by FMR.‌ Use multithreaded Lapack or‌ Chameleon (distributed systems +‌‌ GPUs).
URL:
https://gitlab.inria.fr/diodon/cppdiodon
Publications:
tel-04468314, hal-03773985
Contact:‌
Florent Pruvost
Partner:
INRAE‌

6.1.4 FMR

Name:
Fast‌‌ Methods for Randomized numerical linear algebra
Keyword:
SVD‌
Scientific Description:
Fast Dense‌ Standard and Randomized Numerical‌‌ Linear Algebra is a library that allows to‌ compute singular values or‌ eigenvalues of large dense‌‌ matrices by random linear algebra techniques. It is‌ based on the random‌ projection method (Gaussian or‌‌ fast Hadamard/Fourier) or row/column selection (Nystrom method and‌ variants). The library is‌ developed in C++ and‌‌ proposes a shared memory parallelization and a distributed‌ approach with Chameleon (https://gitlab.inria.fr/solverstack/chameleon).‌
Functional Description:
Fast Dense‌‌ Standard and Randomized Numerical Linear Algebra is a‌ library that allows to‌ compute singular values or‌‌ eigenvalues of large dense matrices by random linear‌ algebra techniques. It is‌ based on the random‌‌ projection method (Gaussian or fast Hadamard/Fourier) or row/column‌ selection (Nystrom method and‌ variants). The library is‌‌ developed in C++ and proposes a shared memory‌ parallelization and a distributed‌ approach with Chameleon (https://gitlab.inria.fr/solverstack/chameleon).‌‌
URL:
https://gitlab.inria.fr/compose/legacystack/fmr
Publications:
hal-01228519, hal-01334747, hal-03773985‌
Contact:
Olivier Coulaud
Participants:‌
Olivier Coulaud, Florent Pruvost‌‌

7 New results

Participants: All team members.‌

7.1 Error Estimates for‌ Sparse Tensor Products of‌‌ B-spline Approximation Spaces

This work introduces and analyzes‌ B-spline approximation spaces defined‌ on general geometric domains‌‌ obtained through a mapping from a parameter domain.‌ These spaces are constructed‌ as sparse-grid tensor products‌‌ of univariate spaces in the parameter domain and‌ are mapped to the‌ physical domain via a‌‌ geometric parametrization. Both the univariate approximation spaces and‌ the geometric mapping are‌ built using maximally smooth‌‌ B-splines. We construct two such spaces, employing either‌ the sparse-grid combination technique‌ or the hierarchical subspace‌‌ decomposition of sparse-grid tensor products, and we prove‌ their mathematical equivalence. Furthermore,‌ we derive approximation error‌‌ estimates and inverse inequalities that highlight the advantages‌ of sparse-grid tensor products.‌ Specifically, under suitable regularity‌‌ assumptions on the solution, these spaces achieve the‌ same approximation order as‌ standard tensor product spaces‌‌ while using significantly fewer degrees of freedom. Additionally,‌ our estimates indicate that,‌ in the case of‌‌ non-tensor-product domains, stronger regularity assumptions on the solution-particularly‌ concerning isotropic (non-mixed) derivatives-are‌ required to achieve optimal‌‌ convergence rates compared to sparse-grid methods defined on‌ tensor-product domains.

For more‌ details on this work‌‌ we refer to 48‌.

7.2 On some orthogonalization schemes in Tensor‌ Train format

In the framework of tensor spaces,‌ we consider orthogonalization algorithms to generate an orthogonal‌ basis of a tensor subspace from a set‌ of linearly independent tensors. All variants, except for‌ the Householder transformation, are straightforward extensions of well-known‌ algorithms in matrix computation to tensors. In particular,‌ we experimentally study the loss of orthogonality of‌ six orthogonalization methods: Classical and Modified Gram-Schmidt with‌ (CGS2, MGS2) and without (CGS, MGS) re-orthogonalization, the‌ Cholesky-QR, and the Householder transformation. To overcome the‌ curse of dimensionality, we represent tensors with a‌ low-rank approximation using the Tensor Train (TT) formalism.‌ Additionally, we introduce recompression steps in the standard‌ algorithm outline through the TT-round method at a‌ prescribed accuracy. After describing the structure and properties‌ of the algorithms, we illustrate their loss of‌ orthogonality with numerical experiments. Although no formal proof‌ exists at this time, we observe very clearly‌ that the well-established properties verified over decades of‌ research by the round error analysis community in‌ matrix computation appear to extend to the case‌ of low-rank tensors, with the unit round-off replaced‌ by the TT-round accuracy. The computational analysis for‌ each orthogonalization scheme in terms of memory requirements‌ and computational complexity, measured as a function of‌ the number of TT-round operations, which happens to‌ be the most computationally expensive operation, completes the‌ study.

For more details on this work we‌ refer to 18.

7.3 A note on‌ TT-GMRES for the solution of parametric linear systems‌

We study the solution of linear systems with‌ tensor product structure using the Generalized Minimal RESidual‌ (GMRES) algorithm. To manage the computational complexity of‌ high dimensional problems our approach relies on low-rank‌ tensor representation, focusing specifically on the Tensor Train‌ format. We implement and experimentally study the TT-GMRES‌ algorithm. Our analysis bridges the heuristic methods proposed‌ for TT-GMRES by Dolgov in [Russian J. Numer.‌ Anal. Math. Modelling, 28 (2013), pp. 149–172] and‌ the theoretical framework of inexact GMRES by Simoncine‌ and Szyld [SIAM J. Sci. Comput. 25 (2003),‌ pp. 454–477]. This approach is particularly relevant in‌ a scenario where a $(d + 1‌)$ -dimensional problem arises from concatenating a sequence‌ of $d$ -dimensional problems, as in the case‌ of a parametric linear operator or parametric right-hand-side‌ formulation. Thus, we provide backward error bounds that‌ link the accuracy of the computed $(d‌ + 1)$ -dimensional solution to the numerical‌ quality of the extracted $d$ -dimensional solutions. This‌ facilitates the prescription of a convergence threshold ensuring‌ that the $d$ -dimensional solutions extracted from the‌ $(d + 1)$ -dimensional result have‌ the desired accuracy once the solver converges. We‌ illustrate these results with academic examples across varying‌ dimensions and sizes. Our experiments indicate that the‌ TT-GMRES retains the theoretical rounding error properties observed‌ in matrix-based GMRES.

For more details on this‌ work we refer to 17.

7.4 Solving eigenvalue problems in high‌ dimensions using contour integration‌ and Tensor Train format‌‌

In high-dimensional settings, solving eigenvalue problems is hindered‌ by the curse of‌ dimensionality, particularly when only‌‌ a subset of eigenpairs within a prescribed spectral‌ interval is sought. In‌ this work, we investigate‌‌ an adaptation of the FEAST algorithm, originally developed‌ for symmetric eigenproblems based‌ on contour integration, to‌‌ computations where both operators and vectors are represented‌ in the Tensor Train‌ (TT) format. This representation‌‌ drastically reduces memory and computational demands. We introduce‌ an adaptive scheme for‌ determining the projection subspace‌‌ dimension by incorporating a rank-revealing Modified Gram–Schmidt procedure‌ with pivoting tailored to‌ TT-vectors. A perturbation-based analysis‌‌ provides explicit bounds on the attainable residual accuracy,‌ from which we derive‌ a robust stopping criterion‌‌ for the proposed TT-FEAST algorithm. Moreover, we design‌ a continuation strategy that‌ gradually refines convergence and‌‌ rounding tolerances to effectively control memory growth during‌ iterations. To demonstrate the‌ effectiveness of TT-FEAST as‌‌ a viable alternative to existing high-dimensional eigensolvers when‌ a few eigenvalues are‌ required, we present numerical‌‌ experiments on problems up to twelve dimensions, including‌ the Laplacian and a‌ vibrational Hamiltonian operator.

For‌‌ more details on this work we refer to‌ 44.

7.5 A‌ Tensor Train solver for‌‌ the Magnetic Moment Method

In this work, we‌ focus on enhancing the‌ computational efficiency of the‌‌ Magnetic Moment Method (MMM) using low-rank tensor representations,‌ specifically the Tensor Train‌ (TT) formats. By transforming‌‌ the dense linear system generated by MMM into‌ TT format, we achieve‌ significant reductions in both‌‌ computational cost and memory usage. Furthermore, when the‌ problem structure allows—such as‌ in cases with regular‌‌ grids and binary-compatible sizes—the Quantized Tensor Train (QTT)‌ format is employed to‌ exploit additional compression through‌‌ quantization, leading to even larger performance gains. The‌ proposed methods are tested‌ on several problems involving‌‌ a ferromagnetic part with a regular mesh. For‌ simple cases, our approach‌ demonstrates superior performance compared‌‌ to traditional techniques. As the problem complexity increases,‌ the TT- and QTT-based‌ methods remaincompetitive, maintaining efficiency‌‌ while addressing the added computational challenges.

This work‌ is part of Amine‌ Zekri's PhD thesis and‌‌ is carried out in the context of the‌ TensoVim ANR project. For‌ more details on this‌‌ work we refer to 25.

7.6 Generalized‌ Golub–Kahan Bidiagonalization for Nonsymmetric‌ Saddle-Point Systems

The generalized‌‌ Golub–Kahan bidiagonalization has been used to solve saddle-point‌ systems where the leading‌ block is symmetric and‌‌ positive definite. We extend this iterative method for‌ the case where the‌ symmetry condition no longer‌‌ holds. We do so by relying on the‌ known connection the algorithm‌ has with the conjugate‌‌ gradient method and following the line of reasoning‌ that adapts the latter‌ into the full orthogonalization‌‌ method. We propose appropriate stopping criteria based on‌ the residual and an‌ estimate of the energy‌‌ norm for the error associated with the primal‌ variable. Numerical comparison with‌ GMRES highlights the advantages‌‌ of our proposed strategy‌ regarding its low memory requirements and the associated‌ implications.

For more details on this work we‌ refer to 20.

7.7 A note on‌ the partial convergence management for the solution of‌ symmetric linear systems with multiple right-hand sides

We‌ consider the solution of large sparse symmetric linear‌ systems with multiple right-hand sides available simultaneously. Based‌ on the partial convergence detection and management, described‌ in IB-BGMRES [Linear Algebra Appl., 419 (2006), pp.‌ 265-285] and the breakdown-free idea discussed in [BIT‌ Numer. Math., 57 (2017), pp. 379-403], the block‌ conjugate residual and block conjugate gradient methods with‌ partial convergence management are proposed. It enable to‌ select the directions to use for extending the‌ search space from one iteration to the next‌ by choosing the directions that contribute the most‌ to the residual norms. We illustrate the numerical‌ and computational benefits of these two novel block‌ conjugate direction variants on a set of simple‌ academic examples enabling reproducible experiments.

For more details‌ on this work we refer to 47.‌

7.8 A Scalable and Parameter-Robust Preconditioner for a‌ Second Gradient of Dilation Regularization Applied to a‌ Mechanics Problem

We propose a rigorous analysis of‌ the second gradient of dilation regularization as it‌ is used in many geomechanical applications. In this‌ method, a new primal unknown, modeling the trace‌ of the displacement gradient, is introduced leading to‌ a saddle-point system. The resulting balance equations depend‌ on parameters that range on several orders of‌ magnitude. We prove the well-posedness of the continuous‌ and discrete problems using parameter dependent norms. This‌ allows the definition of a robust block preconditioner‌ for the discrete problem with respect to the‌ parameter variations and to mesh refinement. Numerical results‌ confirm our theoretical findings.

For more details on‌ this work we refer to 49.

7.9‌ Neural network preconditioning: a case study for the‌ solution of the parametric Helmholtz equation

This work‌ presents a hybrid numerical approach for solving linear‌ systems arising from the discretization of the two-dimensional‌ parametric Helmholtz equation. A convolutional neural network based‌ on the U-Net architecture is trained in an‌ unsupervised manner to approximate the inverse of the‌ discretized Helmholtz operator, using a loss function involving‌ the residual norm of the linear system. The‌ trained network is used as a nonlinear preconditioner‌ within the Flexible GMRES (FGMRES) algorithm. Numerical experiments‌ show that while the neural network is not‌ accurate enough to act as a standalone solver,‌ it significantly improves the convergence of FGMRES when‌ employed as a preconditioner. The neural preconditioner demonstrates‌ robust performance and generalization capabilities with respect to‌ variations in the velocity field and the domain‌ size. Comparisons with classical algebraic preconditioners based on‌ sparsified LU factorizations indicate superior efficiency of the‌ neural approach under equivalent conditions. We believe that‌ the proposed method is not tied to a‌ specific neural architecture and can be extended to‌ other parametric PDEs.

For more details on this work we refer to‌ 46.

7.10 Memory-and‌ compute-optimized geometric multigrid GMGPolar‌‌ for curvilinear coordinate representations -Applications to fusion plasma‌

Tokamak fusion reactors are‌ actively studied as a‌‌ means of realizing energy production from plasma fusion.‌ However, due to the‌ substantial cost and time‌‌ required to construct fusion reactors and run physical‌ experiments, numerical experiments are‌ indispensable for understanding plasma‌‌ physics inside tokamaks, supporting the design and engineering‌ phase, and optimizing future‌ reactor designs. Geometric multigrid‌‌ methods are optimal solvers for many problems that‌ arise from the discretization‌ of partial differential equations.‌‌ It has been shown that the multigrid solver‌ GMGPolar solves the 2D‌ gyrokinetic Poisson equation in‌‌ linear complexity and with only small memory requirements‌ compared to other state-of-the-art‌ solvers. In this paper,‌‌ we present a completely refactored and object-oriented version‌ of GMGPolar which offers‌ two different matrix-free implementations.‌‌ Among other things, we leverage the Sherman-Morrison formula‌ to solve cyclic tridiagonal‌ systems from circular line‌‌ solvers without additional fill-in and we apply reordering‌ to optimize cache access‌ of circular and radial‌‌ smoothing operations. With the Give approach, memory requirements‌ are further reduced and‌ speedups of four to‌‌ seven are obtained for usual test cases. For‌ the Take approach, speedups‌ of 16 to 18‌‌ can be attained.

For more details on this‌ work we refer to‌ 50.

7.11 Complexity‌‌ analysis and scalability of a matrix-free extrapolated geometric‌ multigrid solver for curvilinear‌ coordinates representations from fusion‌‌ plasma applications

Tokamak fusion reactors are promising alternatives‌ for future energy production.‌ Gyrokinetic simulations are important‌‌ tools to understand physical processes inside tokamaks and‌ to improve the design‌ of future plants. In‌‌ gyrokinetic codes such as Gysela, these simulations involve‌ at each time step‌ the solution of a‌‌ gyrokinetic Poisson equation defined on disk-like cross sections.‌ The authors of [14,15]‌ proposed to discretize a‌‌ simplified differential equation using symmetric finite differences derived‌ from the resulting energy‌ functional and to use‌‌ an implicitly extrapolated geometric multigrid scheme tailored to‌ problems in curvilinear coordinates.‌ In this article, we‌‌ extend the discretization to a more realistic partial‌ differential equation and demonstrate‌ the optimal linear complexity‌‌ of the proposed solver, in terms of computation‌ and memory. We provide‌ a general framework to‌‌ analyze floating point operations and memory usage of‌ matrix-free approaches for stencil-based‌ operators. Finally, we give‌‌ an efficient matrix-free implementation for the considered solver‌ exploiting a task-based multithreaded‌ parallelism which takes advantage‌‌ of the disk-shaped geometry of the problem. We‌ demonstrate the parallel efficiency‌ for the solution of‌‌ problems of size up to 50 million unknowns.‌

For more details on‌ this work we refer‌‌ to 23.

7.12 Fault-tolerant numerical iterative algorithms‌ at scale

This work‌ investigates how to protect‌‌ numerical iterative algorithms from all types of errors‌ that can strike at‌ scale: fail-stop errors (a.k.a.‌‌ failures) and silent errors, striking both as computation‌ errors and memory bit-flips.‌ We combine various techniques:‌‌ detectors for computation errors,‌ checksums for memory errors, and checkpoint/restart for failures.‌ The objective is to minimize the expected time‌ per iteration of the algorithm. We design a‌ hierarchical pattern that combines and interleaves all these‌ fault-tolerance mechanisms, and we determine the optimal periodic‌ pattern that achieves this objective. We instantiate these‌ results for the performance analysis of the Preconditioned‌ Conjugate Gradient (PCG) algorithm: we report several scenarios‌ where the optimal pattern dramatically decreases the overhead‌ due to error mitigation.

For more details on‌ this work we refer to 24.

7.13‌ A filtered multilevel Monte Carlo method for estimating‌ the expectation of cell-centered discretized random fields

We‌ investigate the use of multilevel Monte Carlo (MLMC)‌ methods for estimating the expectation of discretized random‌ fields. Specifically, we consider a setting in which‌ the input and output vectors of numerical simulators‌ have inconsistent dimensions across the multilevel hierarchy. This‌ requires the introduction of grid transfer operators borrowed‌ from multigrid methods. By adapting mathematical tools from‌ multigrid methods, we perform a theoretical spectral analysis‌ of the MLMC estimator of the expectation of‌ discretized random fields, in the specific case of‌ linear, symmetric and circulant simulators. We then propose‌ filtered MLMC (F-MLMC) estimators based on a filtering‌ mechanism similar to the smoothing process of multigrid‌ methods, and we show that the filtering operators‌ improve the estimation of both the small- and‌ large-scale components of the variance, resulting in a‌ reduction of the total variance of the estimator.‌ Next, the conclusions of the spectral analysis are‌ experimentally verified with a one-dimensional illustration. Finally, the‌ proposed F-MLMC estimator is applied to the problem‌ of estimating the discretized variance field of a‌ diffusion-based covariance operator, which amounts to estimating the‌ expectation of a discretized random field. The numerical‌ experiments support the conclusions of the theoretical analysis‌ even with non-linear simulators, and demonstrate the improvements‌ brought by the F-MLMC estimator compared to both‌ a crude MC and an unfiltered MLMC estimator.‌

For more details on this work we refer‌ to 15.

7.14 Multilevel Monte Carlo methods‌ for ensemble variational data assimilation

Ensemble variational data‌ assimilation relies on ensembles of forecasts to estimate‌ the background error covariance matrix B. The ensemble‌ can be provided by an Ensemble of Data‌ Assimilations (EDA), which runs independent perturbed data assimilation‌ and forecast steps. The accuracy of the ensemble‌ estimator of B is strongly limited by the‌ small ensemble size that is needed to keep‌ the EDA computationally affordable. We investigate here the‌ potential of the multilevel Monte Carlo (MLMC) method,‌ a type of multifidelity Monte Carlo method, to‌ improve the accuracy of the standard Monte-Carlo estimator‌ of B while keeping the computational cost of‌ ensemble generation comparable. MLMC exploits the availability of‌ a range of discretization grids, thus shifting part‌ of the computational work from the original assimilation‌ grid to coarser ones. MLMC differs from the‌ mere averaging of statistical estimators, as it ensures that no bias from‌ the coarse resolution grids‌ is introduced in the‌‌ estimation. The implications for ensemble variational data assimilation‌ systems based on EDAs‌ are discussed. Numerical experiments‌‌ with a quasi-geostrophic model demonstrate the potential of‌ the approach, as MLMC‌ yields more accurate background‌‌ error covariances and reduced analysis error. The challenges‌ involved in cycling a‌ multilevel variational data assimilation‌‌ system are identified and discussed.

For more details‌ on this work we‌ refer to 19.‌‌

7.15 Convergence analysis of overlapping domain decomposition preconditioners‌ for nonlinear problems

Numerical‌ simulations of nonlinear partial‌‌ differential equations often involve solving large nonlinear systems,‌ for which Newton's method‌ is widely employed due‌‌ to its fast convergence near the solution. However,‌ its performance can deteriorate‌ in the presence of‌‌ strong nonlinearities or poor initial guesses. Nonlinear overlapping‌ domain decomposition methods, such‌ as RASPEN and Substructured‌‌ RASPEN (SRASPEN), have proven effective in addressing these‌ challenges. Because SRASPEN reduces‌ the problem size by‌‌ restricting computations to a substructure, it does not‌ update the solution outside‌ the substructure, so that‌‌ no natural initial guesses for the nonlinear local‌ solution exists that might‌ lead to additional inner‌‌ subdomain nonlinear iterations or even prevent the local‌ solvers to converge. In‌ this study, we analyze‌‌ the convergence of RASPEN. We show how domain‌ decomposition improves the convergence‌ rate of the Newton's‌‌ method by highlighting the key role of the‌ substructure on the global‌ error contraction. Moreover, our‌‌ analysis provides insight into an inexpensive modification to‌ SRASPEN that mitigates the‌ lack of iterations outside‌‌ the substructure. The proposed variant significantly reduces computational‌ cost while improving overall‌ efficiency compared to existing‌‌ techniques in the literature. Numerical experiments confirm the‌ computational performance and robustness‌ of the improved SRASPEN,‌‌ establishing it as a reliable approach for solving‌ large-scale nonlinear systems.

This‌ work is part of‌‌ Ettaouchi El Mehdi's PhD thesis and is carried‌ out in collaboration with‌ Nicolas Tardieu (EDF). For‌‌ more details on this work we refer to‌ 21

7.16 Robustness and‌ reliability of state-space, frame-based‌‌ modeling for thermoacoustics

The Galerkin modal expansion is‌ a well-known method used‌ to develop reduced order‌‌ models for thermoacoustics. A known issue is the‌ appearance of Gibbs-type oscillations‌ on velocity fluctuations at‌‌ the interface between subdomains and at boundary conditions.‌ Recent work of Laurent‌ et al. (2019) and‌‌ Laurent et al. (2021) have shown that it‌ is possible to overcome‌ this issue by using‌‌ an over-completed frame, instead of a Galerkin modal‌ basis. However, the low-order‌ modeling based on this‌‌ frame modal expansion may generate spurious modes. In‌ this paper, the origin‌ of these non-physical modes‌‌ is identified and a method is proposed to‌ automatically remove them from‌ the outcome. By preventing‌‌ any interaction between the physical and non-physical components,‌ the proposed methodology drastically‌ improves the robustness and‌‌ reliability of the frame modal expansion modeling for‌ thermoacoustics.

For more details‌ on this work we‌‌ refer to 16.‌

7.17 Juxtaposing the fourth order vibrational operator perturbation‌ theory CVPT(4) and the adaptive VCI (A-VCI): Accuracy,‌ vibrational resonances and polyads of C2H4 and C2D4‌

Ab initio prediction of anharmonic vibrational spectra produces‌ an increasing computational overhead for larger molecules, requesting‌ a balance between an accuracy and resources. Two‌ complementary fundamental quantum mechanical approaches, the perturbative and‌ variational, have various strong and weak features, depending‌ on a specific target problem. The vibrational perturbation‌ theory (VPT) treats weak couplings and strong resonances‌ separately, relying on somewhat artificial criteria. In contrast,‌ the more precise but computationally intense variational configuration‌ interaction (VCI) method treats all couplings in universal‌ manner. The active ongoing development of approaches to‌ solving vibrational problems requires an update of comparative‌ benchmarks, helping to choose the best theoretical tools‌ for a particular target. In this work, the‌ performance of two particular modern implementations of these‌ methods was juxtaposed: the second and fourth order‌ operator canonical perturbation theory CVPT(2,4) and a recently‌ proposed adaptive vibrational configuration interaction method (A-VCI). Two‌ practically important C2H4 and C2D4 molecules and an‌ accurate CCSD(T)/cc-pVQZ four-body sextic normal mode PES were‌ employed for benchmarking. The comprehensive picture of vibrational‌ resonances and the polyad quantum number was revealed.‌ A new quadratic resonance criterion is proposed and‌ its efficiency in elucidating polyad structures is demonstrated.‌ A striking observation was made that CVPT(2) often‌ produces better predictions of fundamental frequencies, while CVPT(4)‌ demonstrates an excellent level of correlation with A-VCI‌ results for both fundamentals and two-quanta states.

For‌ more details on this work we refer to‌ 22

7.18 Approximation Algorithms for Scheduling with/without Deadline‌ Constraints where Rejection Costs are Proportional to Processing‌ Times

In this work, we address two offline‌ job scheduling problems, where jobs can either be‌ processed on a limited supply of energy-efficient machines‌ on the edge, or offloaded to an unlimited‌ supply of energy-inefficient machines on the cloud (called‌ rejected in our context). The goal is to‌ minimize the total energy consumed in processing all‌ tasks. We consider a first scheduling problem with‌ no due date (or deadline) constraints, and we‌ formulate it as a scheduling problem with rejection,‌ where the cost of rejecting a job is‌ directly proportional to its processing time. We introduce‌ a novel 5/4(1+ε) approximation algorithm BEKP by associating‌ it with a Multiple Subset Sum problem for‌ this version. Our algorithm is an improvement over‌ the existing literature, which provides a (3/2 -1/2m)‌ approximation for scenarios with arbitrary rejection costs. In‌ the second scheduling problem, jobs have due date‌ (or deadline) constraints, and the goal is to‌ minimize the weighted number of late jobs. In‌ our context, if a job is late, it‌ is offloaded (rejected) to an energy-inefficient machine on‌ the cloud, which incurs a cost directly proportional‌ to its processing time of the job. We‌ position this problem in the literature, and introduce‌ a novel $(1 - {(m - 1)}^{m} /‌ m^{m})$ -approximation‌ algorithm MDP for this‌‌ version, where we got our inspiration from an‌ algorithm for the interval‌ selection problem with a‌‌ $(1 - {m}^{m} / {(m‌ + 1)}^{m‌})$ approximation ratio for‌‌ arbitrary rejection costs. We evaluate and discuss the‌ effectiveness of our approaches‌ through a series of‌‌ experiments, comparing them to existing algorithms.

For more‌ details on this work‌ we refer to 14‌‌

7.19 Guix-HPC Activity Report 2023-2024

Guix-HPC is a‌ collaborative effort to bring‌ reproducible software deployment to‌‌ scientific workflows and high-performance computing (HPC). Guix-HPC builds‌ upon the GNU Guix‌ software deployment tools and‌‌ aims to make them useful for HPC practitioners‌ and scientists concerned with‌ dependency graph control and‌‌ customization and, uniquely, reproducible research. This report—our seventh‌ report!—highlights key achievements of‌ Guix-HPC between our previous‌‌ report a year ago and today, February 2025.‌ This year was marked‌ by exciting developments for‌‌ HPC and reproducible workflows. Significant advances were made‌ in integrating Guix into‌ the complex software landscape‌‌ of HPC, taking the roles of software manager,‌ workflow execution engine, backend‌ for generating container images,‌‌ or provider for the complete operating system layer.‌ Support for reproducing computations‌ from the past was‌‌ also much improved. And, as usual, we have‌ been using Guix for‌ research, and teaching other‌‌ researchers how to get started.

For more details‌ on this work we‌ refer to 42.‌‌

8 Partnerships and cooperations

Participants: All permanent members‌.

8.1 European initiatives‌

8.1.1 H2020 projects

EoCoE-3‌‌

Title:
Energy oriented Centre of Excellence for computer‌ applications
Duration:
2024-2026
Coordinator:‌
CEA
Inria coordinator:
Bruno‌‌ Raffin
Concace contact:
Emmanuel Agullo
Partners:
- Agenzia Nazionale‌ per le Nuove Tecnologie,‌ l'Energia e lo Sviluppo‌‌ Economico Sostenibile (Italy)
- Barcelona Supercomputing Center - Centro‌ Nacional de Supercomputacion (Spain)‌
- Centre Europeen de Recherche‌‌ et de Formation Avancee en Calcul Scientifique (France)‌
- Centre National de la‌ Recherche Scientifique CNRS (France)‌‌
- Commissariat a l'Energie Atomique et aux Energies Alternatives‌ (France)
- Consiglio Nazionale delle‌ Ricerche (Italy)
- Forschungszentrum Julich‌‌ GmbH (Germany)
- Fraunhofer Gesellschaft zur Foerderung der Angewandten‌ Forschung E.V. (Germany)
- Inria‌
- Max-Planck-Gesellschaft zur Forderung der‌‌ Wissenschaften EV (Germany)
- Rheinisch-Westfaelische Technische Hochschule Aachen (Germany)‌
- Universita degli Studi di‌ Roma Torvergata (Italy)
- Universita‌‌ degli Studi di Trento (Italy)
- Universite Libre de‌ Bruxelles (Belgium)
- Universite Paris-Sud‌ (France)
Inria contact:
Bruno‌‌ Raffin (Datamove)
Summary:
The Concace team (Inria, Cerfacs)‌ participates in the Energy-oriented‌ Centre of Excellence (EoCoE-III),‌‌ starting in January 2024. The project applies cutting-edge‌ exascale computational methods in‌ its mission to accelerate‌‌ the transition to the production, storage and management‌ of clean, decarbonized energy.‌ EoCoE-III is anchored in‌‌ the High Performance Computing (HPC) community and targets‌ research institutes and key‌ commercial players who develop‌‌ and enable energy-relevant numerical models to be run‌ on exascale supercomputers, demonstrating‌ their benefits for the‌‌ net-zero energy transition. The project will draw on‌ the experience of two‌ successful previous projects EoCoE-I‌‌ and -II, where a‌ large set of diverse computer applications from four‌ such energy domains achieved significant efficiency gains thanks‌ to a multidisciplinary expertise in applied mathematics and‌ supercomputing. EoCoE-III channels its efforts into 5 exascale‌ lighthouse applications in the low-carbon sectors of Energy‌ Materials, Water, Wind and Fusion. This multidisciplinary effort‌ will harness innovations in computer science and mathematical‌ algorithms within a tightly integrated co-design approach to‌ overcome performance bottlenecks and to anticipate HPC hardware‌ developments. A world-class consortium of 16 complementary partners‌ forms a unique network of expertise in energy‌ science, scientific computing and HPC, including 3 leading‌ European supercomputing centres.

8.2 National initiatives

MAMBO

Duration:‌
2018 – 2026
Concace contact:
Guillaume Sylvand
Funding:‌
DGAC
Partners:
- CEA
- Inria
- CNRS
Summary:
MAMBO ("Méthodes‌ Avancées pour la Modélisation du Bruit moteur et‌ aviOn") is a project funded by the DGAC,‌ bringing together 21 academic and industrial partners from‌ France and Europe, including Airbus, Cerfacs, and Inria.‌ For Inria, the key challenge of this project‌ lies in addressing new research problems and developing‌ innovative numerical methods by enhancing the capabilities of‌ its software, particularly in the field of high-performance‌ computing.

SANTANA

Duration:
2024 - 2027
Concace contact:‌
Carola Kruse
Funding:
DGAC
Partners:
- Airbus
- Cerfacs
- DLR‌
- ONERA
Summary:
Santana is a project funded by‌ the DGAC that focuses on the development and‌ enhancement of the CODA solver, which is mainly‌ used for the computation of aerodynamic effects. Within‌ this CFD software, the Newton-Krylov solver has been‌ identified as the most computationally expensive component. The‌ role of Cerfacs/Concace is to optimize the linear‌ solution step in the Newton-Krylov solver, with the‌ goal of achieving substantial reductions in computation time.‌

PEPR Numpex

Duration:
2023 – 2028
Concace contact:‌
Emmanuel Agullo, Luc Giraud
Funding:
ANR
Partners:
- CEA‌
- Inria
- CNRS
Summary:
NumPEx is a French program‌ dedicated to Exascale: High-performance computing (HPC), high-performance data‌ analytics (HPDA), and Artificial Intelligence (AI) pose significant‌ challenges across scientific, societal, economic, and ethical realms.‌ These technologies, including modeling and data analysis, are‌ crucial decision support tools addressing societal issues and‌ competitiveness in French research and development. Digital resources,‌ essential across science and industry, demand high-performance hardware.‌ HPC enables advanced modeling, while HPDA handles heterogeneous‌ and massive data. The solution to exploding demand‌ is the upcoming “exascale” computers, a new generation‌ with extraordinary capabilities.

In this context, the French‌ Exascale program NumPEx aims at designing and developing‌ the software components that will equip future exascale‌ machines. NumPEx will deliver Exascale-grade numerical methods, softwares,‌ and training, allowing France to remain one of‌ the leaders in the field. It will contribute‌ to take bridging the gap between cutting-edge software‌ development and application domains to prepare the major‌ scientific and industrial application codes to fully exploit‌ the capabilities of these machines. Application domains of‌ the NumPEx program include, but are not limited‌ to, weather forecasting and climate, aeronautics, automotive, astrophysics,‌ high energy physics, material science, energy production and management, biology and health.‌

Numpex is organized in‌ 7 scientific pillar projects,‌‌ we are directly involved in two of them‌ namely:
- Exa-MA : Methods‌ and Algorithms for Exascale;‌‌
- Exa-SofT : HPC softwares and tools.

TensorVIM

Duration:‌
2023 – 2026
Coordinator:‌
LAPLACE
Concace contact: Olivier‌‌ Coulaud
Funding:
ANR
Partners:
- Inria
- LAPLACE
- G2ELaB
Summary:‌
The aim of this‌ project is to develop‌‌ high-performance computational tools for the rapid implementation of‌ low-frequency electromagnetic simulations for‌ electrical applications. We consider‌‌ an approach based on volume integral methods using‌ low-rank approximations. Instead of‌ using classical compression techniques‌‌ such as the fast multipole method or the‌ hierarchical matrix approach, we‌ propose to investigate the‌‌ use of low-rank tensors to accelerate the computation‌ of the solution of‌ the linear system. The‌‌ tools developed will be used for the modeling‌ of various devices (PCB‌ modeling, Electrical Machines) with‌‌ the main goal of improving their energy performance.‌

Maturation

Title:
MAssively parallel‌ sparse grid PIC algorithms‌‌ for low TemperatURe plAsmas SimulaTIONs
Duration:
2023 –‌ 2026
Coordinator:
Laurent Garrigues‌ (Laplace)
Concace contact: Luc‌‌ Giraud
Funding:
ANR
Partners:
- Laplace Lab
- IMT
- Inria‌
Summary:

The simulation under‌ real conditions of partially‌‌ magnetized low temperature plasmas by Lagrangian approaches, though‌ using powerful Particle-In-Cell (PIC)‌ techniques supplemented with efficient‌‌ high-performance computing methods, requires considerable computing resources for‌ large plasma densities. This‌ is explained by two‌‌ main limitations. First, stability conditions that constrain the‌ numerical parameters to resolve‌ the small space and‌‌ time scales. These numerical parameters are the mesh‌ size of the grid‌ used to compute the‌‌ electric field and the time step between two‌ consecutive computations. Second, PIC‌ methods rely on a‌‌ sampling of the distribution function by numerical particles‌ whose motion is time‌ integrated in the self-consistent‌‌ electric field. The PIC algorithm remains close to‌ physics and offers an‌ incomparable efficiency with regard‌‌ to Eulerian methods, discretizing the distribution function onto‌ a mesh. It is‌ widely and successfully operated‌‌ for the discretization of kinetic plasma models for‌ more than 40 years.‌ Nonetheless, to spare the‌‌ computational resources, the number of numerical particles is‌ limited compared to that‌ of the physical particles.‌‌ Inherent to this “coarse” sampling, PIC algorithms produce‌ numerical approximations prone to‌ statistical fluctuations that vanish‌‌ slowly with the mean number of particles per‌ cell. The mesh accessible‌ on typical high performance‌‌ computing machines may ${10}^{9}$ cells, which brings‌ the mesh size close‌ to the scale of‌‌ the physics, but the mean number of numerical‌ particles in each cell‌ shall be limited, to‌‌ mitigate the memory footprint as well as the‌ computational time. A breakthrough‌ is therefore necessary to‌‌ reduce the computational resources by orders of magnitude‌ and make possible the‌ use of explicit PIC‌‌ method for large scale and/or densities for 3D‌ computations.

This is the‌ issue addressed within the‌‌ MATURATION project aiming at introducing a new class‌ of PIC algorithms with‌ an unprecedented computational efficiency,‌‌ by analyzing and improving,‌ parallelizing and optimizing as well as benchmarking, in‌ the demanding context of partially magnetized low temperature‌ plasmass through 2D large scale and 3D computations,‌ a method recently proposed in the literature, based‌ on a combination of sparse grid techniques and‌ PIC algorithm.

Diwina

Title:
Magnetic Digital Twins for‌ Spintronics : nanoscale simulation platform
Duration:
2023 –‌ 2026
Coordinator:
Institut Neel
Concace contact:
Olivier Coulaud‌
Funding:
ANR
Partners:
- CMAP
- Inria
- Institut Neel
- SPINTEC‌
Summary:
The DiWiNa project aims at developing a‌ unified open-access platform for spintronic numerical twins, ie,‌ codes for micromagnetic/spintronic simulations with sufficiently-high reliability and‌ speed so that they can be trusted and‌ used as reality. The simulations will be bridged‌ to the advanced microcopy techniques used by the‌ community, through plugins to convert the statics or‌ time-resolved 3D vector- fields into contrast maps for‌ the various techniques, including their experimental transfer functions.‌ To achieve this, we bring together experts from‌ different disciplines to address the various challenges: spintronics‌ for the core simulations, mathematics for trust, algorithmics‌ for speed, experimentalists for the bridge with microscopy.‌ Practical work consists of checking the time-integration stability‌ of spintronic torque involved in the dynamics when‌ implemented in the versatile finite-element framework, improve the‌ calculation speed through advanced libraries, build the bridge‌ with microscopies through rendering tools, and encapsulate these‌ three key ingredients into a user-friendly Python ecosystem.‌ Through open-access and versatile user-friendly encapsulation, we expect‌ that this platform is suited to serve the‌ needs of the entire physics and engineering community‌ of spintronics. The platform will be unique in‌ its features, ranging from simulation to the direct‌ and practical comparison with experiments. It will contribute‌ to reduce considerably the number of experimental screening‌ for the faster development of new spintronic devices,‌ which are expected to play a key role‌ in energy saving.

9 Dissemination

All permanent members‌

9.1 Promoting scientific activities

9.1.1 Scientific events: organisation‌

General chair, scientific chair

E. Agullo: co-chair of‌ Compas 2025 - Bordeaux, June 2025

9.1.2 Scientific‌ events: selection

Member of the conference program committees‌

PDSEC: Olivier Coulaud, Luc Giraud,
SC25: Emmanuel Agullo‌
Luc Giraud is member of the Gene Golub‌ SIAM Summer School. The thirteen Gene Golub SIAM‌ Summer School was entitled “Frontiers in multi-dimensional pattern‌ formation", Concordia University - Montreal, Quebec, CA -‌ August 4th to 15th, 2025

9.1.3 Journal

Reviewer‌ - reviewing activities

ACM TOMS, SIAM SISC, SIAM‌ SIMAX, International Journal for Numerical Methods in Engineering.‌

9.1.4 Invited talks

A journey through some numerical‌ linear algebra algorithms with variable accuracy storage; Luc‌ Giraud, Emmanuel Agullo, Olivier Coulaud, Martina Iannacito, Mohammad‌ Issa, Gilles Marait, Miroslav Rozloznik CAS-ANLA 2025 -‌ The Chinese Academy of Sciences Workshop on Approximate‌ computing in Numerical Linear Algebra, Apr 2025, Beijing,‌ China.

9.1.5 Scientific expertise

Emmanuel Agullo is member‌ of the Cerfacs Evaluation committee
Luc Giraud is‌
- member of the board on Modelization, Simulation and‌ data analysis of the Competitiveness Cluster for Aeronautics, Space and Embedded Systems.‌
- member of the scientific‌ council of the ONERA‌‌ Lab LMA2S (Laboratoire de Mathématiques Appliquées à l'Aéronautique‌ et au Spatial).
- member‌ of the scientific council‌‌ of GDR Calcul,
- scientific advisor at Cerfacs.
Carola‌ Kruse referee for Icelandic‌ Research Fund proposals.
Guillaume‌‌ Sylvand is
- expert in Numerical Simulation and HPC‌ at Airbus.
- member of‌ the scientific council of‌‌ the ORAP.

9.1.6 Research administration

Emmanuel Agullo is‌ member of the Technological‌ Development Commission (CDT) and‌‌ Bureau du Comité des Projets (BCP) at the‌ Inria Centre at the‌ University of Bordeaux.
Luc‌‌ Giraud is
- techniques pilot for the expert group‌ for the evaluation of‌ French research entities (UMRs‌‌ and EAs) relatively to the protection of scientific‌ and technological properties (PPST)‌ on information and communication‌‌ sciences and technologies (STIC),
- the representative of Inria‌ at the GENCI evaluation‌ and ressource allocation committees,‌‌
- scientific expert of the GENCI committee on scientific‌ computing.

9.2 Teaching -‌ Supervision - Juries -‌‌ Educational and pedagogical outreach

Post graduate level/Master:
- E.‌ Agullo: Numerical algorithms 20h,‌ advanced Numerical Numerical Algebra‌‌ 8h, and Implementation of HPC dense linear algebra‌ kernels 8h, at Bordeaux‌ INP (ENSEIRB-MatMeca).
- L. Giraud:‌‌ Introduction to intensive computing and related programming tools‌ 20h, INSA Toulouse; Advanced‌ numerical linear algebra 10h,‌‌ ENSEEIHT Toulouse.
- C. Kruse: Iterative methods in linear‌ algebra, 28h, ENSEEIHT Toulouse.‌
- P. Mycek: Multifidelity methods‌‌ 14h, ModIA (cursus en alternance, INSA/N7), Toulouse.

9.2.1‌ Supervision

PhD in progress:‌ Alexandre Malhene; Abstraction of‌‌ subspace methods in numerical linear algebra; started October‌ 2024, E. Agullo, L.‌ Giraud.
PhD in progress:‌‌ Hugo Dodelin; Abstraction of parallel execution models; started‌ October 2024, E. Agullo,‌ O. Coulaud.
PhD in‌‌ progress: Théo Briquet; machine learning techniques for rank‌ prediction of $ℋ$ -matrices;‌ started October 2023, L.‌‌ Giraud, P. Mycek, G. Sylvand.
PhD in progress:‌ El Mehdi Ettaouchi; nonlinear‌ domain decomposition techniques in‌‌ geosciences; started March 2023, L. Giraud, C. Kruse,‌ N. Tardieu (EDF).
PhD‌ in progress: Sai Aakash‌‌ Dasari; Scalable multigrid methods for tokamak geometries; started‌ Oct. 2024, C. Kruse,‌ P. Mycek.
PhD in‌‌ progress: Antoine Gicquel; Acceleration of the matrix-vector product‌ by the fast multipole‌ method for heterogeneous machine‌‌ clusters; started Nov. 2023, O. Coulaud, B. Bramas.‌
PhD in progress: Andrea‌ Lagardère; Méthode Quasi-Trefftz Couplée‌‌ pour l'Aéroacoustique; started April 2024, G. Sylvand, S.‌ Tordeux.
PhD in progress:‌ Clément Peaucelle; Composabilité en‌‌ Algèbre Linéaire Haute Performance - Application à l'Aéroacoustique‌ et à l'Électromagnétisme; started‌ Jan. 2025, E. Agullo,‌‌ G. Sylvand.
PhD in progress: Amine Zekri ;‌ Low-rank tensor solver for‌ magnetostatic problems for electric‌‌ power applications, started Ocotober 2023; O. Coulaud, J.R.‌ Poirier
PhD thesis defended‌ on December 12, 2025:‌‌ Atte Tori; Towards a fast task-based parallel tensor‌ solver for high-dimensional problems,‌ O. Coulaud, O. Kaya‌‌ (LISN, Paris-Saclay) ; Jury: Grey Ballard (Reviewer), Julien‌ Langou, Professor (Reviewer), Alfredo‌ Buttari (Examiner), Thomas Hérault‌‌ (Examiner), Mariya Ishteva (Examiner), Samuel Thibault (Examiner)

9.2.2‌ Juries

PhD defense

El‌ Hachimi Anas, "Tensor-Based Computational‌‌ Methods: Algorithms, Theory, and‌ Applications"; Spécialité : Mathématiques Appliquées, Université du Littoral‌ Côte d’Opale and Université Mohammed VI Polytechnique, Maroc;‌ referees: Nicolas Gillis, Université de Mons - Luc‌ Giraud, Inria, Stefano Serra Capizzano - Université Insubria;‌ members: Lahcen Maniar - Université Cadi Ayyad Marrakech,‌ Hassane Sadok (president) - Université du Littoral Côte‌ d’Opale, Françoise Tisseur - Université de Manchester, Khalide‌ Jbilou - Université du Littoral Côte d’Opale, Ahmed‌ Ratnani - Université Mohammed 6 Polytechnique; Jul. 22,‌ 2025.
Antoine Ronsain, "Modélisation robuste des politiques climatiques‌ : Une analyse critique et stochastique des modèles‌ DICE et RICE"; Spécialité : Mathématiques et Applications,‌ ENAC-LAB - Laboratoire de Recherche ENAC; referees: Aude‌ Pommeret - Université Savoie Mont Blanc, Christine Solnon‌ - INSA Lyon (president), Cyril Allignol - École‌ Nationale de l’Aviation Civile, Julien Lefevre - CIRED,‌ Alexandre Gondran - École Nationale de l’Aviation Civile,‌ Estelle Malavolti - École Nationale de l’Aviation Civile,‌ Pierre Benjamin - Airbus; Dec. 9, 2025.
Mouhssine‌ Abdellatif, "Vector extrapolation methods with applications to geometric‌ multigrid and nonlinear least-squares problems"; Spécialité : Mathématiques‌ Appliquées, University of the Littoral Opal Coast and‌ Mohammed VI Polytechnic University; referees: Kees Vuik -‌ Delft University of Technology , Martin Gander -‌ University of Geneva , Luc Giraud - Inria;‌ members: Lahcen Maniar (president) - Cadi Ayyad University‌ , Carole Rosier - University of the Littoral‌ Opal Coast , Ahmed Ratnani - Mohammed VI‌ Polytechnic University, Hassane Sadok - University of the‌ Littoral Opal Coast; Dec. 29, 2025.

HDR defense‌

Nicole Spilanne, "High Performance Krylov Subspace Solvers with‌ Preconditioning and Deflation"; Habilitation à Diriger des Recherches‌ Institut Polytechnique de Paris CMAP (CNRS, École polytechnique);‌ referees: Grégoire Allaire, Professor at École polytechnique -‌ Martin Gander, Professor at Université de Genève -‌ Yousef Saad, Professor at University of Minesota; members:‌ Stéphanie Chaillat, CNRS Senior Researcher at EPFL -‌ Marc Embree, Professor at Virginia Tech - Virginie‌ Erlacher, Professor at École des Ponts - Luc‌ Giraud, Senior Researcher at Inria (President) - Laura‌ Grigori, Professor at EPFL - Axel Klawonn, Professor‌ at University of Cologne; Oct. 24, 2025.

10‌ Scientific production

10.1 Major publications

1 articleE.‌Emmanuel Agullo, B.Bérenger Bramas, O.‌Olivier Coulaud, E.Eric Darve, M.‌Matthias Messner and T.Toru Takahashi. Task-Based‌ FMM for Multicore Architectures.SIAM Journal on‌ Scientific Computing3612014, 66-93HAL‌DOI
2 articleE.Emmanuel Agullo, A.‌Alfredo Buttari, A.Abdou Guermouche, J.‌Julien Herrmann and A.Antoine Jego. Task-based‌ parallel programming for scalable matrix product algorithms.‌ACM Transactions on Mathematical Software2023HAL
3‌ articleE.Emmanuel Agullo, L.Luc Giraud‌ and L.Louis Poirel. Robust preconditioners via‌ generalized eigenproblems for hybrid sparse linear solvers.‌SIAM Journal on Matrix Analysis and Applications40‌22019, 417–439HAL DOI
4 misc‌E.Emily Bourne, P.Philippe Leleux,‌ K.Katharina Kormann, C.Carola Kruse, V.Virginie Grandgirard,‌ Y.Yaman Güçlü,‌ M. J.Martin J‌‌ Kühn, U.Ulrich Rüde, E.Eric‌ Sonnendrücker and E.Edoardo‌ Zoni. Solver comparison‌‌ for Poisson-like equations on tokamak geometries.September‌ 2022HAL
5 article‌B.Bérenger Bramas,‌‌ O.Olivier Coulaud and G.Guillaume Sylvand.‌ Time-domain BEM for the‌ wave equation on distributed-heterogeneous‌‌ architectures: A blocking approach.Parallel Computing49‌July 2015, 66-82‌HAL DOI
6 article‌‌F.Fabien Casenave, A.Alexandre Ern and‌ G.Guillaume Sylvand.‌ Coupled BEM-FEM for the‌‌ convected Helmholtz equation with non-uniform flow in a‌ bounded domain.Journal‌ of Computational Physics257‌‌AJanuary 2014, 627-644HAL DOI
7‌ articleS.Siegfried Cools‌, E. F.Emrullah‌‌ Fatih Yetkin, E.Emmanuel Agullo, L.‌Luc Giraud and W.‌Wim Vanroose. Analyzing‌‌ the Effect of Local Rounding Error Propagation on‌ the Maximal Attainable Accuracy‌ of the Pipelined Conjugate‌‌ Gradient Method.SIAM Journal on Matrix Analysis‌ and Applications391‌March 2018, 426‌‌ - 450HAL DOI
8 articleV.Vincent‌ Darrigrand, A.Andrei‌ Dumitrasc, C.Carola‌‌ Kruse and U.Ulrich Rüde. Inexact inner–outer‌ Golub–Kahan bidiagonalization method: A‌ relaxation strategy.Numerical‌‌ Linear Algebra with ApplicationsDecember 2022HAL DOI‌
9 articleD. A.‌Daniele Antonio Di Pietro‌‌, P.Pierre Matalon, P.Paul Mycek‌ and U.Ulrich Rüde‌. High-order multigrid strategies‌‌ for HHO discretizations of elliptic equations.Numerical‌ Linear Algebra with Applications‌June 2022HAL DOI‌‌
10 articleA.Alain Franc, P.Pierre‌ Blanchard and O.Olivier‌ Coulaud. Nonlinear mapping‌‌ and distance geometry.Optimization Letters142‌2020, 453-467HAL‌DOI
11 articleL.‌‌Luc Giraud, Y.-F.Yan-Fei Jing and Y.‌Yanfei Xiang. A‌ block minimum residual norm‌‌ subspace solver with partial convergence management for sequences‌ of linear systems.‌SIAM Journal on Matrix‌‌ Analysis and Applications4322022, 710-739‌HAL DOI
12 misc‌Y.Yongseok Jang,‌‌ J.Jerome Bonelle, C.Carola Kruse,‌ F.Frank Hülsemann and‌ U.Ulrich Ruede.‌‌ Fast Linear Solvers for Incompressible CFD Simulations with‌ Compatible Discrete Operator Schemes‌.April 2023HAL‌‌
13 articleM.Marc Odunlami, V.Vincent‌ Le Bris, D.‌Didier Bégué, I.‌‌Isabelle Baraille and O.Olivier Coulaud. A-VCI:‌ A flexible method to‌ efficiently compute vibrational spectra‌‌.Journal of Chemical Physics14621June‌ 2017HAL DOI

10.2‌ Publications of the year‌‌

International journals

14 articleO.Olivier Beaumont,‌ R.Rémi Bouzel,‌ L.Lionel Eyraud-Dubois,‌‌ E.Esragul Korkmaz, L. L.Laércio Lima‌ Pilla and A.Alexandre‌ van Kempen. Approximation‌‌ Algorithms for Scheduling with/without Deadline Constraints where Rejection‌ Costs are Proportional to‌ Processing Times.IEEE‌‌ Transactions on Parallel and Distributed Systems3612‌December 2025, 2596-2608‌HAL DOI back to‌‌ text
15 articleJ.‌Jérémy Briant, P.Paul Mycek, M.‌Mayeul Destouches, O.Olivier Goux, S.‌Serge Gratton, S.Selime Gürol, E.‌Ehouarn Simon and A. T.Anthony T. Weaver‌. A filtered multilevel Monte Carlo method for‌ estimating the expectation of cell-centered discretized random fields‌.International Journal for Uncertainty Quantification2025.‌ In press. HAL DOIback to text
16‌ articleM.Mathieu Cances, L.Luc Giraud‌, M.Michael Bauerheim, L. Y.Laurent‌ Y.M. Gicquel and F.Franck Nicoud. Robustness‌ and reliability of state-space, frame-based modeling for thermoacoustics‌.Journal of Computational Physics5202025,‌ 113472HAL DOI back to text
17 article‌O.Olivier Coulaud, L.Luc Giraud and‌ M.Martina Iannacito. A note on TT-GMRES‌ for the solution of parametric linear systems.‌Electronic Transactions on Numerical Analysis62January 2025‌, 163-187HAL DOIback to text
18‌ articleO.Olivier Coulaud, L.Luc Giraud‌ and M.Martina Iannacito. On some orthogonalization‌ schemes in Tensor Train format.BIT Numerical‌ Mathematics6545October 2025HAL DOI back‌ to text
19 articleM.Mayeul Destouches,‌ P.Paul Mycek, S.Selime Gürol,‌ A.Anthony Weaver, S.Serge Gratton and‌ E.Ehouarn Simon. Multilevel Monte Carlo methods‌ for ensemble variational data assimilation.Nonlinear Processes‌ in Geophysics322June 2025, 167-187‌HAL DOI back to text
20 articleA.‌Andrei Dumitrasc, C.Carola Kruse and U.‌Ulrich Rüde. Generalized Golub–Kahan Bidiagonalization for Nonsymmetric‌ Saddle-Point Systems.SIAM Journal on Matrix Analysis‌ and Applications461February 2025, 370-392‌HAL DOI back to text
21 articleE.‌ M.El Mehdi Ettaouchi, L.Luc Giraud‌, C.Carola Kruse and N.Nicolas Tardieu‌. Convergence analysis of overlapping domain decomposition preconditioners‌ for nonlinear problems.SIAM Journal on Scientific‌ Computing476November 2025, A3464-A3493HAL‌DOI back to text
22 articleV.Vincent‌ Le Bris, S.Sergey Krasnoshchekov, E.‌Egor Dobrolyubov, I.Ilya Efremov, I.‌Igor Polyakov, O.Olivier Coulaud and D.‌Didier Bégué. Juxtaposing the fourth order vibrational‌ operator perturbation theory CVPT(4) and the adaptive VCI‌ (A-VCI): Accuracy, vibrational resonances and polyads of C2H4‌ and C2D4.Journal of Quantitative Spectroscopy and‌ Radiative Transfer346December 2025, 109586HAL‌DOI back to text
23 articleP.Philippe‌ Leleux, C.Christina Schwarz, M.Martin‌ Kühn, C.Carola Kruse and U.Ulrich‌ Rüde. Complexity analysis and scalability of a‌ matrix-free extrapolated geometric multigrid solver for curvilinear coordinates‌ representations from fusion plasma applications.Journal of‌ Parallel and Distributed Computing205November 2025,‌ 105143HAL DOI back to text
24 article‌A.Alix Tremodeux, A.Anne Benoit,‌ E.Emmanuel Agullo, T.Thomas Herault,‌ L.Luc Giraud and Y.Yves Robert. Fault-tolerant numerical iterative algorithms‌ at scale.International‌ Journal of High Performance‌‌ Computing Applications2025HALback to text
25‌ articleA.A. Zekri‌, O.Olivier Coulaud‌‌, L.Luc Giraud and J.-R.J.-R. Poirier‌. A Tensor Train‌ solver for the Magnetic‌‌ Moment Method.IEEE Transactions on Magnetics2025‌, 1-1HAL DOI‌back to text

Invited‌‌ conferences

26 inproceedingsL.Luc Giraud, E.‌Emmanuel Agullo, O.‌Olivier Coulaud, M.‌‌Martina Iannacito, M.Mohammad Issa, G.‌Gilles Marait and M.‌Miroslav Rozloznik. A‌‌ journey through some numerical linear algebra algorithms with‌ variable accuracy storage.‌CAS-ANLA 2025 - The‌‌ Chinese Academy of Sciences Workshop on Approximate computing‌ in Numerical Linear Algebra‌Beijing, ChinaApril 2025‌‌HAL

International peer-reviewed conferences

27 inproceedingsH.Hadrien‌ Godé, C.Carola‌ Kruse, R.Richard‌‌ Angersbach, H.Harald Köstler, M.Michaël‌ Bauerheim and U.Ulrich‌ Rüde. Comparison of‌‌ Multigrid and Machine Learning-Based Poisson Solvers.Parallel‌ Processing and Applied Mathematics15th‌ International Conference, PPAM 2024‌‌PPAM 2024 - Parallel Processing and Applied Mathematics‌Lecture Notes in Computer‌ ScienceLecture Notes in‌‌ Computer ScienceLNCS-15581Ostrava, Czech RepublicSpringer Nature‌ SwitzerlandApril 2025,‌ 174-189HAL DOI
28‌‌ inproceedingsA.-K. M.Aboul-Karim Mohamed El Maarouf,‌ L.Luc Giraud,‌ A.Abdou Guermouche and‌‌ T.Thomas Guignon. Sparse Matrix Ordering for‌ Fine Grain Parallel Triangular‌ Solve Using SIMD.‌‌Parallel Processing and Applied Mathematics (PPAM 2024)PPAM‌ 2024 - 15th International‌ Conference on Parallel Processing‌‌ & Applied MathematicsLNCSLecture Notes in Computer‌ Science15579Ostrava, Czech‌ RepublicSpringer Nature Switzerland‌‌April 2025, 51-64HAL DOI
29 inproceedings‌A.Atte Torri,‌ P.Przemysław Dominikowski,‌‌ B.Brice Pointal, O.Oguz Kaya,‌ L.Laércio Lima Pilla‌ and O.Olivier Coulaud‌‌. Near-Optimal Contraction Strategies for the Scalar Product‌ in the Tensor-Train Format‌.Euro-Par 2025: Parallel‌‌ ProcessingEuro-Par 2025 - 31 International European Conference‌ on Parallel and Distributed‌ Computing15902Lecture Notes‌‌ in Computer ScienceDresden, GermanySpringer Nature Switzerland‌August 2025, 63-77‌HAL DOI
30 inproceedings‌‌Y.Yanfei Xiang and L.Luc Giraud.‌ Convolution neural operator preconditioning‌ for the solution of‌‌ some heterogeneous PDES.DTE - AICOMAS 2025‌ - Digital Twins in‌ Engineering & Artificial Intelligence‌‌ and Computational Methods in Applied ScienceParis, France‌February 2025HAL

National‌ peer-reviewed Conferences

31 inproceedings‌‌ H.Hugo Dodelin. Peut-on exprimer un produit‌ de matrices distribué haute‌ performance avec un modèle‌‌ Map-Reduce ? COMPAS2025 Conférence francophone d'informatique en Parallélisme,‌ Architecture et Système (COMPAS‌ 2025) COMPAS2025 Bordeaux, France‌‌ May 2025 HAL
32 inproceedingsA.Antoine Gicquel‌. A composable abstraction‌ of hierarchical methods for‌‌ matrix-vector product acceleration.COMPAS 2025 - Conférence‌ francophone d'informatique en Parallélisme,‌ Architecture et SystèmeBordeaux,‌‌ FranceJune 2025HAL
33 inproceedingsE.Esragul‌ Korkmaz, E.Emmanuel‌ Agullo and G.Guillaume‌‌ Sylvand. High performance‌ solvers for aeroacoustic simulations for Airbus.COMPAS‌ 2025 - Conférence francophone d'informatique en Parallélisme, Architecture‌ et SystèmeBordeaux, FranceJune 2025HAL

Conferences‌ without proceedings

34 inproceedingsA.Antoine Gicquel.‌ A composable abstraction of hierarchical methods for matrix-vector‌ product acceleration.Sparse Days 2025Toulouse, France‌June 2025HAL
35 inproceedingsA.Antoine Gicquel‌, O.Olivier Coulaud, B.Bérenger Bramas‌ and E.Emmanuel Agullo. A composable abstraction‌ of hierarchical methods for matrix-vector product acceleration.‌Workshop on Approximate Computing in Numerical Linear Algebra‌ 2025Paris, FranceDecember 2025HAL
36 inproceedings‌A.Alexandre Malhéné, E.Emmanuel Agullo,‌ L.Luc Giraud and M.Miroslav Rozloznik.‌ Practical stopping criteria for GMRES.Sparse Days‌ meeting 2025Toulouse, FranceJune 2025HAL
37‌ inproceedingsS.Stojce Nakov, E.Emmanuel Agullo‌, A.Alfredo Buttari, L.Luc Giraud‌ and G.Gilles Marait. Scalable Domain Decomposition‌ Methods for Large Sparse Linear Systems on Modern‌ Architectures.Sparse Days 2025Toulouse, FranceJune‌ 2025HAL
38 inproceedingsS.Stojce Nakov,‌ E.Emmanuel Agullo, A.Alfredo Buttari,‌ L.Luc Giraud and G.Gilles Marait.‌ Scalable Domain Decomposition Methods for Large Sparse Linear‌ Systems on Modern Architectures.COMPAS 2025 -‌ Conférence francophone d'informatique en Parallélisme, Architecture et Système‌Bordeaux, FranceJune 2025HAL
39 inproceedingsC.‌Clément Peaucelle, E.Emmanuel Agullo and G.‌Guillaume Sylvand. Using Floating Point Compression to‌ increase H-Matrices performance.COMPAS 2025 - Conférence‌ francophone d'informatique en Parallélisme, Architecture et SystèmeBordeaux,‌ FranceJune 2025HAL
40 inproceedingsA. E.‌Amine El Mehdi Zekri, O.Olivier Coulaud‌, L.Luc Giraud and J.-R.Jean-René Poirier‌. Tensor Train-Based Approach for the Magnetic Moment‌ Method.Compumag 2025 - 25th International Conference‌ on the Computation of Electromagnetic FieldsNaples, Italy‌June 2025HAL

Reports & preprints

41 misc‌M. A.Mohamed Anwar Abouabdallah, O.Olivier‌ Coulaud, N.Nathalie Peyrard and A.Alain‌ Franc. Computing WSBM conditional marginals with Tensor-Train‌ decomposition.May 2025HAL
42 reportC.‌Céline Acary-Robert, E.Emmanuel Agullo, B.‌Benjamin Arrondeau, L.Lars Bilke, D.‌Dylan Bissuel, L.Ludovic Courtès, C.‌ J.Collin J. Doering, R.Romain Garbage‌, K.Konrad Hinsen, A.Arun Isaac‌, E.Emmanuel Medernach, S.Sorina Camarasu-Pop‌, P.Pjotr Prins, C.Cayetano Santos‌, P.Philippe Swartvagher, S.Simon Tournier‌ and R.Ricardo Wurmus. Guix-HPC Activity Report‌ 2023-2024.Inria Bordeaux - Sud OuestFebruary‌ 2025, 1-36HALback to text
43‌ miscE.Emmanuel Agullo, A.Alfredo Buttari‌, A.Abdou Guermouche and A.Antoine Jego‌. Redundant computations in task-based parallelism with applications‌ to communication-reducing algorithms.July 2025HAL
44‌ reportO.Olivier Coulaud, L.Luc Giraud‌, M.Martina Iannacito and M.Mohammad Issa. Solving eigenvalue problems‌ in high dimensions using‌ contour integration and Tensor‌‌ Train format.RR-9586InriaApril 2025,‌ 19HAL back to‌ text
45 reportE.‌‌ M.El Mehdi Ettaouchi, L.Luc Giraud‌, C.Carola Kruse‌ and N.Nicolas Tardieu‌‌. Convergence analysis of overlapping domain decomposition preconditioners‌ for nonlinear problems.‌RR-9575INRIAFebruary 2025‌‌HAL
46 reportL.Luc Giraud, C.‌Carola Kruse, P.‌Paul Mycek, M.‌‌Maksym Shpakovych and Y.Yanfei Xiang. Neural‌ network preconditioning: a case‌ study for the solution‌‌ of the parametric Helmholtz equation.RR-9593Inria‌ Centre at the University‌ of Bordeaux, FranceJuly‌‌ 2025HAL back to text
47 reportL.‌Luc Giraud and Y.‌Yanfei Xiang. A‌‌ note on the partial convergence management for the‌ solution of symmetric linear‌ systems with multiple right-hand‌‌ sides.RR-9574Inria Centre at the University‌ of BordeauxJanuary 2025‌HAL back to text‌‌
48 reportC.Clément Guillet. Error Estimates‌ for Sparse Tensor Products‌ of B-spline Approximation Spaces‌‌.RR-9598Centre Inria de l'Université de Bordeaux‌October 2025HAL back‌ to text
49 report‌‌C.Carola Kruse, A.Ana Clara Ordonez‌ Egas and N.Nicolas‌ Tardieu. A Scalable‌‌ and Parameter-Robust Preconditioner for a Second Gradient of‌ Dilation Regularization Applied to‌ a Mechanics Problem.‌‌Cerfacs; EDF R&DAugust 2025HAL back to‌ text
50 reportJ.‌Julian Litz, P.‌‌Philippe Leleux, C.Carola Kruse, J.‌Joscha Gedicke and M.‌ J.Martin J Kühn‌‌. Memory-and compute-optimized geometric multigrid GMGPolar for curvilinear‌ coordinate representations -Applications to‌ fusion plasma.Cerfacs‌‌ - Toulouse, FranceJuly 2025HAL back to‌ text
51 reportA.‌Alix Tremodeux, E.‌‌Emmanuel Agullo, A.Anne Benoit, L.‌Luc Giraud, T.‌Thomas Herault and Y.‌‌Yves Robert. Fault-tolerant numerical iterative algorithms at‌ scale.RR-9567Inria‌ LyonJanuary 2025HAL‌‌
52 reportY.Yanfei Xiang. Randomized LOBPCG‌ algorithm with linear dimension‌ reduction.Inria Centre‌‌ at the University of Bordeaux, FranceFebruary 2025‌, 33HAL
53‌ reportY.Yanfei Xiang‌‌. Unsupervised convolution neural operator preconditioning for the‌ solution of some heterogeneous‌ fluid PDEs.Inria‌‌ Centre at the University of BordeauxJanuary 2025‌HAL

Other scientific publications‌

54 inproceedingsP.Przemysław‌‌ Dominikowski, A.Atte Torri, B.Brice‌ Pointal, O.Oguz‌ Kaya, L.Laercio‌‌ Lima Pilla and O.Olivier Coulaud. Exploring‌ Near-Optimal Contraction Strategies for‌ the Scalar Product in‌‌ the Tensor-Train Format.IPDPS 2025 - 39th‌ IEEE International Parallel &‌ Distributed Processing SymposiumMilan,‌‌ ItalyJune 2025, 1274-1276HAL DOI
55‌ inproceedingsA.Antoine Gicquel‌, O.Olivier Coulaud‌‌ and B.Bérenger Bramas. Towards a composable‌ abstraction of hierarchical methods‌ for matrix-vector product acceleration‌‌.COMPAS 2025 - Conférence francophone d'informatique en‌ Parallélisme, Architecture et Système‌Bordeaux, FranceJune 2025‌‌HAL
56 miscE.‌Esragul Korkmaz. ResourceMonitor: Coarse-grain resource monitoring tool‌ for HPC applications.November 2025HAL

Software‌

57 softwareO.Olivier Coulaud, A.Alain‌ Franc, J.-M.Jean-Marc Frigerio, L.Lucas‌ Leandro, R.Romain Peressoni and F.Florent‌ Pruvost. cppdiodon.0.4February 2025 lic:‌ CeCILL-C Free Software License Agreement.HAL Software‌ Heritage VCS
58 softwareA.Alain Franc,‌ F.Florent Pruvost, R.Romain Peressoni,‌ J.-M.Jean-Marc Frigerio and O.Olivier Coulaud.‌ pydiodon.0.1.0April 2025 lic: GNU General‌ Public License v3.0 or later.HAL Software‌ Heritage VCS

10.3 Cited publications

59 articleE.‌E. Agullo, O.O. Aumage, B.‌B. Bramas, O.O. Coulaud and S.‌S. Pitoiset. Bridging the gap between openMP‌ and task-based runtime systems for the fast multipole‌ method.IEEE Transactions on Parallel and Distributed‌ Systems28102017DOI back to text‌
60 articleE.Emmanuel Agullo, B.Bérenger‌ Bramas, O.Olivier Coulaud, E.Eric‌ Darve, M.Matthias Messner and T.Toru‌ Takahashi. Task-Based FMM for Multicore Architectures.‌SIAM Journal on Scientific Computing3612014‌, 66-93HAL DOIback to text
61‌ articleE.Emmanuel Agullo, B.Berenger Bramas‌, O.Olivier Coulaud, E.Eric Darve‌, M.Matthias Messner and T.Toru Takahashi‌. Task-based FMM for heterogeneous architectures.Concurrency‌ and Computation: Practice and Experience289jun‌ 2016, 2608--2629URL: http://doi.wiley.com/10.1002/cpe.3723DOI back to‌ text
62 articleE.Emmanuel Agullo, S.‌Siegfried Cools, E.Emrullah Fatih-Yetkin, L.‌Luc Giraud, N.Nick Schenkels and W.‌Wim Vanroose. On soft errors in the‌ conjugate gradient method: sensitivity and robust numerical detection‌.SIAM Journal on Scientific Computing426‌November 2020HAL DOIback to text
63‌ articleE.Emmanuel Agullo, E.Eric Darve‌, L.Luc Giraud and Y.Yuval Harness‌. Low-Rank Factorizations in Data Sparse Hierarchical Algorithms‌ for Preconditioning Symmetric Positive Definite Matrices.SIAM‌ Journal on Matrix Analysis and Applications394‌October 2018, 1701-1725HAL back to text‌
64 techreportE.Emmanuel Agullo, M.Marek‌ Felšöci and G.Guillaume Sylvand. A comparison‌ of selected solvers for coupled FEM/BEM linear systems‌ arising from discretization of aeroacoustic problems: literate and‌ reproducible environment.RT-0513Inria Bordeaux Sud-OuestJune‌ 2021, 100HALback to text
65‌ articleE.Emmanuel Agullo, L.Luc Giraud‌ and Y.-F.Y-F Jing. Block GMRES method‌ with inexact breakdowns and deflated restarting.SIAM‌ Journal on Matrix Analysis and Applications354‌2014, 1625--1651back to text
66 article‌E.Emmanuel Agullo, L.Luc Giraud and‌ L.Louis Poirel. Robust preconditioners via generalized‌ eigenproblems for hybrid sparse linear solvers.SIAM‌ Journal on Matrix Analysis and Applications402‌2019, 417--439HALDOI back to text
67 techreportP.Pierre‌ Blanchard, O.Olivier‌ Coulaud and E.Eric‌‌ Darve. Fast hierarchical algorithms for generating Gaussian‌ random fields.8811‌Inria Bordeaux Sud-OuestDecember‌‌ 2015HAL back to text back to text‌
68 phdthesisP.Pierre‌ Blanchard. Fast hierarchical‌‌ algorithms for the low-rank approximation of matrices, with‌ applications to materials physics,‌ geostatistics and data analysis‌‌.Bordeaux2017, URL: https://tel.archives-ouvertes.fr/tel-01534930back to‌ text
69 techreportS.‌Steffen Börm, L.‌‌Lars Grasedyck and W.Wolfgang Hackbusch. Hierarchical‌ Matrices.2003,‌ 1--173back to text‌‌
70 unpublishedJ.Jérémy Briant, P.Paul‌ Mycek, M.Mayeul‌ Destouches, O.Olivier‌‌ Goux, S.Serge Gratton, S.Selime‌ Gürol, E.Ehouarn‌ Simon and A. T.‌‌Anthony T. Weaver. A filtered multilevel Monte‌ Carlo method for estimating‌ the expectation of discretized‌‌ random fields.November 2023, working paper‌ or preprintHAL DOI‌back to text
71‌‌ articleA.Alfredo Buttari, J.Julien Langou‌, J.Jakub Kurzak‌ and J.Jack Dongarra‌‌. Parallel tiled QR factorization for multicore architectures‌.Concurrency and Computation:‌ Practice and Experience20‌‌132008, 1573--1590back to text
72‌ articleE.Erin Carson‌, N. J.Nicholas‌‌ J. Higham and S.Srikara Pranesh. Three-Precision‌ GMRES-Based Iterative Refinement for‌ Least Squares Problems.‌‌SIAM Journal on Scientific Computing426January‌ 2020, A4063--A4083DOI‌back to text
73‌‌ articleF.Fabien Casenave, A.Alexandre Ern‌ and G.Guillaume Sylvand‌. Coupled BEM-FEM for‌‌ the convected Helmholtz equation with non-uniform flow in‌ a bounded domain.‌Journal of Computational Physics‌‌257A23 pages, 9 figuresJanuary 2014‌, 627-644HAL DOI‌back to text
74‌‌ bookA.Andrzej Cichocki, R.Rafal Zdunek‌, A. H.Anh‌ Huy Phan and S.-i.‌‌Shun-ichi Amari. Nonnegative Matrix and Tensor Factorizations:‌ Applications to Exploratory Multi-way‌ Data Analysis and Blind‌‌ Source Separation.Wiley2009back to text‌
75 articleS.Siegfried‌ Cools, E. F.‌‌Emrullah Fatih Yetkin, E.Emmanuel Agullo,‌ L.Luc Giraud and‌ W.Wim Vanroose.‌‌ Analyzing the Effect of Local Rounding Error Propagation‌ on the Maximal Attainable‌ Accuracy of the Pipelined‌‌ Conjugate Gradient Method.SIAM Journal on Matrix‌ Analysis and Applications39‌1March 2018,‌‌ 426 - 450HALDOI back to text‌
76 techreportO.Olivier‌ Coulaud, A. A.‌‌Alain A. Franc and M.Martina Iannacito.‌ Extension of Correspondence Analysis‌ to multiway data-sets through‌‌ High Order SVD: a geometric framework.RR-9429‌Inria Bordeaux - Sud-Ouest‌ ; InraeNovember 2021‌‌HAL back to text
77 phdthesisA.Aurelien‌ Falco. Combler l'écart‌ entre -Matrices et méthodes‌‌ directes creuses pour la résolution de systèmes linéaires‌ de grandes tailles.‌Université de BordeauxJune‌‌ 2019HAL back to text
78 articleA.‌ A.Alain A. Franc‌, P.Pierre Blanchard‌‌ and O.Olivier Coulaud‌. Nonlinear mapping and distance geometry.Optimization‌ Letters1422020, 453-467HAL DOI‌back to text
79 bookN.Nicolas Gillis‌. Nonnegative Matrix Factorization.Society for Industrial‌ and Applied MathematicsJanuary 2020DOI back to‌ text
80 techreportL.Luc Giraud, Y.-F.‌Yan-Fei Jing and Y.Yanfei Xiang. A‌ block minimum residual norm subspace solver for sequences‌ of multiple left and right-hand side linear systems‌.RR-9393Inria Bordeaux Sud-OuestFebruary 2021,‌ 60HAL back to text
81 articleL.‌Lars Grasedyck, W.Wolfgang Hackbusch and B.‌Bericht Nr. An Introduction to Hierachical (‌ H - ) Rank and TT - Rank‌ of Tensors with Examples.Computational Methods in‌ Applied Mathematics113292011, 291--304back‌ to text
82 articleB.Brian Gunter and‌ R.Robert Van De Geijn. Parallel out-of-core‌ computation and updating of the QR factorization.‌ACM Transactions on Mathematical Software (TOMS)311‌2005, 60--78back to text
83 book‌W.Wolfgang Hackbusch. Hierarchical Matrices: Algorithms and‌ Analysis.Springer Publishing Company, Incorporated2015back‌ to text
84 articleN.Nathan Halko,‌ P.-G. G.Per-Gunnar G. Martinsson and J. A.‌Joel A. Tropp. Finding structure with randomness:‌ Probabilistic algorithms for constructing approximate matrix decompositions.‌SIAM Review5322011, 217--288URL:‌ http://arxiv.org/abs/0909.4061DOI back to text
85 articleT.‌ G.Tamara G. Kolda and B. W.Brett‌ W. Bader. Tensor Decompositions and Applications.‌SIAM Review513aug 2009, 455--500‌URL: http://epubs.siam.org/doi/abs/10.1137/07070111XDOI back to text
86 article‌C.Carola Kruse, V.Vincent Darrigrand,‌ N.Nicolas Tardieu, M.Mario Arioli and‌ U.Ulrich Rüde. Application of an iterative‌ Golub-Kahan algorithm to structural mechanics problems with multi-point‌ constraints.Adv. Model. Simul. Eng. Sci.7‌12020, 45URL: https://doi.org/10.1186/s40323-020-00181-2DOI back‌ to text
87 articleC.Carola Kruse,‌ M.Masha Sosonkina, M.Mario Arioli,‌ N.Nicolas Tardieu and U.Ulrich Rüde.‌ Parallel solution of saddle point systems with nested‌ iterative solvers based on the Golub-Kahan Bidiagonalization.‌Concurr. Comput. Pract. Exp.33112021,‌ URL: https://doi.org/10.1002/cpe.5914DOI back to text
88 article‌V.Vincent Le Bris, M.Marc Odunlami‌, D.Didier Bégué, I.Isabelle Baraille‌ and O.Olivier Coulaud. Using computed infrared‌ intensities for the reduction of vibrational configuration interaction‌ bases.Phys. Chem. Chem. Phys.2213‌2020, 7021-7030URL: http://dx.doi.org/10.1039/D0CP00593BDOI back to‌ text
89 phdthesisB.Benôit Lizé. Résolution‌ directe rapide pour les éléments finis de frontière‌ en électromagnétisme et acoustique : -Matrices. Parallélisme et‌ applications industrielles.Université Paris-Nord - Paris XIII‌June 2014HAL back to text
90 article‌P.-G.Per-Gunnar Martinsson and J.Joel Tropp.‌ Randomized Numerical Linear Algebra: Foundations & Algorithms.‌2020, URL: http://arxiv.org/abs/2002.01387back to text
91 articleM.Marc Odunlami‌, V.Vincent Le‌ Bris, D.Didier‌‌ Bégué, I.Isabelle Baraille and O.Olivier‌ Coulaud. A-VCI: A‌ flexible method to efficiently‌‌ compute vibrational spectra.The Journal of Chemical‌ Physics14621june‌ 2017, 214108URL:‌‌ http://aip.scitation.org/doi/10.1063/1.4984266DOI back to text
92 articleI.‌ V.I. V. Oseledets‌. Tensor-Train Decomposition.‌‌SIAM Journal on Scientific Computing335January‌ 2011, 2295--2317URL:‌ https://doi.org/10.1137/090752286DOI back to‌‌ text
93 phdthesisL.Louis Poirel. Algebraic‌ domain decomposition methods for‌ hybrid (iterative/direct) solvers.‌‌Université de BordeauxNovember 2018HAL back to‌ text
94 articleJ.-R.‌Jean-René Poirier, O.‌‌Olivier Coulaud and O.Oguz Kaya. Fast‌ BEM Solution for 2-D‌ Scattering Problems Using Quantized‌‌ Tensor-Train Format.IEEE Transactions on Magnetics56‌3March 2020,‌ 1-4HAL DOI back‌‌ to text
95 phdthesisG.Guillaume Sylvand.‌ La méthode multipôle rapide‌ en électromagnétisme. Performances, parallélisation,‌‌ applications.Ecole des Ponts ParisTechJune 2002‌HAL back to text‌
96 techreportN.Nicolas‌‌ Venkovic, P.Paul Mycek, L.Luc‌ Giraud and O.Olivier‌ Le Maitre. Recycling‌‌ Krylov subspace strategies for sequences of sampled stochastic‌ elliptic equations.RR-9425‌Inria Bordeaux - Sud‌‌ OuestOctober 2021HALback to text

CONCACE - 2025

CONCACE - 2025

2025Activity reportProject-Team﻿﻿﻿‌CONCACE

Keywords

Computer Science​‌﻿﻿ and Digital Science

Other﻿​﻿﻿ Research Topics and Application​‌﻿﻿ Domains

1 Team members,​​​‌ visitors, external collaborators

Research﻿​﻿﻿ Scientists

Post-Doctoral Fellows

PhD Students﻿​﻿﻿

Technical Staff

Interns and Apprentices

Administrative Assistants

External Collaborators

2 Overall objectives﻿﻿﻿‌

3 Research program

3.1 Numerical algebra methodologies﻿‌​‌ in model and data-driven﻿​​﻿ scientific computing

3.1.1 Scientific computing in​​​‌ large size linear algebra﻿﻿﻿‌

3.1.2 Scientific computing in​​﻿﻿ large dimension multi-linear algebra​​​‌

3.1.3 Scientific​‌﻿﻿ continuum between large size​​﻿﻿ and large dimension

3.2 Composition​​​‌ of parallel numerical algorithms﻿﻿﻿‌ from a sequential expression﻿‌​‌

4 Application​‌﻿﻿ domains

4.1 Aeroacoustics﻿​﻿﻿ Simulation

5 Highlights of the​​​‌ year

5.1 Awards

6 Latest﻿​​﻿ software developments, platforms, open​​​‌ data

6.1 Latest software﻿﻿﻿‌ developments

6.1.1 composyx

6.1.2​​​‌ ScalFMM

6.1.3 CPPDiodon

6.1.4 FMR

7 New results

7.1 Error Estimates for﻿﻿﻿‌ Sparse Tensor Products of﻿‌​‌ B-spline Approximation Spaces

7.2 On some﻿​﻿﻿ orthogonalization schemes in Tensor​‌﻿﻿ Train format

7.3 A note on​​​‌ TT-GMRES for the solution﻿​﻿﻿ of parametric linear systems​‌﻿﻿

7.4 Solving﻿​​﻿ eigenvalue problems in high​​​‌ dimensions using contour integration﻿﻿﻿‌ and Tensor Train format﻿‌​‌

7.5 A﻿﻿﻿‌ Tensor Train solver for﻿‌​‌ the Magnetic Moment Method﻿​​﻿

7.6 Generalized​​​‌ Golub–Kahan Bidiagonalization for Nonsymmetric﻿﻿﻿‌ Saddle-Point Systems

7.7 A note on​‌﻿﻿ the partial convergence management​​﻿﻿ for the solution of​​​‌ symmetric linear systems with﻿​﻿﻿ multiple right-hand sides

7.8 A Scalable and​​﻿﻿ Parameter-Robust Preconditioner for a​​​‌ Second Gradient of Dilation﻿​﻿﻿ Regularization Applied to a​‌﻿﻿ Mechanics Problem

7.9​‌﻿﻿ Neural network preconditioning: a​​﻿﻿ case study for the​​​‌ solution of the parametric﻿​﻿﻿ Helmholtz equation

7.10 Memory-and﻿﻿﻿‌ compute-optimized geometric multigrid GMGPolar﻿‌​‌ for curvilinear coordinate representations﻿​​﻿ -Applications to fusion plasma​​​‌

7.11 Complexity﻿‌​‌ analysis and scalability of﻿​​﻿ a matrix-free extrapolated geometric​​​‌ multigrid solver for curvilinear﻿﻿﻿‌ coordinates representations from fusion﻿‌​‌ plasma applications

7.12﻿​​﻿ Fault-tolerant numerical iterative algorithms​​​‌ at scale

7.13​‌﻿﻿ A filtered multilevel Monte​​﻿﻿ Carlo method for estimating​​​‌ the expectation of cell-centered﻿​﻿﻿ discretized random fields

7.14​​﻿﻿ Multilevel Monte Carlo methods​​​‌ for ensemble variational data﻿​﻿﻿ assimilation

7.15 Convergence analysis of﻿​​﻿ overlapping domain decomposition preconditioners​​​‌ for nonlinear problems

7.16 Robustness and﻿﻿﻿‌ reliability of state-space, frame-based﻿‌​‌ modeling for thermoacoustics

7.17 Juxtaposing the fourth﻿​﻿﻿ order vibrational operator perturbation​‌﻿﻿ theory CVPT(4) and the​​﻿﻿ adaptive VCI (A-VCI): Accuracy,​​​‌ vibrational resonances and polyads﻿​﻿﻿ of C2H4 and C2D4​‌﻿﻿

7.18 Approximation Algorithms﻿​﻿﻿ for Scheduling with/without Deadline​‌﻿﻿ Constraints where Rejection Costs​​﻿﻿ are Proportional to Processing​​​‌ Times

7.19 Guix-HPC Activity Report﻿​​﻿ 2023-2024

8 Partnerships and cooperations﻿​​﻿

8.1 European initiatives﻿﻿﻿‌

8.1.1 H2020 projects

EoCoE-3﻿‌​‌

8.2﻿​﻿﻿ National initiatives

MAMBO

SANTANA

PEPR Numpex

TensorVIM

Maturation

Diwina

9​​﻿﻿ Dissemination

9.1 Promoting scientific activities﻿​﻿﻿

9.1.1 Scientific events: organisation​‌﻿﻿

General chair, scientific chair​​﻿﻿

9.1.2 Scientific​‌﻿﻿ events: selection

Member of​​﻿﻿ the conference program committees​​​‌

9.1.3 Journal

Reviewer​​​‌ - reviewing activities

9.1.4 Invited talks

9.1.5 Scientific expertise﻿​﻿﻿

9.1.6 Research﻿​​﻿ administration

9.2 Teaching -﻿﻿﻿‌ Supervision - Juries -﻿‌​‌ Educational and pedagogical outreach﻿​​﻿

9.2.1​​​‌ Supervision

9.2.2​​​‌ Juries

10​​​‌ Scientific production

10.1 Major﻿​﻿﻿ publications

10.2﻿﻿﻿‌ Publications of the year﻿‌​‌

International journals

2025Activity reportProject-Team‌CONCACE

Computer Science‌ and Digital Science

Other Research Topics and Application‌ Domains

1 Team members,‌ visitors, external collaborators

Research Scientists

PhD Students

2 Overall objectives‌

3.1 Numerical algebra methodologies‌‌ in model and data-driven scientific computing

3.1.1 Scientific computing in‌ large size linear algebra‌

3.1.2 Scientific computing in large dimension multi-linear algebra‌

3.1.3 Scientific‌ continuum between large size and large dimension

3.2 Composition‌ of parallel numerical algorithms‌ from a sequential expression‌‌

4 Application‌ domains

4.1 Aeroacoustics Simulation

5 Highlights of the‌ year

6 Latest software developments, platforms, open‌ data

6.1 Latest software‌ developments

6.1.2‌ ScalFMM

7.1 Error Estimates for‌ Sparse Tensor Products of‌‌ B-spline Approximation Spaces

7.2 On some orthogonalization schemes in Tensor‌ Train format

7.3 A note on‌ TT-GMRES for the solution of parametric linear systems‌

7.4 Solving eigenvalue problems in high‌ dimensions using contour integration‌ and Tensor Train format‌‌

7.5 A‌ Tensor Train solver for‌‌ the Magnetic Moment Method

7.6 Generalized‌ Golub–Kahan Bidiagonalization for Nonsymmetric‌ Saddle-Point Systems

7.7 A note on‌ the partial convergence management for the solution of‌ symmetric linear systems with multiple right-hand sides

7.8 A Scalable and Parameter-Robust Preconditioner for a‌ Second Gradient of Dilation Regularization Applied to a‌ Mechanics Problem

7.9‌ Neural network preconditioning: a case study for the‌ solution of the parametric Helmholtz equation

7.10 Memory-and‌ compute-optimized geometric multigrid GMGPolar‌‌ for curvilinear coordinate representations -Applications to fusion plasma‌

7.11 Complexity‌‌ analysis and scalability of a matrix-free extrapolated geometric‌ multigrid solver for curvilinear‌ coordinates representations from fusion‌‌ plasma applications

7.12 Fault-tolerant numerical iterative algorithms‌ at scale

7.13‌ A filtered multilevel Monte Carlo method for estimating‌ the expectation of cell-centered discretized random fields

7.14 Multilevel Monte Carlo methods‌ for ensemble variational data assimilation

7.15 Convergence analysis of overlapping domain decomposition preconditioners‌ for nonlinear problems

7.16 Robustness and‌ reliability of state-space, frame-based‌‌ modeling for thermoacoustics

7.17 Juxtaposing the fourth order vibrational operator perturbation‌ theory CVPT(4) and the adaptive VCI (A-VCI): Accuracy,‌ vibrational resonances and polyads of C2H4 and C2D4‌

7.18 Approximation Algorithms for Scheduling with/without Deadline‌ Constraints where Rejection Costs are Proportional to Processing‌ Times

7.19 Guix-HPC Activity Report 2023-2024

8 Partnerships and cooperations

8.1 European initiatives‌

EoCoE-3‌‌

8.2 National initiatives

9 Dissemination

9.1 Promoting scientific activities

9.1.1 Scientific events: organisation‌

General chair, scientific chair

9.1.2 Scientific‌ events: selection

Member of the conference program committees‌

Reviewer‌ - reviewing activities

9.1.5 Scientific expertise

9.1.6 Research administration

9.2 Teaching -‌ Supervision - Juries -‌‌ Educational and pedagogical outreach

9.2.1‌ Supervision

9.2.2‌ Juries

10‌ Scientific production

10.1 Major publications

10.2‌ Publications of the year‌‌

Invited‌‌ conferences

International peer-reviewed conferences

National‌ peer-reviewed Conferences

Conferences‌ without proceedings

Reports & preprints

Other scientific publications‌

Software‌

10.3 Cited publications