Bibliography
Major publications by the team in recent years
[1] E. Agullo, O. Aumage, M. Faverge, N. Furmento, F. Pruvost, M. Sergent, S. Thibault.
Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model, in: IEEE Transactions on Parallel and Distributed Systems, 2017. [ DOI : 10.1109/TPDS.2017.2766064 ]
https://hal.inria.fr/hal-01618526
[2] E. Agullo, B. Bramas, O. Coulaud, E. Darve, M. Messner, T. Takahashi.
Task-Based FMM for Multicore Architectures, in: SIAM Journal on Scientific Computing, 2014, vol. 36, no 1, pp. 66-93. [ DOI : 10.1137/130915662 ]
https://hal.inria.fr/hal-00911856
[3] E. Agullo, A. Buttari, A. Guermouche, F. Lopez.
Implementing multifrontal sparse solvers for multicore architectures with Sequential Task Flow runtime systems, in: ACM Transactions on Mathematical Software, July 2016.
https://hal.inria.fr/hal-01333645
[4] E. Agullo, L. Giraud, Y.-F. Jing.
Block GMRES method with inexact breakdowns and deflated restarting, in: SIAM Journal on Matrix Analysis and Applications, November 2014, vol. 35, no 4, pp. 1625-1651.
https://hal.inria.fr/hal-01067159
[5] E. Agullo, L. Giraud, P. Salas, M. Zounon.
Interpolation-restart strategies for resilient eigensolvers, in: SIAM Journal on Scientific Computing, 2016, vol. 38, no 5, pp. C560-C583. [ DOI : 10.1137/15M1042115 ]
https://hal.inria.fr/hal-01347793
[6] A. Casadei, P. Ramet, J. Roman.
An improved recursive graph bipartitioning algorithm for well balanced domain decomposition, in: 21st annual IEEE International Conference on High Performance Computing (HiPC 2014), Goa, India, December 2014.
https://hal.inria.fr/hal-01100962
[7] J. Dongarra, M. Faverge, T. Hérault, M. Jacquelin, J. Langou, Y. Robert.
Hierarchical QR factorization algorithms for multi-core clusters, in: Parallel Computing, 2013, vol. 39, no 4-5, pp. 212-232. [ DOI : 10.1016/j.parco.2013.01.003 ]
http://hal.inria.fr/hal-00809770
[8] X. Lacoste, M. Faverge, P. Ramet, S. Thibault, G. Bosilca.
Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes, in: HCW'2014 workshop of IPDPS, Phoenix, United States, IEEE, May 2014.
https://hal.inria.fr/hal-00987094
[9] S. Moustafa, M. Faverge, L. Plagne, P. Ramet.
3D Cartesian Transport Sweep for Massively Parallel Architectures with PARSEC, in: 29th IEEE International Parallel & Distributed Processing Symposium, Hyderabad, India, May 2015, pp. 581-590. [ DOI : 10.1109/IPDPS.2015.75 ]
https://hal.inria.fr/hal-01078362
[10] M. Odunlami, V. Le Bris, D. Bégué, I. Baraille, O. Coulaud.
A-VCI: A flexible method to efficiently compute vibrational spectra, in: Journal of Chemical Physics, June 2017, vol. 146, no 21. [ DOI : 10.1063/1.4984266 ]
https://hal.inria.fr/hal-01534134
[11] G. Pichon, M. Faverge, P. Ramet, J. Roman.
Reordering Strategy for Blocking Optimization in Sparse Linear Solvers, in: SIAM Journal on Matrix Analysis and Applications, 2017, vol. 38, no 1, pp. 226-248. [ DOI : 10.1137/16M1062454 ]
https://hal.inria.fr/hal-01485507
[12] M. Predari, A. Esnard, J. Roman.
Comparison of initial partitioning methods for multilevel direct k-way graph partitioning with fixed vertices, in: Parallel Computing, 2017. [ DOI : 10.1016/j.parco.2017.05.002 ]
https://hal.inria.fr/hal-01538600
[13] F. Rozar, G. Latu, J. Roman, V. Grandgirard.
Toward memory scalability of GYSELA code for extreme scale computers, in: Concurrency and Computation: Practice and Experience, November 2014, pp. 1-16. [ DOI : 10.1002/cpe.3429 ]
https://hal.inria.fr/hal-01111720
Doctoral Dissertations and Habilitation Theses
[14] B. Alzaix.
Mathematical and numerical analysis of the Herberthson integral equation dedicated to electromagnetic plane wave scattering, Université de Bordeaux, April 2017.
https://tel.archives-ouvertes.fr/tel-01558135
[15] A. Guermouche.
Towards Sparse Direct Solvers for Modern High-Performance Architectures, Université de Bordeaux, November 2017, Habilitation à diriger des recherches.
[16] P. Ramet.
Heterogeneous architectures, Hybrid methods, Hierarchical matrices for Sparse Linear Solvers, Université de Bordeaux, November 2017, Habilitation à diriger des recherches.
https://hal.inria.fr/tel-01668740
Articles in International Peer-Reviewed Journals
[17] E. Agullo, O. Aumage, B. Bramas, O. Coulaud, S. Pitoiset.
Bridging the gap between OpenMP and task-based runtime systems for the fast multipole method, in: IEEE Transactions on Parallel and Distributed Systems, April 2017, 14 p. [ DOI : 10.1109/TPDS.2017.2697857 ]
https://hal.inria.fr/hal-01517153
[18] E. Agullo, O. Aumage, M. Faverge, N. Furmento, F. Pruvost, M. Sergent, S. Thibault.
Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model, in: IEEE Transactions on Parallel and Distributed Systems, 2017, forthcoming. [ DOI : 10.1109/TPDS.2017.2766064 ]
https://hal.inria.fr/hal-01618526
[19] D. Cariolle, P. Moinat, H. Teyssèdre, L. Giraud, B. Josse, F. Lefèvre.
ASIS v1.0: an adaptive solver for the simulation of atmospheric chemistry, in: Geoscientific Model Development, 2017, vol. 10, pp. 1467-1485. [ DOI : 10.5194/gmd-10-1467-2017 ]
https://hal.inria.fr/hal-01507392
[20] J. M. Couteyen Carpaye, J. Roman, P. Brenner.
Design and Analysis of a Task-based Parallelization over a Runtime System of an Explicit Finite-Volume CFD Code with Adaptive Time Stepping, in: International Journal of Computational Science and Engineering, 2017, pp. 1-22, https://arxiv.org/abs/1704.01144. [ DOI : 10.1016/j.jocs.2017.03.008 ]
https://hal.inria.fr/hal-01507613
[21] M. Odunlami, V. Le Bris, D. Bégué, I. Baraille, O. Coulaud.
A-VCI: A flexible method to efficiently compute vibrational spectra, in: Journal of Chemical Physics, June 2017, vol. 146, no 21. [ DOI : 10.1063/1.4984266 ]
https://hal.inria.fr/hal-01534134
[22] G. Pichon, M. Faverge, P. Ramet, J. Roman.
Reordering Strategy for Blocking Optimization in Sparse Linear Solvers, in: SIAM Journal on Matrix Analysis and Applications, 2017, vol. 38, no 1, pp. 226-248. [ DOI : 10.1137/16M1062454 ]
https://hal.inria.fr/hal-01485507
[23] M. Predari, A. Esnard, J. Roman.
Comparison of initial partitioning methods for multilevel direct k-way graph partitioning with fixed vertices, in: Parallel Computing, 2017. [ DOI : 10.1016/j.parco.2017.05.002 ]
https://hal.inria.fr/hal-01538600
[24] D. Sukkari, H. Ltaief, M. Faverge, D. Keyes.
Asynchronous Task-Based Polar Decomposition on Single Node Manycore Architectures, in: IEEE Transactions on Parallel and Distributed Systems, August 2017. [ DOI : 10.1109/TPDS.2017.2755655 ]
https://hal.inria.fr/hal-01585079
International Conferences with Proceedings
[25] E. Agullo, S. Cools, L. Giraud, W. Vanroose, E. F. Yetkin.
Soft Error in Classical PCG and its Variants: Sensitivity, Numerical Detections and Possible Recovery Policies, in: SIAM Annual meeting 2017, AN'17, Pittsburgh, United States, July 2017.
https://hal.inria.fr/hal-01670198
[26] O. Beaumont, L. Eyraud-Dubois, S. Kumar.
Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs, in: IEEE International Parallel & Distributed Processing Symposium (IPDPS), Orlando, United States, May 2017.
https://hal.inria.fr/hal-01386174
[27] N. Bouzat, F. Rozar, G. Latu, J. Roman.
A New Parallelization Scheme for the Hermite Interpolation Based Gyroaverage Operator, in: ISPDC 2017 - 16th International Symposium on Parallel and Distributed Computing, Innsbruck, Austria, IEEE, July 2017, pp. 1-8. [ DOI : 10.1109/ISPDC.2017.12 ]
https://hal.inria.fr/hal-01687727
[28] P. Clauss, E. Altıntas, M. Kuhn.
Automatic Collapsing of Non-Rectangular Loops, in: IEEE International Parallel and Distributed Processing Symposium (IPDPS 2017), Orlando, United States, May 2017, pp. 778-787. [ DOI : 10.1109/IPDPS.2017.34 ]
https://hal.inria.fr/hal-01581081
[29] M. Faverge, J. Langou, Y. Robert, J. Dongarra.
Bidiagonalization and R-Bidiagonalization: Parallel Tiled Algorithms, Critical Paths and Distributed-Memory Implementation, in: IPDPS'17 - 31st IEEE International Parallel and Distributed Processing Symposium, Orlando, United States, May 2017.
https://hal.inria.fr/hal-01484113
[30] G. Pichon, E. Darve, M. Faverge, P. Ramet, J. Roman.
Sparse Supernodal Solver Using Block Low-Rank Compression, in: 18th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC 2017), Orlando, United States, June 2017.
https://hal.inria.fr/hal-01502215
National Conferences with Proceedings
[31] E. Agullo, A. Falco, L. Giraud, G. Sylvand.
Vers une factorisation symbolique hiérarchique de rang faible pour des matrices creuses, in: Conférence d’informatique en Parallélisme, Architecture et Système (ComPAS'17), Sophia Antipolis, France, June 2017.
https://hal.inria.fr/hal-01597072
[32] G. Pichon.
Utilisation de la compression Block Low-Rank pour accélérer un solveur direct creux supernodal, in: Conférence d’informatique en Parallélisme, Architecture et Système (ComPAS'17), Sophia Antipolis, France, June 2017.
https://hal.inria.fr/hal-01585660
Conferences without Proceedings
[33] E. Agullo, L. Giraud, E. Darve, Y. Harness.
Soft Error in Classical PCG and its Variants: Sensitivity, Numerical Detections and Possible Recovery Policies, in: SIAM Annual meeting 2017, AN'17, Pittsburgh, United States, July 2017.
https://hal.inria.fr/hal-01670160
[34] E. Agullo, L. Giraud, L. Poirel.
Robust coarse spaces for abstract Schwarz preconditioners via generalized eigenproblems, in: International conference on domain decomposition methods, DD24, Svalbard, Norway, February 2017.
https://hal.inria.fr/hal-01670178
[35] E. Agullo, L. Giraud, E. F. Yetkin.
Soft Error in PCG: Sensitivity, Numerical Detections and Possible Recoveries, in: SIAM Conference on Computational Science and Engineering, CSE'17, Atlanta, United States, February 2017.
https://hal.inria.fr/hal-01670189
[36] G. Pichon, E. Darve, M. Faverge, P. Ramet, J. Roman.
Sparse Supernodal Solver exploiting Low-Rankness Property, in: Sparse Days 2017, Toulouse, France, September 2017.
https://hal.inria.fr/hal-01585622
[37] G. Pichon, E. Darve, M. Faverge, P. Ramet, J. Roman.
Sparse Supernodal Solver Using Hierarchical Compression over Runtime System, in: SIAM Conference on Computational Science and Engineering (CSE'17), Atlanta, United States, February 2017.
https://hal.inria.fr/hal-01421379
[38] G. Pichon, M. Faverge, P. Ramet.
Exploiting Modern Manycore Architecture in Sparse Direct Solver with Runtime Systems, in: SIAM Conference on Computational Science and Engineering (CSE'17), Atlanta, United States, February 2017.
https://hal.inria.fr/hal-01421383
[39] G. Pichon, M. Faverge, P. Ramet, J. Roman.
Impact of Blocking Strategies for Sparse Direct Solvers on Top of Generic Runtimes, in: SIAM Conference on Computational Science and Engineering (CSE'17), Atlanta, United States, February 2017.
https://hal.inria.fr/hal-01421384
Internal Reports
[40] E. Agullo, B. Bramas, O. Coulaud, M. Khannouz, L. Stanisic.
Task-based fast multipole method for clusters of multicore processors, Inria Bordeaux Sud-Ouest, March 2017, no RR-8970, 15 p.
https://hal.inria.fr/hal-01387482
[41] E. Agullo, B. Bramas, O. Coulaud, L. Stanisic, S. Thibault.
Modeling Irregular Kernels of Task-based codes: Illustration with the Fast Multipole Method, Inria Bordeaux, February 2017, no RR-9036, 35 p.
https://hal.inria.fr/hal-01474556
[42] E. Agullo, A. Buttari, M. Byckling, A. Guermouche, I. Masliah.
Achieving high-performance with a sparse direct solver on Intel KNL, Inria Bordeaux Sud-Ouest; CNRS-IRIT; Intel Corporation; Université Bordeaux, February 2017, no RR-9035, 15 p.
https://hal.inria.fr/hal-01473475
[43] I. Baraille, D. Bégué, O. Coulaud, V. Le Bris, M. Odunlami.
A-VCI: a flexible method to efficiently compute vibrational spectra, Inria, March 2017, no RR-9043, 35 p.
https://hal.inria.fr/hal-01485877
[44] P. Blanchard, P. P. Chaumeil, J.-M. Frigerio, F. Rimet, F. Salin, S. Thérond, O. Coulaud, A. Franc.
A geometric view of Biodiversity: scaling to metagenomics, Inria; INRA, January 2018, no RR-9144, pp. 1-16.
https://hal.inria.fr/hal-01685711
[45] N. Bouzat, F. Rozar, G. Latu, J. Roman.
A new parallelization scheme for the Hermite interpolation based gyroaverage operator, Inria, April 2017, no RR-9054, 22 p.
https://hal.inria.fr/hal-01502513
[46] M. Faverge, S. Moustafa, F. Févotte, L. Plagne, P. Ramet.
Efficient Parallel Solution of the 3D Stationary Boltzmann Transport Equation for Diffusive Problems, Inria; EDF Lab, September 2017, no RR-9116, 22 p.
https://hal.inria.fr/hal-01630208
[47] G. Pichon, E. Darve, M. Faverge, P. Ramet, J. Roman.
Sparse Supernodal Solver Using Block Low-Rank Compression, Inria Bordeaux Sud-Ouest, January 2017, no RR-9022, 24 p.
https://hal.inria.fr/hal-01450732
[48] G. Pichon, E. Darve, M. Faverge, P. Ramet, J. Roman.
Sparse Supernodal Solver Using Block Low-Rank Compression: design, performance and analysis, Inria Bordeaux Sud-Ouest, December 2017, no RR-9130, pp. 1-32.
https://hal.inria.fr/hal-01660665
Other Publications
[49] G. Pichon, E. Darve, M. Faverge, S. Lanteri, P. Ramet, J. Roman.
Sparse supernodal solver with low-rank compression for solving the frequency-domain Maxwell equations discretized by a high order HDG method, November 2017, pp. 1-55, Journées jeunes chercheur-e-s - Résolution de problèmes d’ondes harmoniques de grande taille.
https://hal.inria.fr/hal-01660653