ROMA - 2019


Bibliography

Publications of the year

Doctoral Dissertations and Habilitation Theses

  • [1] B. Uçar.

    Partitioning, matching, and ordering: Combinatorial scientific computing with matrices and tensors, ENS de Lyon, September 2019, Habilitation à diriger des recherches.

    https://hal.inria.fr/tel-02377874

Articles in International Peer-Reviewed Journals

  • [2] P. R. Amestoy, A. Buttari, J.-Y. L'Excellent, T. Mary.

    Bridging the gap between flat and hierarchical low-rank matrix formats: the multilevel BLR format, in: SIAM Journal on Scientific Computing, May 2019, vol. 41, no 3, pp. A1414-A1442. [ DOI : 10.1137/18M1182760 ]

    https://hal.archives-ouvertes.fr/hal-01774642
  • [3] P. R. Amestoy, A. Buttari, J.-Y. L'Excellent, T. Mary.

    Performance and Scalability of the Block Low-Rank Multifrontal Factorization on Multicore Architectures, in: ACM Transactions on Mathematical Software, February 2019, vol. 45, no 1, pp. 1-23. [ DOI : 10.1145/3242094 ]

    https://hal.archives-ouvertes.fr/hal-01955766
  • [4] P. R. Amestoy, J.-Y. L'Excellent, G. Moreau.

    On exploiting sparsity of multiple right-hand sides in sparse direct solvers, in: SIAM Journal on Scientific Computing, 2019, vol. 41, no 1, pp. A269-A291. [ DOI : 10.1137/17M1151882 ]

    https://hal.inria.fr/hal-01955659
  • [5] G. Aupy, A. Benoit, B. Goglin, L. Pottier, Y. Robert.

    Co-scheduling HPC workloads on cache-partitioned CMP platforms, in: International Journal of High Performance Computing Applications, April 2019, vol. 33, no 6, pp. 1221-1239. [ DOI : 10.1177/1094342019846956 ]

    https://hal.inria.fr/hal-02093172
  • [6] O. Beaumont, T. Lambert, L. Marchal, B. Thomas.

    Performance Analysis and Optimality Results for Data-Locality Aware Tasks Scheduling with Replicated Inputs, in: Future Generation Computer Systems, October 2019, pp. 1-28. [ DOI : 10.1016/j.future.2019.08.024 ]

    https://hal.inria.fr/hal-02275473
  • [7] A. Benoit, A. Cavelan, F. Ciorba, V. Le Fèvre, Y. Robert.

    Combining Checkpointing and Replication for Reliable Execution of Linear Workflows with Fail-Stop and Silent Errors, in: International Journal of Networking and Computing, 2019, vol. 9, no 1, pp. 2-27. [ DOI : 10.15803/ijnc.9.1_2 ]

    https://hal.inria.fr/hal-02082369
  • [8] L.-C. Canon, A. K. W. Chang, Y. Robert, F. Vivien.

    Scheduling independent stochastic tasks under deadline and budget constraints, in: International Journal of High Performance Computing Applications, June 2019, pp. 1-19. [ DOI : 10.1177/1094342019852135 ]

    https://hal.inria.fr/hal-02291031
  • [9] L.-C. Canon, L. Marchal, B. Simon, F. Vivien.

    Online Scheduling of Task Graphs on Heterogeneous Platforms, in: IEEE Transactions on Parallel and Distributed Systems, 2019, pp. 1-12, forthcoming. [ DOI : 10.1109/TPDS.2019.2942909 ]

    https://hal.inria.fr/hal-02291268
  • [10] L. Han, V. Le Fèvre, L.-C. Canon, Y. Robert, F. Vivien.

    A Generic Approach to Scheduling and Checkpointing Workflows, in: International Journal of High Performance Computing Applications, May 2019, pp. 1-19. [ DOI : 10.1177/1094342019866891 ]

    https://hal.inria.fr/hal-02140295
  • [11] J. Herrmann, Y. M. Özkaya, B. Uçar, K. Kaya, U. V. Catalyurek.

    Multilevel Algorithms for Acyclic Partitioning of Directed Acyclic Graphs, in: SIAM Journal on Scientific Computing, July 2019, vol. 41, no 4, pp. A2117-A2145. [ DOI : 10.1137/18M1176865 ]

    https://hal.inria.fr/hal-02306566
  • [12] L. Marchal, B. Simon, F. Vivien.

    Limiting the memory footprint when dynamically scheduling DAGs on shared-memory platforms, in: Journal of Parallel and Distributed Computing, February 2019, vol. 128, pp. 30-42. [ DOI : 10.1016/j.jpdc.2019.01.009 ]

    https://hal.inria.fr/hal-02025521
  • [13] F. Pawłowski, B. Uçar, A.-J. Yzelman.

    A multi-dimensional Morton-ordered block storage for mode-oblivious tensor computations, in: Journal of Computational Science, March 2019, pp. 1-35, forthcoming. [ DOI : 10.1016/j.jocs.2019.02.007 ]

    https://hal.inria.fr/hal-02082524

International Conferences with Proceedings

  • [14] G. Aupy, A. Gainaru, V. Honoré, P. Raghavan, Y. Robert, H. Sun.

    Reservation Strategies for Stochastic Jobs, in: IPDPS 2019 - 33rd IEEE International Parallel and Distributed Processing Symposium, Rio de Janeiro, Brazil, IEEE, May 2019, pp. 166-175. [ DOI : 10.1109/IPDPS.2019.00027 ]

    https://hal.inria.fr/hal-01968419
  • [15] A. Benoit, T. Hérault, V. Le Fèvre, Y. Robert.

    Replication Is More Efficient Than You Think, in: SC 2019 - International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'19), Denver, United States, November 2019.

    https://hal.inria.fr/hal-02273142
  • [17] Y. Gao, L.-C. Canon, Y. Robert, F. Vivien.

    Scheduling independent stochastic tasks on heterogeneous cloud platforms, in: IEEE Cluster 2019 - International Conference on Cluster Computing, Albuquerque, United States, IEEE, September 2019, pp. 1-11.

    https://hal.inria.fr/hal-02271675
  • [18] L. Han, L.-C. Canon, J. Liu, Y. Robert, F. Vivien.

    Improved energy-aware strategies for periodic real-time tasks under reliability constraints, in: RTSS 2019 - 40th IEEE Real-Time Systems Symposium, York, United Kingdom, February 2020.

    https://hal.inria.fr/hal-02271704
  • [19] T. Herault, Y. Robert, G. Bosilca, J. Dongarra.

    Generic Matrix Multiplication for Multi-GPU Accelerated Distributed-Memory Platforms over PaRSEC, in: ScalA 2019 - IEEE/ACM 10th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, Denver, United States, IEEE, November 2019, pp. 33-41. [ DOI : 10.1109/ScalA49573.2019.00010 ]

    https://hal.inria.fr/hal-02436180
  • [20] K. Kaya, J. Langguth, I. Panagiotas, B. Uçar.

    Karp-Sipser based kernels for bipartite graph matching, in: SIAM Symposium on Algorithm Engineering and Experiments (ALENEX20), Salt Lake City, Utah, United States, January 2020.

    https://hal.inria.fr/hal-02350734
  • [21] J. Li, B. Uçar, U. V. Catalyurek, J. Sun, K. Barker, R. Vuduc.

    Efficient and effective sparse tensor reordering, in: ICS 2019 - ACM International Conference on Supercomputing, Phoenix, United States, June 2019, pp. 227-237. [ DOI : 10.1145/3330345.3330366 ]

    https://hal.inria.fr/hal-02306569
  • [22] F. Pawłowski, B. Uçar, A.-J. Yzelman.

    High performance tensor-vector multiplication on shared-memory systems, in: PPAM 2019 - 13th International Conference on Parallel Processing and Applied Mathematics, Bialystok, Poland, September 2019, pp. 1-11.

    https://hal.inria.fr/hal-02332496
  • [23] R. Portase, B. Uçar.

    Matrix symmetrization and sparse direct solvers, in: SIAM Workshop on Combinatorial Scientific Computing 2020, Seattle, United States, 2020.

    https://hal.inria.fr/hal-02417778
  • [24] Y. M. Özkaya, A. Benoit, U. V. Catalyurek.

    Is Acyclic Directed Graph Partitioning Effective for Locality-Aware Scheduling?, in: PPAM 2019 - 13th International Conference on Parallel Processing and Applied Mathematics, Bialystok, Poland, September 2019.

    https://hal.inria.fr/hal-02273122
  • [25] Y. M. Özkaya, A. Benoit, B. Uçar, J. Herrmann, U. V. Catalyurek.

    A scalable clustering-based task scheduler for homogeneous processors using DAG partitioning, in: IPDPS 2019 - 33rd IEEE International Parallel & Distributed Processing Symposium, Rio de Janeiro, Brazil, IEEE, May 2019, pp. 155-165. [ DOI : 10.1109/IPDPS.2019.00026 ]

    https://hal.inria.fr/hal-02082794

Scientific Books (or Scientific Book chapters)

  • [26] L. Marchal, É. Saule, O. Sinnen.

    Special Issue Proposal for the Parallel Computing Journal: HeteroPar 2016 and HCW 2016 Workshops, Elsevier, April 2019.

    https://hal.inria.fr/hal-02423211

Internal Reports

  • [27] A. Benoit, C. Gou, L. Marchal.

    Partitioning tree-shaped task graphs for distributed platforms with limited memory, Inria Grenoble Rhône-Alpes, March 2019, no RR-9115, pp. 1-34.

    https://hal.inria.fr/hal-01644352
  • [28] A. Benoit, T. Hérault, V. Le Fèvre, Y. Robert.

    Replication Is More Efficient Than You Think, Inria - Research Centre Grenoble – Rhône-Alpes, 2019, no RR-9278.

    https://hal.inria.fr/hal-02265925
  • [29] A. Benoit, V. Le Fèvre, P. Raghavan, Y. Robert, H. Sun.

    Design and Comparison of Resilient Scheduling Heuristics for Parallel Jobs, Inria - Research Centre Grenoble – Rhône-Alpes, October 2019, no RR-9296, pp. 1-29.

    https://hal.inria.fr/hal-02317464
  • [30] L.-C. Canon, A. Kong Win Chang, Y. Robert, F. Vivien.

    Scheduling independent stochastic tasks under deadline and budget constraints. Extended Version, Inria Grenoble Rhône-Alpes, February 2019, no RR-9257, pp. 1-38.

    https://hal.inria.fr/hal-02025785
  • [31] A. Gainaru, B. Goglin, V. Honoré, G. Pallez, P. Raghavan, Y. Robert, H. Sun.

    Reservation and Checkpointing Strategies for Stochastic Jobs (Extended Version), Inria & Labri, Univ. Bordeaux ; Department of EECS, Vanderbilt University, Nashville, TN, USA ; Laboratoire LIP, ENS Lyon & University of Tennessee Knoxville, Lyon, France, October 2019, no RR-9294.

    https://hal.inria.fr/hal-02328013
  • [32] Y. Gao, L.-C. Canon, Y. Robert, F. Vivien.

    Scheduling independent stochastic tasks on heterogeneous cloud platforms, Inria - Research Centre Grenoble – Rhône-Alpes, May 2019, no RR-9275.

    https://hal.inria.fr/hal-02141253
  • [33] Y. Gao, L.-C. Canon, F. Vivien, Y. Robert.

    Scheduling stochastic tasks on heterogeneous cloud platforms under budget and deadline constraints, Inria - Research Centre Grenoble – Rhône-Alpes, February 2019, no RR-9260, pp. 1-34.

    https://hal.inria.fr/hal-02047434
  • [34] L. Han, L.-C. Canon, J. Liu, Y. Robert, F. Vivien.

    Improved energy-aware strategies for periodic real-time tasks under reliability constraints, Inria - Research Centre Grenoble – Rhône-Alpes, February 2019, no RR-9259, pp. 1-38.

    https://hal.inria.fr/hal-02056520
  • [35] T. Hérault, Y. Robert, G. Bosilca, J. Dongarra.

    Generic matrix multiplication for multi-GPU accelerated distributed-memory platforms over PaRSEC, Inria Grenoble - Rhône-Alpes, September 2019, no RR-9289.

    https://hal.inria.fr/hal-02282529
  • [36] F. Pawłowski, B. Uçar, A.-J. Yzelman.

    High performance tensor-vector multiplies on shared memory systems, Inria - Research Centre Grenoble – Rhône-Alpes, May 2019, no RR-9274, pp. 1-20.

    https://hal.inria.fr/hal-02123526

Other Publications

References in notes
  • [39] Blue Waters Newsletter, December 2012.
  • [40] Blue Waters Resources, 2013.

    https://bluewaters.ncsa.illinois.edu/data
  • [41] The BOINC project, 2013.

    http://boinc.berkeley.edu/
  • [42] Final report of the Department of Energy Fault Management Workshop, December 2012.

    https://science.energy.gov/~/media/ascr/pdf/program-documents/docs/FaultManagement-wrkshpRpt-v4-final.pdf
  • [43] System Resilience at Extreme Scale: white paper, 2008, DARPA.

    https://pdfs.semanticscholar.org/9fcb/154d6afce23cd9951fd7c116b86255d91b5c.pdf
  • [44] Top500 List - November, 2011.

    http://www.top500.org/list/2011/11/
  • [45] Top500 List - November, 2012.

    http://www.top500.org/list/2012/11/
  • [46] M. Amaris, G. Lucarelli, C. Mommessin, D. Trystram.

    Generic Algorithms for Scheduling Applications on Hybrid Multi-core Machines, in: Euro-Par 2017: Parallel Processing, 2017, pp. 220–231.
  • [47] I. Assayad, A. Girault, H. Kalla.

    Tradeoff exploration between reliability, power consumption, and execution time, in: Proceedings of SAFECOMP, the Conf. on Computer Safety, Reliability and Security, Washington, DC, USA, 2011.
  • [48] H. Aydin, Q. Yang.

    Energy-aware partitioning for multiprocessor real-time systems, in: IPDPS'03, the IEEE Int. Parallel and Distributed Processing Symposium, 2003, pp. 113–121.
  • [49] N. Bansal, T. Kimbrel, K. Pruhs.

    Speed Scaling to Manage Energy and Temperature, in: Journal of the ACM, 2007, vol. 54, no 1, pp. 1–39.

    http://doi.acm.org/10.1145/1206035.1206038
  • [50] A. Benoit, L. Marchal, J.-F. Pineau, Y. Robert, F. Vivien.

    Scheduling concurrent bag-of-tasks applications on heterogeneous platforms, in: IEEE Transactions on Computers, 2010, vol. 59, no 2, pp. 202-217.
  • [51] S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, R. C. Whaley.

    ScaLAPACK Users' Guide, SIAM, 1997.
  • [52] S. Blackford, J. Dongarra.

    Installation Guide for LAPACK, LAPACK Working Note, June 1999, no 41, originally released March 1992.
  • [53] A. Buttari, J. Langou, J. Kurzak, J. Dongarra.

    Parallel tiled QR factorization for multicore architectures, in: Concurrency: Practice and Experience, 2008, vol. 20, no 13, pp. 1573-1590.
  • [54] J.-J. Chen, T.-W. Kuo.

    Multiprocessor energy-efficient scheduling for real-time tasks, in: ICPP'05, the Int. Conference on Parallel Processing, 2005, pp. 13–20.
  • [55] S. Donfack, L. Grigori, W. Gropp, L. V. Kale.

    Hybrid Static/dynamic Scheduling for Already Optimized Dense Matrix Factorization, in: Parallel Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International, 2012, pp. 496-507.

    http://dx.doi.org/10.1109/IPDPS.2012.53
  • [56] J. Dongarra, J.-F. Pineau, Y. Robert, Z. Shi, F. Vivien.

    Revisiting Matrix Product on Master-Worker Platforms, in: International Journal of Foundations of Computer Science, 2008, vol. 19, no 6, pp. 1317-1336.
  • [57] J. Dongarra, J.-F. Pineau, Y. Robert, F. Vivien.

    Matrix Product on Heterogeneous Master-Worker Platforms, in: 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Salt Lake City, Utah, February 2008, pp. 53–62.
  • [58] I. S. Duff, J. K. Reid.

    The multifrontal solution of indefinite sparse symmetric linear systems, in: ACM Transactions on Mathematical Software, 1983, vol. 9, pp. 302-325.
  • [59] I. S. Duff, J. K. Reid.

    The multifrontal solution of unsymmetric sets of linear systems, in: SIAM Journal on Scientific and Statistical Computing, 1984, vol. 5, pp. 633-641.
  • [60] L. Grigori, J. W. Demmel, H. Xiang.

    Communication avoiding Gaussian elimination, in: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, Piscataway, NJ, USA, SC '08, IEEE Press, 2008, 29:1 p.

    http://dl.acm.org/citation.cfm?id=1413370.1413400
  • [61] B. Hadri, H. Ltaief, E. Agullo, J. Dongarra.

    Tile QR Factorization with Parallel Panel Processing for Multicore Architectures, in: IPDPS'10, the 24th IEEE Int. Parallel and Distributed Processing Symposium, 2010.
  • [62] M. A. Haque, H. Aydin, D. Zhu.

    On reliability management of energy-aware real-time systems through task replication, in: IEEE Transactions on Parallel and Distributed Systems, 2017, vol. 28, no 3, pp. 813–825.
  • [63] J. W. H. Liu.

    The multifrontal method for sparse matrix solution: Theory and Practice, in: SIAM Review, 1992, vol. 34, pp. 82–109.
  • [64] R. Melhem, D. Mossé, E. Elnozahy.

    The Interplay of Power Management and Fault Recovery in Real-Time Systems, in: IEEE Transactions on Computers, 2004, vol. 53, no 2, pp. 217-231.
  • [65] A. J. Oliner, R. K. Sahoo, J. E. Moreira, M. Gupta, A. Sivasubramaniam.

    Fault-aware job scheduling for BlueGene/L systems, in: IPDPS'04, the IEEE Int. Parallel and Distributed Processing Symposium, 2004, pp. 64–73.
  • [66] G. Quintana-Ortí, E. Quintana-Ortí, R. A. van de Geijn, F. G. V. Zee, E. Chan.

    Programming Matrix Algorithms-by-Blocks for Thread-Level Parallelism, in: ACM Transactions on Mathematical Software, 2009, vol. 36, no 3.
  • [67] Y. Robert, F. Vivien.

    Algorithmic Issues in Grid Computing, in: Algorithms and Theory of Computation Handbook, Chapman and Hall/CRC Press, 2009.
  • [68] G. Zheng, X. Ni, L. V. Kale.

    A scalable double in-memory checkpoint and restart scheme towards exascale, in: Dependable Systems and Networks Workshops (DSN-W), 2012.

    http://dx.doi.org/10.1109/DSNW.2012.6264677
  • [69] D. Zhu, R. Melhem, D. Mossé.

    The effects of energy management on reliability in real-time embedded systems, in: Proc. of IEEE/ACM Int. Conf. on Computer-Aided Design (ICCAD), 2004, pp. 35–40.