Bibliography

Publications of the year

Doctoral Dissertations and Habilitation Theses

1B. Uçar.

Partitioning, matching, and ordering: Combinatorial scientific computing with matrices and tensors, ENS de Lyon, September 2019, Habilitation à diriger des recherches.

https://hal.inria.fr/tel-02377874

Articles in International Peer-Reviewed Journals

2P. R. Amestoy, A. Buttari, J.-Y. L'Excellent, T. Mary.

Bridging the gap between flat and hierarchical low-rank matrix formats: the multilevel BLR format, in: SIAM Journal on Scientific Computing, May 2019, vol. 41, n^o 3, pp. A1414-A1442. [ DOI : 10.1137/18M1182760 ]

https://hal.archives-ouvertes.fr/hal-01774642
3P. R. Amestoy, A. Buttari, J.-Y. L'Excellent, T. Mary.

Performance and Scalability of the Block Low-Rank Multifrontal Factorization on Multicore Architectures, in: ACM Transactions on Mathematical Software, February 2019, vol. 45, n^o 1, pp. 1-23. [ DOI : 10.1145/3242094 ]

https://hal.archives-ouvertes.fr/hal-01955766
4P. R. Amestoy, J.-Y. L'Excellent, G. Moreau.

On exploiting sparsity of multiple right-hand sides in sparse direct solvers, in: SIAM Journal on Scientific Computing, 2019, vol. 41, n^o 1, pp. A269-A291. [ DOI : 10.1137/17M1151882 ]

https://hal.inria.fr/hal-01955659
5G. Aupy, A. Benoit, B. Goglin, L. Pottier, Y. Robert.

Co-scheduling HPC workloads on cache-partitioned CMP platforms, in: International Journal of High Performance Computing Applications, April 2019, vol. 33, n^o 6, pp. 1221-1239. [ DOI : 10.1177/1094342019846956 ]

https://hal.inria.fr/hal-02093172
6O. Beaumont, T. Lambert, L. Marchal, B. Thomas.

Performance Analysis and Optimality Results for Data-Locality Aware Tasks Scheduling with Replicated Inputs, in: Future Generation Computer Systems, October 2019, pp. 1-28. [ DOI : 10.1016/j.future.2019.08.024 ]

https://hal.inria.fr/hal-02275473
7A. Benoit, A. Cavelan, F. Ciorba, V. Le Fèvre, Y. Robert.

Combining Checkpointing and Replication for Reliable Execution of Linear Workflows with Fail-Stop and Silent Errors, in: International Journal of Networking and Computing, 2019, vol. 9, n^o 1, pp. 2-27. [ DOI : 10.15803/ijnc.9.1_2 ]

https://hal.inria.fr/hal-02082369
8L.-C. Canon, A. K. W. Chang, Y. Robert, F. Vivien.

Scheduling independent stochastic tasks under deadline and budget constraints, in: International Journal of High Performance Computing Applications, June 2019, pp. 1-19. [ DOI : 10.1177/1094342019852135 ]

https://hal.inria.fr/hal-02291031
9L.-C. Canon, L. Marchal, B. Simon, F. Vivien.

Online Scheduling of Task Graphs on Heterogeneous Platforms, in: IEEE Transactions on Parallel and Distributed Systems, 2019, pp. 1-12, forthcoming. [ DOI : 10.1109/TPDS.2019.2942909 ]

https://hal.inria.fr/hal-02291268
10L. Han, V. Le Fèvre, L.-C. Canon, Y. Robert, F. Vivien.

A Generic Approach to Scheduling and Checkpointing Workflows, in: International Journal of High Performance Computing Applications, May 2019, pp. 1-19. [ DOI : 10.1177/1094342019866891 ]

https://hal.inria.fr/hal-02140295
11J. Herrmann, Y. M. Özkaya, B. Uçar, K. Kaya, U. V. Catalyurek.

Multilevel Algorithms for Acyclic Partitioning of Directed Acyclic Graphs, in: SIAM Journal on Scientific Computing, July 2019, vol. 41, n^o 4, pp. A2117-A2145. [ DOI : 10.1137/18M1176865 ]

https://hal.inria.fr/hal-02306566
12L. Marchal, B. Simon, F. Vivien.

Limiting the memory footprint when dynamically scheduling DAGs on shared-memory platforms, in: Journal of Parallel and Distributed Computing, February 2019, vol. 128, pp. 30-42. [ DOI : 10.1016/j.jpdc.2019.01.009 ]

https://hal.inria.fr/hal-02025521
13F. Pawłowski, B. Uçar, A.-J. Yzelman.

A multi-dimensional Morton-ordered block storage for mode-oblivious tensor computations, in: Journal of computational science, March 2019, pp. 1-35, forthcoming. [ DOI : 10.1016/j.jocs.2019.02.007 ]

https://hal.inria.fr/hal-02082524

International Conferences with Proceedings

14G. Aupy, A. Gainaru, V. Honoré, P. Raghavan, Y. Robert, H. Sun.

Reservation Strategies for Stochastic Jobs, in: IPDPS 2019 - 33rd IEEE International Parallel and Distributed Processing Symposium, Rio de Janeiro, Brazil, IEEE, May 2019, pp. 166-175. [ DOI : 10.1109/IPDPS.2019.00027 ]

https://hal.inria.fr/hal-01968419
15A. Benoit, T. Hérault, V. Le Fèvre, Y. Robert.

Replication Is More Efficient Than You Think, in: SC 2019 - International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'19), Denver, United States, November 2019.

https://hal.inria.fr/hal-02273142
17Y. Gao, L.-C. Canon, Y. Robert, F. Vivien.

Scheduling independent stochastic tasks on heterogeneous cloud platforms, in: IEEE Cluster 2019 - International Conference on Cluster Computing, Albuquerque, United States, IEEE, September 2019, pp. 1-11.

https://hal.inria.fr/hal-02271675
18L. Han, L.-C. Canon, J. Liu, Y. Robert, F. Vivien.

Improved energy-aware strategies for periodic real-time tasks under reliability constraints, in: RTSS 2019 - 40th IEEE Real-Time Systems Symposium, York, United Kingdom, February 2020.

https://hal.inria.fr/hal-02271704
19T. Herault, Y. Robert, G. Bosilca, J. Dongarra.

Generic Matrix Multiplication for Multi-GPU Accelerated Distributed-Memory Platforms over PaRSEC, in: ScalA 2019 - IEEE/ACM 10th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, Denver, United States, IEEE, November 2019, pp. 33-41. [ DOI : 10.1109/ScalA49573.2019.00010 ]

https://hal.inria.fr/hal-02436180
20K. Kaya, J. Langguth, I. Panagiotas, B. Uçar.

Karp-Sipser based kernels for bipartite graph matching, in: SIAM Symposium on Algorithm Engineering and Experiments (ALENEX20), Salt Lake City, Utah, United States, January 2020.

https://hal.inria.fr/hal-02350734
21J. Li, B. Uçar, U. V. Catalyurek, J. Sun, K. Barker, R. Vuduc.

Efficient and effective sparse tensor reordering, in: ICS 2019 - ACM International Conference on Supercomputing, Phoenix, United States, June 2019, pp. 227-237. [ DOI : 10.1145/3330345.3330366 ]

https://hal.inria.fr/hal-02306569
22F. Pawłowski, B. Uçar, A.-J. Yzelman.

High performance tensor-vector multiplication on shared-memory systems, in: PPAM 2019 - 13th International Conference on Parallel Processing and Applied Mathematics, Bialystok, Poland, September 2019, pp. 1-11.

https://hal.inria.fr/hal-02332496
23R. Portase, B. Uçar.

Matrix symmetrization and sparse direct solvers, in: SIAM Workshop on Combinatorial Scientific Computing 2020, Seattle, United States, 2020.

https://hal.inria.fr/hal-02417778
24Y. M. Özkaya, A. Benoit, U. V. Catalyurek.

Is Acyclic Directed Graph Partitioning Effective for Locality-Aware Scheduling?, in: PPAM 2019 - 13th International Conference on Parallel Processing and Applied Mathematics, Bialystok, Poland, September 2019.

https://hal.inria.fr/hal-02273122
25Y. M. Özkaya, A. Benoit, B. Uçar, J. Herrmann, U. V. Catalyurek.

A scalable clustering-based task scheduler for homogeneous processors using DAG partitioning, in: IPDPS 2019 - 33rd IEEE International Parallel & Distributed Processing Symposium, Rio de Janeiro, Brazil, IEEE, May 2019, pp. 155-165. [ DOI : 10.1109/IPDPS.2019.00026 ]

https://hal.inria.fr/hal-02082794

Scientific Books (or Scientific Book chapters)

26L. Marchal, É. Saule, O. Sinnen.

Special Issue Proposal for the Parallel Computing Journal: HeteroPar 2016 and HCW 2016 Workshops, Elsevier, April 2019.

https://hal.inria.fr/hal-02423211

Internal Reports

27A. Benoit, C. Gou, L. Marchal.

Partitioning tree-shaped task graphs for distributed platforms with limited memory, Inria Grenoble Rhône-Alpes, March 2019, n^o RR-9115, pp. 1-34.

https://hal.inria.fr/hal-01644352
28A. Benoit, T. Hérault, V. Le Fèvre, Y. Robert.

Replication Is More Efficient Than You Think, Inria - Research Centre Grenoble – Rhône-Alpes, 2019, n^o RR-9278.

https://hal.inria.fr/hal-02265925
29A. Benoit, V. Le Fèvre, P. Raghavan, Y. Robert, H. Sun.

Design and Comparison of Resilient Scheduling Heuristics for Parallel Jobs, Inria - Research Centre Grenoble – Rhône-Alpes, October 2019, n^o RR-9296, pp. 1-29.

https://hal.inria.fr/hal-02317464
30L.-C. Canon, A. Kong Win Chang, Y. Robert, F. Vivien.

Scheduling independent stochastic tasks under deadline and budget constraints. Extended Version, Inria Grenoble Rhône-Alpes, February 2019, n^o RR-9257, pp. 1-38.

https://hal.inria.fr/hal-02025785
31A. Gainaru, B. Goglin, V. Honoré, G. Pallez, P. Raghavan, Y. Robert, H. Sun.

Reservation and Checkpointing Strategies for Stochastic Jobs (Extended Version), Inria & Labri, Univ. Bordeaux ; Department of EECS, Vanderbilt University, Nashville, TN, USA ; Laboratoire LIP, ENS Lyon & University of Tennessee Knoxville, Lyon, France, October 2019, n^o RR-9294.

https://hal.inria.fr/hal-02328013
32Y. Gao, L.-C. Canon, Y. Robert, F. Vivien.

Scheduling independent stochastic tasks on heterogeneous cloud platforms, Inria - Research Centre Grenoble – Rhône-Alpes, May 2019, n^o RR-9275.

https://hal.inria.fr/hal-02141253
33Y. Gao, L.-C. Canon, F. Vivien, Y. Robert.

Scheduling stochastic tasks on heterogeneous cloud platforms under budget and deadline constraints, Inria - Research Centre Grenoble – Rhône-Alpes, February 2019, n^o RR-9260, pp. 1-34.

https://hal.inria.fr/hal-02047434
34L. Han, L.-C. Canon, J. Liu, Y. Robert, F. Vivien.

Improved energy-aware strategies for periodic real-time tasks under reliability constraints, Inria - Research Centre Grenoble – Rhône-Alpes, February 2019, n^o RR-9259, pp. 1-38.

https://hal.inria.fr/hal-02056520
35T. Hérault, Y. Robert, G. Bosilca, J. Dongarra.

Generic matrix multiplication for multi-GPU accelerated distributed-memory platforms over PaRSEC, Inria Grenoble - Rhone-Alpes, September 2019, n^o RR-9289.

https://hal.inria.fr/hal-02282529
36F. Pawłowski, B. Uçar, A.-J. Yzelman.

High performance tensor-vector multiplies on shared memory systems, Inria - Research Centre Grenoble – Rhône-Alpes, May 2019, n^o RR-9274, pp. 1-20.

https://hal.inria.fr/hal-02123526

Other Publications

37A. Azad, B. Uçar, A. Pothen.

Trends in Combinatorial Analysis: Complex Data, Machine Learning, and High-Performance Computing, SIAM, September 2019, pp. 1-3.

https://hal.inria.fr/hal-02304457
38C. Mommessin, O. Beaumont, L.-C. Canon, L. Eyraud-Dubois, G. Lucarelli, L. Marchal, B. Simon, D. Trystram.

Scheduling on Two Types of Resources: a Survey, January 2020, https://arxiv.org/abs/1909.11365 - working paper or preprint.

https://hal.inria.fr/hal-02432381

References in notes

39Blue Waters Newsletter, dec 2012.
40Blue Waters Resources, 2013.

https://bluewaters.ncsa.illinois.edu/data
41The BOINC project, 2013.

http://boinc.berkeley.edu/
42Final report of the Department of Energy Fault Management Workshop, December 2012.

https://science.energy.gov/~/media/ascr/pdf/program-documents/docs/FaultManagement-wrkshpRpt-v4-final.pdf
43System Resilience at Extreme Scale: white paper, 2008, DARPA.

https://pdfs.semanticscholar.org/9fcb/154d6afce23cd9951fd7c116b86255d91b5c.pdf
44Top500 List - November, 2011.

http://www.top500.org/list/2011/11/
45Top500 List - November, 2012.

http://www.top500.org/list/2012/11/
46M. Amaris, G. Lucarelli, C. Mommessin, D. Trystram.

Generic Algorithms for Scheduling Applications on Hybrid Multi-core Machines, in: Euro-Par 2017: Parallel Processing, 2017, pp. 220–231.
47I. Assayad, A. Girault, H. Kalla.

Tradeoff exploration between reliability power consumption and execution time, in: Proceedings of SAFECOMP, the Conf. on Computer Safety, Reliability and Security, Washington, DC, USA, 2011.
48H. Aydin, Q. Yang.

Energy-aware partitioning for multiprocessor real-time systems, in: IPDPS'03, the IEEE Int. Parallel and Distributed Processing Symposium, 2003, pp. 113–121.
49N. Bansal, T. Kimbrel, K. Pruhs.

Speed Scaling to Manage Energy and Temperature, in: Journal of the ACM, 2007, vol. 54, n^o 1, pp. 1 – 39.

http://doi.acm.org/10.1145/1206035.1206038
50A. Benoit, L. Marchal, J.-F. Pineau, Y. Robert, F. Vivien.

Scheduling concurrent bag-of-tasks applications on heterogeneous platforms, in: IEEE Transactions on Computers, 2010, vol. 59, n^o 2, pp. 202-217.
51S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, R. C. Whaley.

ScaLAPACK Users' Guide, SIAM, 1997.
52S. Blackford, J. Dongarra.

Installation Guide for LAPACK, LAPACK Working Note, June 1999, n^o 41, originally released March 1992.
53A. Buttari, J. Langou, J. Kurzak, J. Dongarra.

Parallel tiled QR factorization for multicore architectures, in: Concurrency: Practice and Experience, 2008, vol. 20, n^o 13, pp. 1573-1590.
54J.-J. Chen, T.-W. Kuo.

Multiprocessor energy-efficient scheduling for real-time tasks, in: ICPP'05, the Int. Conference on Parallel Processing, 2005, pp. 13–20.
55S. Donfack, L. Grigori, W. Gropp, L. V. Kale.

Hybrid Static/dynamic Scheduling for Already Optimized Dense Matrix Factorization, in: Parallel Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International, 2012, pp. 496-507.

http://dx.doi.org/10.1109/IPDPS.2012.53
56J. Dongarra, J.-F. Pineau, Y. Robert, Z. Shi, F. Vivien.

Revisiting Matrix Product on Master-Worker Platforms, in: International Journal of Foundations of Computer Science, 2008, vol. 19, n^o 6, pp. 1317-1336.
57J. Dongarra, J.-F. Pineau, Y. Robert, F. Vivien.

Matrix Product on Heterogeneous Master-Worker Platforms, in: 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Salt Lake City, Utah, February 2008, pp. 53–62.
58I. S. Duff, J. K. Reid.

The multifrontal solution of indefinite sparse symmetric linear systems, in: "ACM Transactions on Mathematical Software", 1983, vol. 9, pp. 302-325.
59I. S. Duff, J. K. Reid.

The multifrontal solution of unsymmetric sets of linear systems, in: SIAM Journal on Scientific and Statistical Computing, 1984, vol. 5, pp. 633-641.
60L. Grigori, J. W. Demmel, H. Xiang.

Communication avoiding Gaussian elimination, in: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, Piscataway, NJ, USA, SC '08, IEEE Press, 2008, 29:1 p.

http://dl.acm.org/citation.cfm?id=1413370.1413400
61B. Hadri, H. Ltaief, E. Agullo, J. Dongarra.

Tile QR Factorization with Parallel Panel Processing for Multicore Architectures, in: IPDPS'10, the 24st IEEE Int. Parallel and Distributed Processing Symposium, 2010.
62M. A. Haque, H. Aydin, D. Zhu.

On reliability management of energy-aware real-time systems through task replication, in: IEEE Transactions on Parallel and Distributed Systems, 2017, vol. 28, n^o 3, pp. 813–825.
63J. W. H. Liu.

The multifrontal method for sparse matrix solution: Theory and Practice, in: SIAM Review, 1992, vol. 34, pp. 82–109.
64R. Melhem, D. Mossé, E. Elnozahy.

The Interplay of Power Management and Fault Recovery in Real-Time Systems, in: IEEE Transactions on Computers, 2004, vol. 53, n^o 2, pp. 217-231.
65A. J. Oliner, R. K. Sahoo, J. E. Moreira, M. Gupta, A. Sivasubramaniam.

Fault-aware job scheduling for bluegene/l systems, in: IPDPS'04, the IEEE Int. Parallel and Distributed Processing Symposium, 2004, pp. 64–73.
66G. Quintana-Ortí, E. Quintana-Ortí, R. A. van de Geijn, F. G. V. Zee, E. Chan.

Programming Matrix Algorithms-by-Blocks for Thread-Level Parallelism, in: ACM Transactions on Mathematical Software, 2009, vol. 36, n^o 3.
67Y. Robert, F. Vivien.

Algorithmic Issues in Grid Computing, in: Algorithms and Theory of Computation Handbook, Chapman and Hall/CRC Press, 2009.
68G. Zheng, X. Ni, L. V. Kale.

A scalable double in-memory checkpoint and restart scheme towards exascale, in: Dependable Systems and Networks Workshops (DSN-W), 2012.

http://dx.doi.org/10.1109/DSNW.2012.6264677
69D. Zhu, R. Melhem, D. Mossé.

The effects of energy management on reliability in real-time embedded systems, in: Proc. of IEEE/ACM Int. Conf. on Computer-Aided Design (ICCAD), 2004, pp. 35–40.

Previous |

Home