Bibliography
Publications of the year
Doctoral Dissertations and Habilitation Theses
-
1B. Uçar.
Partitioning, matching, and ordering: Combinatorial scientific computing with matrices and tensors, ENS de Lyon, September 2019, Habilitation à diriger des recherches.
https://hal.inria.fr/tel-02377874
Articles in International Peer-Reviewed Journals
-
2P. R. Amestoy, A. Buttari, J.-Y. L'Excellent, T. Mary.
Bridging the gap between flat and hierarchical low-rank matrix formats: the multilevel BLR format, in: SIAM Journal on Scientific Computing, May 2019, vol. 41, no 3, pp. A1414-A1442. [ DOI : 10.1137/18M1182760 ]
https://hal.archives-ouvertes.fr/hal-01774642 -
3P. R. Amestoy, A. Buttari, J.-Y. L'Excellent, T. Mary.
Performance and Scalability of the Block Low-Rank Multifrontal Factorization on Multicore Architectures, in: ACM Transactions on Mathematical Software, February 2019, vol. 45, no 1, pp. 1-23. [ DOI : 10.1145/3242094 ]
https://hal.archives-ouvertes.fr/hal-01955766 -
4P. R. Amestoy, J.-Y. L'Excellent, G. Moreau.
On exploiting sparsity of multiple right-hand sides in sparse direct solvers, in: SIAM Journal on Scientific Computing, 2019, vol. 41, no 1, pp. A269-A291. [ DOI : 10.1137/17M1151882 ]
https://hal.inria.fr/hal-01955659 -
5G. Aupy, A. Benoit, B. Goglin, L. Pottier, Y. Robert.
Co-scheduling HPC workloads on cache-partitioned CMP platforms, in: International Journal of High Performance Computing Applications, April 2019, vol. 33, no 6, pp. 1221-1239. [ DOI : 10.1177/1094342019846956 ]
https://hal.inria.fr/hal-02093172 -
6O. Beaumont, T. Lambert, L. Marchal, B. Thomas.
Performance Analysis and Optimality Results for Data-Locality Aware Tasks Scheduling with Replicated Inputs, in: Future Generation Computer Systems, October 2019, pp. 1-28. [ DOI : 10.1016/j.future.2019.08.024 ]
https://hal.inria.fr/hal-02275473 -
7A. Benoit, A. Cavelan, F. Ciorba, V. Le Fèvre, Y. Robert.
Combining Checkpointing and Replication for Reliable Execution of Linear Workflows with Fail-Stop and Silent Errors, in: International Journal of Networking and Computing, 2019, vol. 9, no 1, pp. 2-27. [ DOI : 10.15803/ijnc.9.1_2 ]
https://hal.inria.fr/hal-02082369 -
8L.-C. Canon, A. K. W. Chang, Y. Robert, F. Vivien.
Scheduling independent stochastic tasks under deadline and budget constraints, in: International Journal of High Performance Computing Applications, June 2019, pp. 1-19. [ DOI : 10.1177/1094342019852135 ]
https://hal.inria.fr/hal-02291031 -
9L.-C. Canon, L. Marchal, B. Simon, F. Vivien.
Online Scheduling of Task Graphs on Heterogeneous Platforms, in: IEEE Transactions on Parallel and Distributed Systems, 2019, pp. 1-12, forthcoming. [ DOI : 10.1109/TPDS.2019.2942909 ]
https://hal.inria.fr/hal-02291268 -
10L. Han, V. Le Fèvre, L.-C. Canon, Y. Robert, F. Vivien.
A Generic Approach to Scheduling and Checkpointing Workflows, in: International Journal of High Performance Computing Applications, May 2019, pp. 1-19. [ DOI : 10.1177/1094342019866891 ]
https://hal.inria.fr/hal-02140295 -
11J. Herrmann, Y. M. Özkaya, B. Uçar, K. Kaya, U. V. Catalyurek.
Multilevel Algorithms for Acyclic Partitioning of Directed Acyclic Graphs, in: SIAM Journal on Scientific Computing, July 2019, vol. 41, no 4, pp. A2117-A2145. [ DOI : 10.1137/18M1176865 ]
https://hal.inria.fr/hal-02306566 -
12L. Marchal, B. Simon, F. Vivien.
Limiting the memory footprint when dynamically scheduling DAGs on shared-memory platforms, in: Journal of Parallel and Distributed Computing, February 2019, vol. 128, pp. 30-42. [ DOI : 10.1016/j.jpdc.2019.01.009 ]
https://hal.inria.fr/hal-02025521 -
13F. Pawłowski, B. Uçar, A.-J. Yzelman.
A multi-dimensional Morton-ordered block storage for mode-oblivious tensor computations, in: Journal of computational science, March 2019, pp. 1-35, forthcoming. [ DOI : 10.1016/j.jocs.2019.02.007 ]
https://hal.inria.fr/hal-02082524
International Conferences with Proceedings
-
14G. Aupy, A. Gainaru, V. Honoré, P. Raghavan, Y. Robert, H. Sun.
Reservation Strategies for Stochastic Jobs, in: IPDPS 2019 - 33rd IEEE International Parallel and Distributed Processing Symposium, Rio de Janeiro, Brazil, IEEE, May 2019, pp. 166-175. [ DOI : 10.1109/IPDPS.2019.00027 ]
https://hal.inria.fr/hal-01968419 -
15A. Benoit, T. Hérault, V. Le Fèvre, Y. Robert.
Replication Is More Efficient Than You Think, in: SC 2019 - International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'19), Denver, United States, November 2019.
https://hal.inria.fr/hal-02273142 -
17Y. Gao, L.-C. Canon, Y. Robert, F. Vivien.
Scheduling independent stochastic tasks on heterogeneous cloud platforms, in: IEEE Cluster 2019 - International Conference on Cluster Computing, Albuquerque, United States, IEEE, September 2019, pp. 1-11.
https://hal.inria.fr/hal-02271675 -
18L. Han, L.-C. Canon, J. Liu, Y. Robert, F. Vivien.
Improved energy-aware strategies for periodic real-time tasks under reliability constraints, in: RTSS 2019 - 40th IEEE Real-Time Systems Symposium, York, United Kingdom, February 2020.
https://hal.inria.fr/hal-02271704 -
19T. Herault, Y. Robert, G. Bosilca, J. Dongarra.
Generic Matrix Multiplication for Multi-GPU Accelerated Distributed-Memory Platforms over PaRSEC, in: ScalA 2019 - IEEE/ACM 10th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, Denver, United States, IEEE, November 2019, pp. 33-41. [ DOI : 10.1109/ScalA49573.2019.00010 ]
https://hal.inria.fr/hal-02436180 -
20K. Kaya, J. Langguth, I. Panagiotas, B. Uçar.
Karp-Sipser based kernels for bipartite graph matching, in: SIAM Symposium on Algorithm Engineering and Experiments (ALENEX20), Salt Lake City, Utah, United States, January 2020.
https://hal.inria.fr/hal-02350734 -
21J. Li, B. Uçar, U. V. Catalyurek, J. Sun, K. Barker, R. Vuduc.
Efficient and effective sparse tensor reordering, in: ICS 2019 - ACM International Conference on Supercomputing, Phoenix, United States, June 2019, pp. 227-237. [ DOI : 10.1145/3330345.3330366 ]
https://hal.inria.fr/hal-02306569 -
22F. Pawłowski, B. Uçar, A.-J. Yzelman.
High performance tensor-vector multiplication on shared-memory systems, in: PPAM 2019 - 13th International Conference on Parallel Processing and Applied Mathematics, Bialystok, Poland, September 2019, pp. 1-11.
https://hal.inria.fr/hal-02332496 -
23R. Portase, B. Uçar.
Matrix symmetrization and sparse direct solvers, in: SIAM Workshop on Combinatorial Scientific Computing 2020, Seattle, United States, 2020.
https://hal.inria.fr/hal-02417778 -
24Y. M. Özkaya, A. Benoit, U. V. Catalyurek.
Is Acyclic Directed Graph Partitioning Effective for Locality-Aware Scheduling?, in: PPAM 2019 - 13th International Conference on Parallel Processing and Applied Mathematics, Bialystok, Poland, September 2019.
https://hal.inria.fr/hal-02273122 -
25Y. M. Özkaya, A. Benoit, B. Uçar, J. Herrmann, U. V. Catalyurek.
A scalable clustering-based task scheduler for homogeneous processors using DAG partitioning, in: IPDPS 2019 - 33rd IEEE International Parallel & Distributed Processing Symposium, Rio de Janeiro, Brazil, IEEE, May 2019, pp. 155-165. [ DOI : 10.1109/IPDPS.2019.00026 ]
https://hal.inria.fr/hal-02082794
Scientific Books (or Scientific Book chapters)
-
26L. Marchal, É. Saule, O. Sinnen.
Special Issue Proposal for the Parallel Computing Journal: HeteroPar 2016 and HCW 2016 Workshops, Elsevier, April 2019.
https://hal.inria.fr/hal-02423211
Internal Reports
-
27A. Benoit, C. Gou, L. Marchal.
Partitioning tree-shaped task graphs for distributed platforms with limited memory, Inria Grenoble Rhône-Alpes, March 2019, no RR-9115, pp. 1-34.
https://hal.inria.fr/hal-01644352 -
28A. Benoit, T. Hérault, V. Le Fèvre, Y. Robert.
Replication Is More Efficient Than You Think, Inria - Research Centre Grenoble – Rhône-Alpes, 2019, no RR-9278.
https://hal.inria.fr/hal-02265925 -
29A. Benoit, V. Le Fèvre, P. Raghavan, Y. Robert, H. Sun.
Design and Comparison of Resilient Scheduling Heuristics for Parallel Jobs, Inria - Research Centre Grenoble – Rhône-Alpes, October 2019, no RR-9296, pp. 1-29.
https://hal.inria.fr/hal-02317464 -
30L.-C. Canon, A. Kong Win Chang, Y. Robert, F. Vivien.
Scheduling independent stochastic tasks under deadline and budget constraints. Extended Version, Inria Grenoble Rhône-Alpes, February 2019, no RR-9257, pp. 1-38.
https://hal.inria.fr/hal-02025785 -
31A. Gainaru, B. Goglin, V. Honoré, G. Pallez, P. Raghavan, Y. Robert, H. Sun.
Reservation and Checkpointing Strategies for Stochastic Jobs (Extended Version), Inria & Labri, Univ. Bordeaux ; Department of EECS, Vanderbilt University, Nashville, TN, USA ; Laboratoire LIP, ENS Lyon & University of Tennessee Knoxville, Lyon, France, October 2019, no RR-9294.
https://hal.inria.fr/hal-02328013 -
32Y. Gao, L.-C. Canon, Y. Robert, F. Vivien.
Scheduling independent stochastic tasks on heterogeneous cloud platforms, Inria - Research Centre Grenoble – Rhône-Alpes, May 2019, no RR-9275.
https://hal.inria.fr/hal-02141253 -
33Y. Gao, L.-C. Canon, F. Vivien, Y. Robert.
Scheduling stochastic tasks on heterogeneous cloud platforms under budget and deadline constraints, Inria - Research Centre Grenoble – Rhône-Alpes, February 2019, no RR-9260, pp. 1-34.
https://hal.inria.fr/hal-02047434 -
34L. Han, L.-C. Canon, J. Liu, Y. Robert, F. Vivien.
Improved energy-aware strategies for periodic real-time tasks under reliability constraints, Inria - Research Centre Grenoble – Rhône-Alpes, February 2019, no RR-9259, pp. 1-38.
https://hal.inria.fr/hal-02056520 -
35T. Hérault, Y. Robert, G. Bosilca, J. Dongarra.
Generic matrix multiplication for multi-GPU accelerated distributed-memory platforms over PaRSEC, Inria Grenoble - Rhone-Alpes, September 2019, no RR-9289.
https://hal.inria.fr/hal-02282529 -
36F. Pawłowski, B. Uçar, A.-J. Yzelman.
High performance tensor-vector multiplies on shared memory systems, Inria - Research Centre Grenoble – Rhône-Alpes, May 2019, no RR-9274, pp. 1-20.
https://hal.inria.fr/hal-02123526
Other Publications
-
37A. Azad, B. Uçar, A. Pothen.
Trends in Combinatorial Analysis: Complex Data, Machine Learning, and High-Performance Computing, SIAM, September 2019, pp. 1-3.
https://hal.inria.fr/hal-02304457 -
38C. Mommessin, O. Beaumont, L.-C. Canon, L. Eyraud-Dubois, G. Lucarelli, L. Marchal, B. Simon, D. Trystram.
Scheduling on Two Types of Resources: a Survey, January 2020, https://arxiv.org/abs/1909.11365 - working paper or preprint.
https://hal.inria.fr/hal-02432381
- 39Blue Waters Newsletter, dec 2012.
-
40Blue Waters Resources, 2013.
https://bluewaters.ncsa.illinois.edu/data -
41The BOINC project, 2013.
http://boinc.berkeley.edu/ -
42Final report of the Department of Energy Fault Management Workshop, December 2012.
https://science.energy.gov/~/media/ascr/pdf/program-documents/docs/FaultManagement-wrkshpRpt-v4-final.pdf -
43System Resilience at Extreme Scale: white paper, 2008, DARPA.
https://pdfs.semanticscholar.org/9fcb/154d6afce23cd9951fd7c116b86255d91b5c.pdf -
44Top500 List - November, 2011.
http://www.top500.org/list/2011/11/ -
45Top500 List - November, 2012.
http://www.top500.org/list/2012/11/ -
46M. Amaris, G. Lucarelli, C. Mommessin, D. Trystram.
Generic Algorithms for Scheduling Applications on Hybrid Multi-core Machines, in: Euro-Par 2017: Parallel Processing, 2017, pp. 220–231. -
47I. Assayad, A. Girault, H. Kalla.
Tradeoff exploration between reliability power consumption and execution time, in: Proceedings of SAFECOMP, the Conf. on Computer Safety, Reliability and Security, Washington, DC, USA, 2011. -
48H. Aydin, Q. Yang.
Energy-aware partitioning for multiprocessor real-time systems, in: IPDPS'03, the IEEE Int. Parallel and Distributed Processing Symposium, 2003, pp. 113–121. -
49N. Bansal, T. Kimbrel, K. Pruhs.
Speed Scaling to Manage Energy and Temperature, in: Journal of the ACM, 2007, vol. 54, no 1, pp. 1 – 39.
http://doi.acm.org/10.1145/1206035.1206038 -
50A. Benoit, L. Marchal, J.-F. Pineau, Y. Robert, F. Vivien.
Scheduling concurrent bag-of-tasks applications on heterogeneous platforms, in: IEEE Transactions on Computers, 2010, vol. 59, no 2, pp. 202-217. -
51S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, R. C. Whaley.
ScaLAPACK Users' Guide, SIAM, 1997. -
52S. Blackford, J. Dongarra.
Installation Guide for LAPACK, LAPACK Working Note, June 1999, no 41, originally released March 1992. -
53A. Buttari, J. Langou, J. Kurzak, J. Dongarra.
Parallel tiled QR factorization for multicore architectures, in: Concurrency: Practice and Experience, 2008, vol. 20, no 13, pp. 1573-1590. -
54J.-J. Chen, T.-W. Kuo.
Multiprocessor energy-efficient scheduling for real-time tasks, in: ICPP'05, the Int. Conference on Parallel Processing, 2005, pp. 13–20. -
55S. Donfack, L. Grigori, W. Gropp, L. V. Kale.
Hybrid Static/dynamic Scheduling for Already Optimized Dense Matrix Factorization, in: Parallel Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International, 2012, pp. 496-507.
http://dx.doi.org/10.1109/IPDPS.2012.53 -
56J. Dongarra, J.-F. Pineau, Y. Robert, Z. Shi, F. Vivien.
Revisiting Matrix Product on Master-Worker Platforms, in: International Journal of Foundations of Computer Science, 2008, vol. 19, no 6, pp. 1317-1336. -
57J. Dongarra, J.-F. Pineau, Y. Robert, F. Vivien.
Matrix Product on Heterogeneous Master-Worker Platforms, in: 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Salt Lake City, Utah, February 2008, pp. 53–62. -
58I. S. Duff, J. K. Reid.
The multifrontal solution of indefinite sparse symmetric linear systems, in: "ACM Transactions on Mathematical Software", 1983, vol. 9, pp. 302-325. -
59I. S. Duff, J. K. Reid.
The multifrontal solution of unsymmetric sets of linear systems, in: SIAM Journal on Scientific and Statistical Computing, 1984, vol. 5, pp. 633-641. -
60L. Grigori, J. W. Demmel, H. Xiang.
Communication avoiding Gaussian elimination, in: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, Piscataway, NJ, USA, SC '08, IEEE Press, 2008, 29:1 p.
http://dl.acm.org/citation.cfm?id=1413370.1413400 -
61B. Hadri, H. Ltaief, E. Agullo, J. Dongarra.
Tile QR Factorization with Parallel Panel Processing for Multicore Architectures, in: IPDPS'10, the 24st IEEE Int. Parallel and Distributed Processing Symposium, 2010. -
62M. A. Haque, H. Aydin, D. Zhu.
On reliability management of energy-aware real-time systems through task replication, in: IEEE Transactions on Parallel and Distributed Systems, 2017, vol. 28, no 3, pp. 813–825. -
63J. W. H. Liu.
The multifrontal method for sparse matrix solution: Theory and Practice, in: SIAM Review, 1992, vol. 34, pp. 82–109. -
64R. Melhem, D. Mossé, E. Elnozahy.
The Interplay of Power Management and Fault Recovery in Real-Time Systems, in: IEEE Transactions on Computers, 2004, vol. 53, no 2, pp. 217-231. -
65A. J. Oliner, R. K. Sahoo, J. E. Moreira, M. Gupta, A. Sivasubramaniam.
Fault-aware job scheduling for bluegene/l systems, in: IPDPS'04, the IEEE Int. Parallel and Distributed Processing Symposium, 2004, pp. 64–73. -
66G. Quintana-Ortí, E. Quintana-Ortí, R. A. van de Geijn, F. G. V. Zee, E. Chan.
Programming Matrix Algorithms-by-Blocks for Thread-Level Parallelism, in: ACM Transactions on Mathematical Software, 2009, vol. 36, no 3. -
67Y. Robert, F. Vivien.
Algorithmic Issues in Grid Computing, in: Algorithms and Theory of Computation Handbook, Chapman and Hall/CRC Press, 2009. -
68G. Zheng, X. Ni, L. V. Kale.
A scalable double in-memory checkpoint and restart scheme towards exascale, in: Dependable Systems and Networks Workshops (DSN-W), 2012.
http://dx.doi.org/10.1109/DSNW.2012.6264677 -
69D. Zhu, R. Melhem, D. Mossé.
The effects of energy management on reliability in real-time embedded systems, in: Proc. of IEEE/ACM Int. Conf. on Computer-Aided Design (ICCAD), 2004, pp. 35–40.