Bibliography
Publications of the year
Articles in International Peer-Reviewed Journals
-
1G. Aupy, A. Benoit, F. Dufossé, Y. Robert.
Reclaiming the energy of a schedule: models and algorithms, in: Concurrency and Computation: Practice and Experience, 2013, vol. 25, pp. 1505-1523. [ DOI : 10.1002/cpe.2889 ]
http://hal.inria.fr/hal-00763388 -
2G. Aupy, Y. Robert, F. Vivien, D. Zaidouni.
Checkpointing algorithms and fault prediction, in: Journal of Parallel and Distributed Computing, November 2013. [ DOI : 10.1016/j.jpdc.2013.10.010 ]
http://hal.inria.fr/hal-00908446 -
3M. Baboulin, J. Dongarra, J. Herrmann, S. Tomov.
Accelerating linear system solutions using randomization technique, in: ACM Transactions on Mathematical Software, February 2013, vol. 39, no 2. [ DOI : 10.1145/2427023.2427025 ]
http://hal.inria.fr/hal-00908496 -
4A. Benoit, V. U. Catalyurek, Y. Robert, E. Saule.
A Survey of Pipelined Workflow Scheduling: Models and Algorithms, in: ACM Computing Surveys, 2013, vol. 45, no 4. [ DOI : 10.1145/2501654.2501664 ]
http://hal.inria.fr/hal-00926178 -
5A. Benoit, A. Dobrila, J.-M. Nicod, L. Philippe.
Scheduling linear chain streaming applications on heterogeneous systems with failures, in: Future Generation Computer Systems, 2013, vol. 29, no 5, pp. 1140-1151. [ DOI : 10.1016/j.future.2012.12.015 ]
http://hal.inria.fr/hal-00926146 -
6A. Benoit, F. Dufossé, A. Girault, Y. Robert.
Reliability and performance optimization of pipelined real-time systems, in: Journal of Parallel and Distributed Computing, 2013, vol. 73, no 6, pp. 851-865. [ DOI : 10.1016/j.jpdc.2013.02.009 ]
http://hal.inria.fr/hal-00926123 -
7A. Benoit, M. Gallet, B. Gaujal, Y. Robert.
Computing the throughput of probabilistic and replicated streaming applications, in: Algorithmica, March 2013.
http://hal.inria.fr/hal-00800083 -
8A. Benoit, R. Melhem, P. Renaud-Goud, Y. Robert.
Assessing the performance of energy-aware mappings, in: Parallel Processing Letters, 2013, vol. 23, no 2. [ DOI : 10.1142/S0129626413400033 ]
http://hal.inria.fr/hal-00926105 -
9A. Benoit, Y. Robert, A. Rosenberg, F. Vivien.
Static strategies for worksharing with unrecoverable interruption, in: Theory of Computing Systems, 2013, vol. 53, no 3, pp. 386-423. [ DOI : 10.1007/s00224-012-9426-z ]
http://hal.inria.fr/hal-00763321 -
10G. Bosilca, A. Bouteiller, É. Brunet, F. Cappello, J. Dongarra, A. Guermouche, T. Hérault, Y. Robert, F. Vivien, D. Zaidouni.
Unified Model for Assessing Checkpointing Protocols at Extreme-Scale, in: Concurrency and Computation: Practice and Experience, November 2013. [ DOI : 10.1002/cpe.3173 ]
http://hal.inria.fr/hal-00908447 -
11M. Bougeret, H. Casanova, Y. Robert, F. Vivien, D. Zaidouni.
Using group replication for resilience on exascale systems, in: International Journal of High Performance Computing Applications, October 2013. [ DOI : 10.1177/1094342013505348 ]
http://hal.inria.fr/hal-00881463 -
12H. Casanova, F. Dufossé, Y. Robert, F. Vivien.
Mapping Applications on Volatile Resources, in: International Journal of High Performance Computing Applications, 2013.
http://hal.inria.fr/hal-00923948 -
13J. Dongarra, M. Faverge, T. Hérault, M. Jacquelin, J. Langou, Y. Robert.
Hierarchical QR factorization algorithms for multi-core clusters, in: Parallel Computing, 2013, vol. 39, no 4-5, pp. 212-232. [ DOI : 10.1016/j.parco.2013.01.003 ]
http://hal.inria.fr/hal-00809770 -
14K. Kaya, J. Langguth, F. Manne, B. Uçar.
Push-relabel based algorithms for the maximum transversal problem, in: Computers & Operations Research, 2013, vol. 40, no 5, pp. 1266-1275. [ DOI : 10.1016/j.cor.2012.12.009 ]
http://hal.inria.fr/hal-00763920 -
15K. Kaya, B. Uçar.
Constructing elimination trees for sparse unsymmetric matrices, in: SIAM Journal on Matrix Analysis and Applications, April 2013, vol. 34, no 2, pp. 345-354. [ DOI : 10.1137/110825443 ]
http://hal.inria.fr/inria-00567970 -
16S. Prasad, A. Gupta, K. Kant, A. Lumsdaine, D. Padua, Y. Robert, A. Rosenberg, A. Sussman, C. Weems.
Literacy for all in parallel and distributed computing: guidelines for an undergraduate core curriculum, in: CSI Journal of Computing, 2013, To appear.
http://hal.inria.fr/hal-00764026
International Conferences with Proceedings
-
17P. Amestoy, O. Boiteau, A. Buttari, G. Joslin, J.-Y. L'Excellent, W. M. Sid-Lakhdar, C. Weisbecker, M. Forzan, C. Pozza, V. Pellissier, R. Perrin.
Shared memory parallelism and low-rank approximation techniques applied to direct solvers in FEM simulation (regular paper), in: IEEE International Conference on the Computation of Electromagnetic Fields (COMPUMAG), Budapest, Hungary, 2013.
http://hal.inria.fr/hal-00924660 -
18G. Aupy, A. Benoit, T. Hérault, Y. Robert, J. Dongarra.
Optimal Checkpointing Period: Time vs. Energy, in: Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems, Denver, United States, November 2013.
http://hal.inria.fr/hal-00926199 -
19G. Aupy, A. Benoit, T. Hérault, Y. Robert, F. Vivien, D. Zaidouni.
On the Combination of Silent Error Detection and Checkpointing, in: PRDC - The 19th IEEE Pacific Rim International Symposium on Dependable Computing - 2013, Vancouver, Canada, IEEE, December 2013.
http://hal.inria.fr/hal-00847620 -
20G. Aupy, A. Benoit, R. Melhem, P. Renaud-Goud, Y. Robert.
Energy-aware checkpointing of divisible tasks with soft or hard deadlines, in: IGCC - 4th International Green Computing Conference - 2013, Arlington, United States, February 2013.
http://hal.inria.fr/hal-00857244 -
21G. Aupy, M. Faverge, Y. Robert, J. Kurzak, P. Luszczek, J. Dongarra.
Implementing a systolic algorithm for QR factorization on multicore clusters with PaRSEC, in: PROPER 2013 - 6th Workshop on Productivity and Performance, Aachen, Germany, August 2013.
http://hal.inria.fr/hal-00844492 -
22G. Aupy, Y. Robert, F. Vivien, D. Zaidouni.
Checkpointing strategies with prediction windows, in: PRDC - The 19th IEEE Pacific Rim International Symposium on Dependable Computing - 2013, Vancouver, Canada, IEEE, December 2013.
http://hal.inria.fr/hal-00847622 -
23O. Beaumont, H. Larchevêque, L. Marchal.
Non Linear Divisible Loads: There is No Free Lunch, in: IPDPS 2013, 27th IEEE International Parallel & Distributed Processing Symposium, Boston, United States, IEEE, 2013.
http://hal.inria.fr/hal-00771640 -
24A. Benoit, L.-C. Canon, L. Marchal.
Non-clairvoyant reduction algorithms for heterogeneous platforms, in: HeteroPar'2013, in conjunction with Euro-Par 2013, Aachen, Germany, 2013.
http://hal.inria.fr/hal-00926093 -
25A. Benoit, J. Langguth, B. Uçar.
Semi-matching algorithms for scheduling parallel tasks under resource constraints, in: IEEE International Symposium on Parallel & Distributed Processing, Workshops and PhD Forum, Cambridge, MA, United States, IEEE Computer Society, 2013, pp. 1744-1753. [ DOI : 10.1109/IPDPSW.2013.30 ]
http://hal.inria.fr/hal-00738393 -
26A. Bouteiller, F. Cappello, J. Dongarra, A. Guermouche, T. Hérault, Y. Robert.
Multi-criteria checkpointing strategies: response-time versus resource utilization, in: Euro-Par 2013, Aachen, Germany, Springer Verlag, LNCS, 2013, vol. 8097, pp. 420-431. [ DOI : 10.1007/978-3-642-40047-6_43 ]
http://hal.inria.fr/hal-00926606 -
27H. Casanova, F. Dufossé, Y. Robert, F. Vivien.
Mapping tightly-coupled applications on volatile resources, in: PDP'2013, the 21st Euromicro Int. Conf. on Parallel, Distributed, and Network-Based Processing, Belfast, United Kingdom, IEEE Computer Society Press, 2013.
http://hal.inria.fr/hal-00763376 -
28H. Casanova, F. Dufossé, Y. Robert, F. Vivien.
Scheduling Tightly-Coupled Applications on Heterogeneous Desktop Grids, in: HCW 2013 - 22nd International Heterogeneity in Computing Workshop, Boston, United States, May 2013.
http://hal.inria.fr/hal-00788606 -
29H. Casanova, L. Lim, Y. Robert, F. Vivien, D. Zaidouni.
Cost-Optimal Execution of Boolean Query Trees with Shared Streams, in: 28th IEEE International Parallel & Distributed Processing Symposium, Phoenix, United States, IEEE, May 2014.
http://hal.inria.fr/hal-00923953 -
30M. Deveci, K. Kaya, B. Uçar, V. U. Catalyurek.
A Push-Relabel-Based Maximum Cardinality Bipartite Matching Algorithm on GPUs, in: 42nd International Conference on Parallel Processing, Lyon, France, IEEE Computer Society, 2013, pp. 21-29. [ DOI : 10.1109/ICPP.2013.11 ]
http://hal.inria.fr/hal-00923464 -
31M. Deveci, K. Kaya, B. Uçar, V. U. Catalyurek.
GPU accelerated maximum cardinality matching algorithms for bipartite graphs, in: Euro-Par 2013, Aachen, Germany, F. Wolf, B. Mohr, D. an Mey (editors), Springer, August 2013, pp. 850-861. [ DOI : 10.1007/978-3-642-40047-6_84 ]
http://hal.inria.fr/hal-00923449 -
32S. Di, Y. Robert, F. Vivien, D. Kondo, C.-L. Wang, F. Cappello.
Optimization of Cloud Task Processing with Checkpoint-Restart Mechanism, in: SC13 - Supercomputing - 2013, Denver, United States, ACM, November 2013. [ DOI : 10.1145/2503210.2503217 ]
http://hal.inria.fr/hal-00847635 -
33J. Dongarra, T. Hérault, Y. Robert.
Revisiting the double checkpointing algorithm, in: APDCM 2013, Boston, United States, IEEE, 2013.
http://hal.inria.fr/hal-00925168 -
34M. Faverge, J. Herrmann, J. Langou, B. Lowery, Y. Robert, J. Dongarra.
Designing LU-QR hybrid solvers for performance and stability, in: IEEE International Parallel & Distributed Processing Symposium, Phoenix, United States, December 2013.
http://hal.inria.fr/hal-00930238 -
35J. Herrmann, L. Marchal, Y. Robert.
Model and complexity results for tree traversals on hybrid platforms, in: HeteroPar - International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms, Aachen, Germany, August 2013.
http://hal.inria.fr/hal-00926502 -
36K. Kaya, B. Uçar, V. U. Catalyurek.
Analysis of Partitioning Models and Metrics in Parallel Sparse Matrix-Vector Multiplication, in: 10th PPAM - Parallel Processing and Applied Mathematics, Warsaw, Poland, Springer, 2014, to appear.
http://hal.inria.fr/hal-00923454 -
37L. Marchal, O. Sinnen, F. Vivien.
Scheduling tree-shaped task graphs to minimize memory and makespan, in: IPDPS 2013 - 27th IEEE International Parallel & Distributed Processing Symposium, Boston, United States, May 2013.
http://hal.inria.fr/hal-00788612 -
38C. Weisbecker, P. R. Amestoy, O. Boiteau, R. Brossier, A. Buttari, J.-Y. L'Excellent, S. Operto, J. Virieux.
3D frequency-domain seismic modeling with a Block Low-Rank algebraic multifrontal direct solver, in: SEG Technical Program Expanded Abstracts, SEG annual meeting, Houston, Texas, United States, 2013. [ DOI : 10.1190/segam2013-0603.1 ]
http://hal.inria.fr/hal-00924638 -
39I. Yamazaki, X. S. Li, F.-H. Rouet, B. Uçar.
On Partitioning and Reordering Problems in a Hierarchically Parallel Hybrid Linear Solver, in: 2013 IEEE 27th International Parallel and Distributed Processing Symposium Workshops & PhD Forum (IPDPSW), Cambridge, MA, United States, IEEE Computer Society, May 2013.
http://hal.inria.fr/hal-00923447
Scientific Books (or Scientific Book chapters)
-
40A. Benoit, Y. Robert, F. Vivien.
A Guide to Algorithm Design: Paradigms, Methods, and Complexity Analysis, Applied Algorithms and Data Structures series, Chapman & Hall/CRC, August 2013, 380 p.
http://hal.inria.fr/hal-00908448 -
41A. Benoit, L. Marchal, Y. Robert, B. Uçar, F. Vivien.
Scheduling for Large-Scale Systems, in: The Computing Handbook Set, vol. 1, T. Gonzalez, J. L. Díaz Herrera (editors), Chapman and Hall/CRC Press, 2013, To appear.
http://hal.inria.fr/hal-00763372 -
42V. U. Catalyurek, M. Deveci, K. Kaya, B. Uçar.
UMPA: A Multi-objective, multi-level partitioner for communication minimization, in: Graph Partitioning and Graph Clustering 2012, D. A. Bader, H. Meyerhenke, P. Sanders, D. Wagner (editors), Contemporary Mathematics, AMS, 2013, vol. 588, pp. 53-66. [ DOI : 10.1090/conm/588/11704 ]
http://hal.inria.fr/hal-00763563 -
43V. U. Catalyurek, K. Kaya, J. Langguth, B. Uçar.
A Partitioning-based divisive clustering technique for maximizing the modularity, in: Graph Partitioning and Graph Clustering 2012, D. A. Bader, H. Meyerhenke, P. Sanders, D. Wagner (editors), Contemporary Mathematics, AMS, 2013, vol. 588, pp. 171-186. [ DOI : 10.1090/conm/588/11712 ]
http://hal.inria.fr/hal-00763559
Internal Reports
-
44P. R. Amestoy, C. Ashcraft, O. Boiteau, A. Buttari, J.-Y. L'Excellent, C. Weisbecker.
Improving multifrontal methods by means of block low-rank representations, Inria, January 2013, no RR-8199, Submitted for publication to SIAM.
http://hal.inria.fr/hal-00776859 -
45G. Aupy, A. Benoit, T. Hérault, Y. Robert, J. Dongarra.
Optimal Checkpointing Period: Time vs. Energy, Inria, October 2013, no RR-8387, 19 p.
http://hal.inria.fr/hal-00878938 -
46G. Aupy, A. Benoit, T. Hérault, Y. Robert, F. Vivien, D. Zaidouni.
On the Combination of Silent Error Detection and Checkpointing, Inria, June 2013, no RR-8319.
http://hal.inria.fr/hal-00836871 -
47G. Aupy, A. Benoit, R. Melhem, P. Renaud-Goud, Y. Robert.
Energy-aware checkpointing of divisible tasks with soft or hard deadlines, Inria, February 2013, no RR-8238, 33 p.
http://hal.inria.fr/hal-00788641 -
48G. Aupy, M. Faverge, Y. Robert, J. Kurzak, P. Luszczek, J. Dongarra.
Implementing a Systolic Algorithm for QR Factorization on Multicore Clusters with PaRSEC, Inria, November 2013, no RR-8390, 16 p, Published in ProPer'13.
http://hal.inria.fr/hal-00879248 -
49G. Aupy, Y. Robert, F. Vivien, D. Zaidouni.
Checkpointing algorithms and fault prediction, Inria, February 2013, no RR-8237, Accepted to be published in JPDC.
http://hal.inria.fr/hal-00788313 -
50G. Aupy, Y. Robert, F. Vivien, D. Zaidouni.
Checkpointing strategies with prediction windows, Inria, February 2013, no RR-8239, 44 p.
http://hal.inria.fr/hal-00789109 -
51G. Aupy, Y. Robert, F. Vivien, D. Zaidouni.
Comments on “Improving the computing efficiency of HPC systems using a combination of proactive and preventive checkpoint”, Inria, June 2013, no RR-8318.
http://hal.inria.fr/hal-00836629 -
52G. Aupy, M. Shantharam, A. Benoit, Y. Robert, P. Raghavan.
Co-Scheduling Algorithms for High-Throughput Workload Execution, Inria, April 2013, no RR-8293, 21 p.
http://hal.inria.fr/hal-00819036 -
53A. Benoit, L.-C. Canon, L. Marchal.
Non-clairvoyant reduction algorithms for heterogeneous platforms, Inria, June 2013, no RR-8315.
http://hal.inria.fr/hal-00832102 -
54H. Casanova, L. Lim, Y. Robert, F. Vivien, D. Zaidouni.
Cost-Optimal Execution of Trees of Boolean Operators with Shared Streams, Inria, October 2013, no RR-8373, 39 p.
http://hal.inria.fr/hal-00869340 -
55V. U. Catalyurek, K. Kaya, B. Uçar.
On analysis of partitioning models and metrics in parallel sparse matrix-vector multiplication, Inria, May 2013, no RR-8301, 25 p.
http://hal.inria.fr/hal-00821523 -
56F. Dufossé, K. Kaya, B. Uçar.
Randomized matching heuristics with quality guarantees on shared memory parallel computers, Inria, October 2013, no RR-8386 ; Rapport LAAS n°13578, 28 p.
http://hal.inria.fr/hal-00877211 -
57J. Herrmann, L. Marchal, Y. Robert.
Tree traversals with task-memory affinities, Inria, February 2013, no RR-8226, 31 p.
http://hal.inria.fr/hal-00787753 -
58J. Herrmann, L. Marchal, Y. Robert.
Memory-aware list scheduling for hybrid platforms, Inria, February 2014, no RR-8461, 30 p.
http://hal.inria.fr/hal-00944336 -
59O. Kaya, E. Kayaaslan, B. Uçar, I. S. Duff.
Fill-in reduction in sparse matrix factorizations using hypergraphs, Inria, January 2014, no RR-8448.
http://hal.inria.fr/hal-00932882 -
60O. Kaya, E. Kayaaslan, B. Uçar.
On the minimum edge cover and vertex partition by quasi-cliques problems, Inria, February 2013, no RR-8255.
http://hal.inria.fr/hal-00795429 -
61J.-Y. L'Excellent, M. W. Sid-Lakhdar.
Introduction of shared-memory parallelism in a distributed-memory multifrontal solver, Inria, February 2013, no RR-8227, 35 p.
http://hal.inria.fr/hal-00786055
References in notes
-
62Blue Waters Newsletter, December 2012.
http://cgi.ncsa.illinois.edu/BlueWaters/pdfs/bw-newsletter-1212.pdf -
63Blue Waters Resources, 2013.
https://bluewaters.ncsa.illinois.edu/data -
64The BOINC project, 2013.
http://boinc.berkeley.edu/ -
65Final report of the Department of Energy Fault Management Workshop, December 2012.
http://science.energy.gov/~/media/ascr/pdf/program-documents/docs/FaultManagement-wrkshpRpt-v4-final.pdf -
66System Resilience at Extreme Scale: white paper, 2008, DARPA.
http://institute.lanl.gov/resilience/docs/IBM%20Mootaz%20White%20Paper%20System%20Resilience.pdf -
67Top500 List - November 2011, 2011.
http://www.top500.org/list/2011/11/ -
68Top500 List - November 2012, 2012.
http://www.top500.org/list/2012/11/ -
69I. Assayad, A. Girault, H. Kalla.
Tradeoff exploration between reliability power consumption and execution time, in: Proceedings of SAFECOMP, the Conf. on Computer Safety, Reliability and Security, Washington, DC, USA, 2011. -
70H. Aydin, Q. Yang.
Energy-aware partitioning for multiprocessor real-time systems, in: IPDPS'03, the IEEE Int. Parallel and Distributed Processing Symposium, 2003, pp. 113–121. -
71N. Bansal, T. Kimbrel, K. Pruhs.
Speed Scaling to Manage Energy and Temperature, in: Journal of the ACM, 2007, vol. 54, no 1, pp. 1-39.
http://doi.acm.org/10.1145/1206035.1206038 -
72A. Benoit, L. Marchal, J.-F. Pineau, Y. Robert, F. Vivien.
Scheduling concurrent bag-of-tasks applications on heterogeneous platforms, in: IEEE Transactions on Computers, 2010, vol. 59, no 2, pp. 202-217. -
73L. S. Blackford, J. Choi, A. Cleary, E. D'Azevedo, J. Demmel, I. Dhillon, J. Dongarra, S. Hammarling, G. Henry, A. Petitet, K. Stanley, D. Walker, R. C. Whaley.
ScaLAPACK Users' Guide, SIAM, 1997. -
74S. Blackford, J. Dongarra.
Installation Guide for LAPACK, LAPACK Working Note, June 1999, no 41, originally released March 1992. -
75A. Buttari, J. Langou, J. Kurzak, J. Dongarra.
Parallel tiled QR factorization for multicore architectures, in: Concurrency: Practice and Experience, 2008, vol. 20, no 13, pp. 1573-1590. -
76J.-J. Chen, T.-W. Kuo.
Multiprocessor energy-efficient scheduling for real-time tasks, in: ICPP'05, the Int. Conference on Parallel Processing, 2005, pp. 13–20. -
77S. Donfack, L. Grigori, W. Gropp, L. V. Kale.
Hybrid Static/dynamic Scheduling for Already Optimized Dense Matrix Factorization, in: Parallel Distributed Processing Symposium (IPDPS), 2012 IEEE 26th International, 2012, pp. 496-507.
http://dx.doi.org/10.1109/IPDPS.2012.53 -
78J. Dongarra, J.-F. Pineau, Y. Robert, Z. Shi, F. Vivien.
Revisiting Matrix Product on Master-Worker Platforms, in: International Journal of Foundations of Computer Science, 2008, vol. 19, no 6, pp. 1317-1336. -
79J. Dongarra, J.-F. Pineau, Y. Robert, F. Vivien.
Matrix Product on Heterogeneous Master-Worker Platforms, in: 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Salt Lake City, Utah, February 2008, pp. 53–62. -
80I. S. Duff, J. K. Reid.
The multifrontal solution of indefinite sparse symmetric linear systems, in: ACM Transactions on Mathematical Software, 1983, vol. 9, pp. 302-325. -
81I. S. Duff, J. K. Reid.
The multifrontal solution of unsymmetric sets of linear systems, in: SIAM Journal on Scientific and Statistical Computing, 1984, vol. 5, pp. 633-641. -
82S. C. Eisenstat, J. W. H. Liu.
The theory of elimination trees for sparse unsymmetric matrices, in: SIAM Journal on Matrix Analysis and Applications, 2005, vol. 26, no 3, pp. 686–705. -
83S. C. Eisenstat, J. W. H. Liu.
Algorithmic aspects of elimination trees for sparse unsymmetric matrices, in: SIAM Journal on Matrix Analysis and Applications, 2008, vol. 29, no 4, pp. 1363–1381. -
84L. Grigori, J. W. Demmel, H. Xiang.
Communication avoiding Gaussian elimination, in: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, Piscataway, NJ, USA, SC '08, IEEE Press, 2008, 29:1 p.
http://dl.acm.org/citation.cfm?id=1413370.1413400 -
85B. Hadri, H. Ltaief, E. Agullo, J. Dongarra.
Tile QR Factorization with Parallel Panel Processing for Multicore Architectures, in: IPDPS'10, the 24th IEEE Int. Parallel and Distributed Processing Symposium, 2010. -
86J. W. H. Liu.
The multifrontal method for sparse matrix solution: Theory and Practice, in: SIAM Review, 1992, vol. 34, pp. 82–109. -
87R. Melhem, D. Mossé, E. Elnozahy.
The Interplay of Power Management and Fault Recovery in Real-Time Systems, in: IEEE Transactions on Computers, 2004, vol. 53, no 2, pp. 217-231. -
88A. J. Oliner, R. K. Sahoo, J. E. Moreira, M. Gupta, A. Sivasubramaniam.
Fault-aware job scheduling for BlueGene/L systems, in: IPDPS'04, the IEEE Int. Parallel and Distributed Processing Symposium, 2004, pp. 64–73. -
89G. Quintana-Ortí, E. Quintana-Ortí, R. A. van de Geijn, F. G. V. Zee, E. Chan.
Programming Matrix Algorithms-by-Blocks for Thread-Level Parallelism, in: ACM Transactions on Mathematical Software, 2009, vol. 36, no 3. -
90Y. Robert, F. Vivien.
Algorithmic Issues in Grid Computing, in: Algorithms and Theory of Computation Handbook, Chapman and Hall/CRC Press, 2009. -
91G. Zheng, X. Ni, L. V. Kale.
A scalable double in-memory checkpoint and restart scheme towards exascale, in: Dependable Systems and Networks Workshops (DSN-W), 2012.
http://dx.doi.org/10.1109/DSNW.2012.6264677 -
92D. Zhu, R. Melhem, D. Mossé.
The effects of energy management on reliability in real-time embedded systems, in: Proc. of IEEE/ACM Int. Conf. on Computer-Aided Design (ICCAD), 2004, pp. 35–40.