Bibliography
Publications of the year
Doctoral Dissertations and Habilitation Theses
-
1G. Vaumourin.
Read Only Data Specific Management for an Energy Efficient Memory System, Université de Bordeaux, October 2016.
https://tel.archives-ouvertes.fr/tel-01402354
Invited Conferences
-
2T. Cojean.
The StarPU Runtime System at Exascale ?: Scheduling and Programming over Upcoming Machines, in: RESPA workshop at SC16, Salt Lake City, Utah, United States, November 2016.
https://hal.inria.fr/hal-01410103
International Conferences with Proceedings
-
3E. Agullo, O. Beaumont, L. Eyraud-Dubois, S. Kumar.
Are Static Schedules so Bad ? A Case Study on Cholesky Factorization, in: IEEE International Parallel & Distributed Processing Symposium (IPDPS 2016), Chicago, IL, United States, IEEE, May 2016.
https://hal.inria.fr/hal-01223573 -
4P.-A. Arras, D. Fuin, E. Jeannot, S. Thibault.
DKPN: A Composite Dataflow/Kahn Process Networks Execution Model, in: 24th Euromicro International Conference on Parallel, Distributed and Network-based processing, Heraklion Crete, Greece, February 2016.
https://hal.inria.fr/hal-01234333 -
5O. Beaumont, T. Cojean, L. Eyraud-Dubois, A. Guermouche, S. Kumar.
Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources, in: International Conference on High Performance Computing, Data, and Analytics (HiPC 2016), Hyderabad, India, Proceedings of the IEEE International Conference on High Performance Computing (HiPC 2016), IEEE, December 2016.
https://hal.inria.fr/hal-01361992 -
6A. Cassagne, O. Aumage, C. Leroux, D. Barthou, B. Le Gal.
Energy Consumption Analysis of Software Polar Decoders on Low Power Processors, in: The 2016 European Signal Processing Conference (EUSIPCO 2016), Budapest, Hungary, August 2016.
https://hal.archives-ouvertes.fr/hal-01363975 -
7A. Cassagne, T. Tonnellier, C. Leroux, B. Le Gal, O. Aumage, D. Barthou.
Beyond Gbps Turbo Decoder on Multi-Core CPUs, in: International Symposium on Turbo Codes & Iterative Information Processing, Brest, France, Turbo Codes and Iterative Information Processing, September 2016. [ DOI : 10.1109/ISTC.2016.7593092 ]
https://hal.archives-ouvertes.fr/hal-01363980 -
8T. Cojean, A. Guermouche, A. Hugo, R. Namyst, P.-A. Wacrenier.
Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines, in: HeteroPar'2016 worshop of Euro-Par, Grenoble, France, August 2016.
https://hal.inria.fr/hal-01181135 -
9T. Cojean, A. Guermouche, A.-E. Hugo, R. Namyst, P.-A. Wacrenier.
Resource aggregation in task-based applications over accelerator-based multicore machines, in: HeteroPar'2016 worshop of Euro-Par, Grenoble, France, August 2016.
https://hal.inria.fr/hal-01355385 -
10V. Garcia Pinto, L. Stanisic, A. Legrand, L. Mello Schnorr, S. Thibault, V. Danjean.
Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach, in: 3rd Workshop on Visual Performance Analysis (VPA), Salt Lake City, United States, November 2016, Held in conjunction with SC16.
https://hal.inria.fr/hal-01353962 -
11P. Huchant, M.-C. Counilh, D. Barthou.
Automatic OpenCL Task Adaptation for Heterogeneous Architectures, in: Euro-Par, Grenoble, France, Euro-Par 2016: Parallel Processing, August 2016, pp. 684 - 696. [ DOI : 10.1007/978-3-319-43659-3_50 ]
https://hal.archives-ouvertes.fr/hal-01419366 -
12M. Sergent, D. Goudin, S. Thibault, O. Aumage.
Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System, in: 21st International Workshop on High-Level Parallel Programming Models and Supportive Environments, Chicago, United States, May 2016.
https://hal.inria.fr/hal-01284004
Conferences without Proceedings
-
13O. Aumage, D. Barthou, A. Honorat.
A Stencil DSEL for Single Code Accelerated Computing with SYCL, in: SYCL 2016 1st SYCL Programming Workshop during the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Barcelone, Spain, March 2016.
https://hal.archives-ouvertes.fr/hal-01290099 -
14M. Sergent, D. Goudin, S. Thibault, O. Aumage.
Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System, in: SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP 2016), Paris, France, April 2016, pp. 318 - 327.
https://hal.inria.fr/hal-01380126
Internal Reports
-
15E. Agullo, O. Aumage, B. Bramas, O. Coulaud, S. Pitoiset.
Bridging the gap between OpenMP 4.0 and native runtime systems for the fast multipole method, Inria, March 2016, no RR-8953, 49 p.
https://hal.inria.fr/hal-01372022 -
16E. Agullo, O. Aumage, M. Faverge, N. Furmento, F. Pruvost, M. Sergent, S. Thibault.
Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model, Inria Bordeaux Sud-Ouest ; Bordeaux INP ; CNRS ; Université de Bordeaux ; CEA, June 2016, no RR-8927, 27 p.
https://hal.inria.fr/hal-01332774 -
17E. Agullo, B. Bramas, O. Coulaud, M. Khannouz, L. Stanisic.
Task-based fast multipole method for clusters of multicore processors, Inria Bordeaux Sud-Ouest, October 2016, no RR-8970, 15 p.
https://hal.inria.fr/hal-01387482
Other Publications
-
18O. Beaumont, L. Eyraud-Dubois, S. Kumar.
Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs, October 2016, working paper or preprint.
https://hal.inria.fr/hal-01386174 -
19T. Cojean, A. Guermouche, A. Hugo, R. Namyst, P.-A. Wacrenier.
Resource aggregation for task-based Cholesky Factorization on top of modern architectures, November 2016, This paper is submitted for review to the Parallel Computing special issue for HCW and HeteroPar 16 workshops.
https://hal.inria.fr/hal-01409965