Bibliography

Publications of the year

Doctoral Dissertations and Habilitation Theses

1G. Vaumourin.

Read Only Data Specific Management for an Energy Efficient Memory System, Université de Bordeaux, October 2016.

https://tel.archives-ouvertes.fr/tel-01402354

Invited Conferences

2T. Cojean.

The StarPU Runtime System at Exascale ?: Scheduling and Programming over Upcoming Machines, in: RESPA workshop at SC16, Salt Lake City, Utah, United States, November 2016.

https://hal.inria.fr/hal-01410103

International Conferences with Proceedings

3E. Agullo, O. Beaumont, L. Eyraud-Dubois, S. Kumar.

Are Static Schedules so Bad ? A Case Study on Cholesky Factorization, in: IEEE International Parallel & Distributed Processing Symposium (IPDPS 2016), Chicago, IL, United States, IEEE, May 2016.

https://hal.inria.fr/hal-01223573
4P.-A. Arras, D. Fuin, E. Jeannot, S. Thibault.

DKPN: A Composite Dataflow/Kahn Process Networks Execution Model, in: 24th Euromicro International Conference on Parallel, Distributed and Network-based processing, Heraklion Crete, Greece, February 2016.

https://hal.inria.fr/hal-01234333
5O. Beaumont, T. Cojean, L. Eyraud-Dubois, A. Guermouche, S. Kumar.

Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources, in: International Conference on High Performance Computing, Data, and Analytics (HiPC 2016), Hyderabad, India, Proceedings of the IEEE International Conference on High Performance Computing (HiPC 2016), IEEE, December 2016.

https://hal.inria.fr/hal-01361992
6A. Cassagne, O. Aumage, C. Leroux, D. Barthou, B. Le Gal.

Energy Consumption Analysis of Software Polar Decoders on Low Power Processors, in: The 2016 European Signal Processing Conference (EUSIPCO 2016), Budapest, Hungary, August 2016.

https://hal.archives-ouvertes.fr/hal-01363975
7A. Cassagne, T. Tonnellier, C. Leroux, B. Le Gal, O. Aumage, D. Barthou.

Beyond Gbps Turbo Decoder on Multi-Core CPUs, in: International Symposium on Turbo Codes & Iterative Information Processing, Brest, France, Turbo Codes and Iterative Information Processing, September 2016. [ DOI : 10.1109/ISTC.2016.7593092 ]

https://hal.archives-ouvertes.fr/hal-01363980
8T. Cojean, A. Guermouche, A. Hugo, R. Namyst, P.-A. Wacrenier.

Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines, in: HeteroPar'2016 worshop of Euro-Par, Grenoble, France, August 2016.

https://hal.inria.fr/hal-01181135
9T. Cojean, A. Guermouche, A.-E. Hugo, R. Namyst, P.-A. Wacrenier.

Resource aggregation in task-based applications over accelerator-based multicore machines, in: HeteroPar'2016 worshop of Euro-Par, Grenoble, France, August 2016.

https://hal.inria.fr/hal-01355385
10V. Garcia Pinto, L. Stanisic, A. Legrand, L. Mello Schnorr, S. Thibault, V. Danjean.

Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach, in: 3rd Workshop on Visual Performance Analysis (VPA), Salt Lake City, United States, November 2016, Held in conjunction with SC16.

https://hal.inria.fr/hal-01353962
11P. Huchant, M.-C. Counilh, D. Barthou.

Automatic OpenCL Task Adaptation for Heterogeneous Architectures, in: Euro-Par, Grenoble, France, Euro-Par 2016: Parallel Processing, August 2016, pp. 684 - 696. [ DOI : 10.1007/978-3-319-43659-3_50 ]

https://hal.archives-ouvertes.fr/hal-01419366
12M. Sergent, D. Goudin, S. Thibault, O. Aumage.

Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System, in: 21st International Workshop on High-Level Parallel Programming Models and Supportive Environments, Chicago, United States, May 2016.

https://hal.inria.fr/hal-01284004

Conferences without Proceedings

13O. Aumage, D. Barthou, A. Honorat.

A Stencil DSEL for Single Code Accelerated Computing with SYCL, in: SYCL 2016 1st SYCL Programming Workshop during the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Barcelone, Spain, March 2016.

https://hal.archives-ouvertes.fr/hal-01290099
14M. Sergent, D. Goudin, S. Thibault, O. Aumage.

Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System, in: SIAM Conference on Parallel Processing for Scientific Computing (SIAM PP 2016), Paris, France, April 2016, pp. 318 - 327.

https://hal.inria.fr/hal-01380126

Internal Reports

15E. Agullo, O. Aumage, B. Bramas, O. Coulaud, S. Pitoiset.

Bridging the gap between OpenMP 4.0 and native runtime systems for the fast multipole method, Inria, March 2016, n^o RR-8953, 49 p.

https://hal.inria.fr/hal-01372022
16E. Agullo, O. Aumage, M. Faverge, N. Furmento, F. Pruvost, M. Sergent, S. Thibault.

Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model, Inria Bordeaux Sud-Ouest ; Bordeaux INP ; CNRS ; Université de Bordeaux ; CEA, June 2016, n^o RR-8927, 27 p.

https://hal.inria.fr/hal-01332774
17E. Agullo, B. Bramas, O. Coulaud, M. Khannouz, L. Stanisic.

Task-based fast multipole method for clusters of multicore processors, Inria Bordeaux Sud-Ouest, October 2016, n^o RR-8970, 15 p.

https://hal.inria.fr/hal-01387482

Other Publications

18O. Beaumont, L. Eyraud-Dubois, S. Kumar.

Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs, October 2016, working paper or preprint.

https://hal.inria.fr/hal-01386174
19T. Cojean, A. Guermouche, A. Hugo, R. Namyst, P.-A. Wacrenier.

Resource aggregation for task-based Cholesky Factorization on top of modern architectures, November 2016, This paper is submitted for review to the Parallel Computing special issue for HCW and HeteroPar 16 workshops.

https://hal.inria.fr/hal-01409965

Previous |

Home