EN FR
EN FR


Bibliography

Major publications by the team in recent years
  • 1J. C. Beyler, P. Clauss.

    Performance driven data cache prefetching in a dynamic software optimization system, in: ICS '07: Proceedings of the 21st annual international conference on Supercomputing, New York, NY, USA, ACM, 2007, p. 202–209.

    http://doi.acm.org/10.1145/1274971.1275000
  • 2J. C. Beyler, M. Klemm, P. Clauss, M. Philippsen.

    A meta-predictor framework for prefetching in object-based DSMs, in: Concurr. Comput. : Pract. Exper., September 2009, vol. 21, p. 1789–1803.

    http://dx.doi.org/10.1002/cpe.v21:14
  • 3P. Clauss.

    Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: applications to analyze and transform scientific programs, in: ICS '96: Proceedings of the 10th international conference on Supercomputing, New York, NY, USA, ACM, 1996, p. 278–285.

    http://doi.acm.org/10.1145/237578.237617
  • 4P. Clauss, F. J. Fernández, D. Garbervetsky, S. Verdoolaege.

    Symbolic polynomial maximization over convex sets and its application to memory requirement estimation, in: IEEE Transactions on Very Large Scale Integration (VLSI) Systems, Aug 2009, vol. 17, no 8, p. 983-996.

    http://hal.inria.fr/inria-00504617
  • 5P. Clauss, V. Loechner.

    Parametric Analysis of Polyhedral Iteration Spaces, in: J. VLSI Signal Process. Syst., 1998, vol. 19, no 2, p. 179–194.

    http://dx.doi.org/10.1023/A:1008069920230
  • 6P. Clauss, I. Tchoupaeva.

    A Symbolic Approach to Bernstein Expansion for Program Analysis and Optimization, LNCS, Springer, April 2004, vol. 2985, p. 120-133.
  • 7A. Ketterlin, P. Clauss.

    Prediction and trace compression of data access addresses through nested loop recognition, in: 6th annual IEEE/ACM international symposium on Code generation and optimization, États-Unis Boston, ACM, April 2008, p. 94-103.

    http://dx.doi.org/10.1145/1356058.1356071, http://hal.inria.fr/inria-00504597/en
  • 8V. Loechner, B. Meister, P. Clauss.

    Precise data locality optimization of nested loops, in: Journal of Supercomputing, January 2002, vol. 21, no 1, p. 37–76, Kluwer Academic Pub..
  • 9V. Loechner, D. K. Wilde.

    Parameterized Polyhedra and their Vertices, in: International Journal of Parallel Programming, December 1997, vol. 25, no 6.
  • 10S. Verdoolaege, R. Seghir, K. Beyls, V. Loechner, M. Bruynooghe.

    Counting Integer Points in Parametric Polytopes Using Barvinok's Rational Functions, in: Algorithmica, 2007, vol. 48, no 1, p. 37–66.

    http://dx.doi.org/10.1007/s00453-006-1231-0
  • 11É. Violard.

    A Semantic Framework to Address Data Locality in Data Parallel Languages, in: Parallel Computing, 2004, vol. 30, no 1, p. 139-161.
Publications of the year

Doctoral Dissertations and Habilitation Theses

  • 12B. Pradelle.

    Static and Dynamic Polyhedral Compilation Methods for the Execution in Multi-core Environments, Université de Strasbourg, December 2011.

Articles in International Peer-Reviewed Journal

  • 13R. Seghir, V. Loechner, B. Meister.

    Integer Affine Transformations of Parametric Z-polytopes and Applications to Loop Nest Optimization, in: ACM Transactions on Architecture and Code Optimization, 2011, to appear in 2011..

    http://hal.inria.fr/inria-00582388/en

International Conferences with Proceedings

  • 14J.-D. Genevaux, J. Narboux, P. Schreck.

    Formalization of Wu's simple method in Coq, in: CPP 2011 First International Conference on Certified Programs and Proofs, Kenting, Taiwan, Province Of China, J.-P. Jouannaud, Z. Shao (editors), Lecture Notes in Computer Science, Springer-Verlag, December 2011.

    http://hal.inria.fr/inria-00618745/en
  • 15A. Jimborean, M. Herrmann, V. Loechner, P. Clauss.

    VMAD: a Virtual Machine for Advanced Dynamic Analysis of Programs, in: IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS, Austin, United States, IEEE, April 2011.

    http://hal.inria.fr/inria-00544501/en
  • 16A. Jimborean, V. Loechner, P. Clauss.

    Handling Multi-Versioning in LLVM: Code Tracking and Cloning, in: WIR 2011: Workshop on Intermediate Representations, in conjunction with CGO 2011, Chamonix, France, Florent Bouchez, Sebastian Hack, Eelco Visser, April 2011.

    http://hal.inria.fr/inria-00572785/en
  • 17A. Ketterlin, P. Clauss.

    Efficient Memory Tracing by Program Skeletonization, in: IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS, Austin, United States, IEEE, April 2011.

    http://hal.inria.fr/inria-00544497/en
  • 18T. M. Pham, Y. Bertot, J. Narboux.

    A Coq-based Library for Interactive and Automated Theorem Proving in Plane Geometry, in: The 11th International Conference on Computational Science and Its Applications (ICCSA 2011), Santander, Spain, Lecture Notes in Computer Science, Springer-Verlag, 2011, vol. 6785, p. 368-383. [ DOI : 10.1007/978-3-642-21898-9_32 ]

    http://hal.inria.fr/inria-00584918/en
  • 19B. Pradelle, P. Clauss, V. Loechner.

    Adaptive Runtime Selection of Parallel Schedules in the Polytope Model, in: High Performance Computing Symposium, Boston, États-Unis, ACM/SIGSIM, April 2011.

    http://hal.inria.fr/inria-00564311/en/
  • 20B. Pradelle, A. Ketterlin, P. Clauss.

    Transparent Parallelization of Binary Code, in: First International Workshop on Polyhedral Compilation Techniques, IMPACT 2011, in conjunction with CGO 2011, Chamonix, France, Christophe Alias, Cédric Bastoul, April 2011.

    http://hal.inria.fr/inria-00572797/en

Scientific Books (or Scientific Book chapters)

  • 21P. Clauss, D. Garbervetsky, V. Loechner, S. Verdoolaege.

    Polyhedral Techniques for Parametric Memory Requirement Estimation, in: Energy-Aware Memory Management for Embedded Multimedia Systems: A Computer-Aided Approach, F. Balasa, D. Pradhan (editors), Taylor & Francis, 2011.
References in notes
  • 22F. Agakov, E. Bonilla, J. Cavazos, B. Franke, G. Fursin, M. F. P. O'Boyle, J. Thomson, M. Toussaint, C. K. I. Williams.

    Using Machine Learning to Focus Iterative Optimization, in: CGO '06: Proceedings of the International Symposium on Code Generation and Optimization, Washington, DC, USA, IEEE Computer Society, 2006, p. 295–305.

    http://dx.doi.org/10.1109/CGO.2006.37
  • 23R. Back.

    On the Correctness of Refinement Steps in Program Development, University of Helsinki, 1978.
  • 24J.-P. Banâtre, D. Le Métayer.

    The Gamma Model and its Discipline of Programming, in: Science of Computer Programming, 1990, vol. 15, no 1, p. 55-79.
  • 25B. Barras, S. Boutin, C. Cornes, J. Courant, J.-C. Filliatre, E. Gimenez, H. Herbelin, G. Huet, C. Munoz, C. Murthy, C. Parent, C. Paulin-Mohring, A. Saibi, B. Werner.

    The Coq Proof Assistant Reference Manual : Version 6.1, 1997.
  • 26M. M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, P. Sadayappan.

    A compiler framework for optimization of affine loop nests for GPGPUs, in: ICS '08: Proceedings of the 22nd annual international conference on Supercomputing, New York, NY, USA, ACM, 2008, p. 225–234.

    http://doi.acm.org/10.1145/1375527.1375562
  • 27C. Bastoul.

    Code Generation in the Polyhedral Model Is Easier Than You Think, in: PACT'13 IEEE International Conference on Parallel Architecture and Compilation Techniques, Juan-les-Pins, France, 2004, p. 7–16.

    http://hal.ccsd.cnrs.fr/ccsd-00017260
  • 28Y. Bertot, B. Grégoire, X. Leroy.

    A Structured Approach to Proving Compiler Optimizations Based on Dataflow Analysis, in: TYPES 2004, 2004, p. 66-81.
  • 29L. Bougé, Y. Le Guyadec, G. Utard, B. Virot.

    A Proof System for a Simple Data-Parallel Programming Language, in: IFIP WG 10.3, Applications in Parallel and Distributed Computing, Caracas (Venezuela), North-Holland, April 1994.
  • 30M. Bridges, N. Vachharajani, Y. Zhang, T. Jablin, D. I. August.

    Revisiting the Sequential Programming Model for Multi-Core, in: MICRO '07: Proceedings of the 40th Annual IEEE/ACM International Symposium on Microarchitecture, Washington, DC, USA, IEEE Computer Society, 2007, p. 69–84.

    http://dx.doi.org/10.1109/MICRO.2007.35
  • 31M. Burtscher, I. Ganusov, S. J. Jackson, J. Ke, P. Ratanaworabhan, N. B. Sam.

    The VPC Trace-Compression Algorithms, in: IEEE Trans. Comput., 2005, vol. 54, no 11, p. 1329–1344.
  • 32D. Cachera, D. Pichardie.

    Embedding of Systems of Affine Recurrence Equations in Coq, in: Proc. of 16th International Conference on Theorem Proving in Higher Order Logics (TPHOLs'03), Lecture Notes in Computer Science, Springer-Verlag, 2003, no 2758, p. 155–170.
  • 33K. Chandy, J. Misra.

    Parallel Program Design: A Foundation, Addison Wesley, 1988.
  • 34M. Cintra, D. R. Llanos.

    Design Space Exploration of a Software Speculative Parallelization Scheme, in: IEEE Trans. Parallel Distrib. Syst., 2005, vol. 16, no 6, p. 562–576.

    http://dx.doi.org/10.1109/tpds.2005.69
  • 35P. Feautrier.

    Dataflow analysis of scalar and array references, in: International Journal of Parallel Programming, 1991, vol. 20, no 1, p. 23–53.
  • 36P. Feautrier.

    Some efficient solutions to the affine scheduling problem, Part 1 : one dimensional time, in: International Journal of Parallel Programming, 1992, vol. 21, no 5, p. 313–348.
  • 37P. Feautrier.

    Some efficient solutions to the affine scheduling problem, Part 2 : multidimensional time, in: International Journal of Parallel Programming, 1992, vol. 21, no 6.
  • 38P. Feautrier.

    Automatic Parallelization in the Polytope Model, in: The Data Parallel Programming Model: Foundations, HPF Realization, and Scientific Applications, Springer-Verlag, 1996, p. 79–103.
  • 39X. Feng, Z. Shao, Y. Dong, Y. Guo.

    Certifying low-level programs with hardware interrupts and preemptive threads, in: SIGPLAN Not., 2008, vol. 43, no 6, p. 170–182.

    http://dx.doi.org/10.1145/1379022.1375603
  • 40C. Flanagan, S. N. Freund, J. Yi.

    Velodrome: a sound and complete dynamic atomicity checker for multithreaded programs, in: PLDI '08: Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation, New York, NY, USA, ACM, 2008, p. 293–303.

    http://dx.doi.org/10.1145/1375581.1375618
  • 41P. Gerner.

    La sémantique des directives au compilateur : application au parallélisme de données, Université Louis Pasteur, 2002.
  • 42P. Gerner, É. Violard.

    A Theoretical Framework of Data Parallelism and Its Operational Semantics, in: Euro-Par 2000, LNCS, Springer, 2001, vol. 1900, p. 668–677.
  • 43E. P. Gribomont.

    Stepwise refinement and concurrency: the finite-state case, in: Sci. Comput. Program., 1990, vol. 14, no 2-3, p. 185–228.

    http://dx.doi.org/10.1016/0167-6423(90)90020-E
  • 44M. Hall, D. Padua, K. Pingali.

    Compiler research: the next 50 years, in: Commun. ACM, 2009, vol. 52, no 2, p. 60–67.

    http://doi.acm.org/10.1145/1461928.1461946
  • 45A. Hobor, A. W. Appel, F. Z. Nardelli.

    Oracle Semantics for Concurrent Separation Logic, in: ESOP, 2008, p. 353-367.
  • 46A. Jimborean, M. Herrmann, V. Loechner, P. Clauss.

    A Static-Dynamic Collaborative Framework for Nested Loops Instrumentation and Profiling, Université de Strasbourg, 05 2010.

    http://hal.inria.fr/inria-00534745/en/
  • 47A. Jimborean, M. Herrmann, V. Loechner, P. Clauss.

    VMAD: a Virtual Machine for Advanced Dynamic Analysis of Programs, Université de Strasbourg, 09 2010.

    http://hal.inria.fr/inria-00534748/en/
  • 48A. Ketterlin, P. Clauss.

    Recovering the Memory Behavior of Executable Programs, in: 10th IEEE Working Conference on Source Code Analysis and Manipulation, SCAM, Roumanie Timisoara, IEEE Computer Society Press, Sep 2010.

    http://hal.inria.fr/inria-00502813
  • 49M. Kulkarni, K. Pingali, B. Walter, G. Ramanarayanan, K. Bala, L. P. Chew.

    Optimistic parallelism requires abstractions, in: PLDI '07: Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation, New York, NY, USA, ACM, 2007, p. 211–222.

    http://doi.acm.org/10.1145/1250734.1250759
  • 50J. Larus, C. Kozyrakis.

    Transactional memory, in: Commun. ACM, 2008, vol. 51, no 7, p. 80–88.
  • 51E. A. Lee.

    The Problem with Threads, in: Computer, 2006, vol. 39, no 5, p. 33–42.

    http://dx.doi.org/10.1109/MC.2006.180
  • 52C. Lengauer.

    Loop Parallelization in the Polytope Model, in: Parallel Processing Letters, 1994, vol. 4, no 3.
  • 53X. Leroy.

    Formal verification of a realistic compiler, in: Communications of the ACM, July 2009, To appear.
  • 54X. Leroy.

    The Compcert verified compiler, software and commented proof, January 2010.

    http://compcert.inria.fr
  • 55S.-W. Liao, A. Diwan, R. P. Bosch, A. Ghuloum, M. S. Lam.

    SUIF Explorer: an interactive and interprocedural parallelizer, in: PPoPP '99: Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming, New York, NY, USA, ACM, 1999, p. 37–48.

    http://doi.acm.org/10.1145/301104.301108
  • 56V. Loechner, B. Meister, P. Clauss.

    Data Sequence Locality: a Generalization of Temporal Locality, in: Euro-Par 2001, Manchester, UK, Springer, 2001.
  • 57V. Loechner, C. Mongenet.

    Communication Optimization for Affine Recurrence Equations using Broadcast and Locality, in: International Journal of Parallel Programming, 2000, vol. 28, no 1.
  • 58C.-K. Luk, R. Cohn, R. Muth, H. Patil, A. Klauser, G. Lowney, S. Wallace, V. J. Reddi, K. Hazelwood.

    Pin: building customized program analysis tools with dynamic instrumentation, in: PLDI '05: Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation, New York, NY, USA, ACM, 2005, p. 190–200.

    http://doi.acm.org/10.1145/1065010.1065034
  • 59K. F. Moore, D. Grossman.

    High-level small-step operational semantics for transactions, in: POPL '08: Proceedings of the 35th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages, New York, NY, USA, ACM, 2008, p. 51–62.

    http://dx.doi.org/10.1145/1328438.1328448
  • 60T. Moseley, D. A. Connors, D. Grunwald, R. Peri.

    Identifying potential parallelism via loop-centric profiling, in: CF '07: Proceedings of the 4th international conference on Computing frontiers, ACM, 2007, p. 143–152.
  • 61G. C. Necula.

    Translation validation for an optimizing compiler, in: SIGPLAN Not., 2000, vol. 35, no 5, p. 83–94.

    http://doi.acm.org/10.1145/358438.349314
  • 62A. Pnueli, O. Shtrichman, M. Siegel.

    The Code Validation Tool (CVT) - Automatic verification of code generated from synchronous languages, in: Software Tools for Technology Transfer, 1998, vol. 2.
  • 63L.-N. Pouchet, C. Bastoul, A. Cohen, J. Cavazos.

    Iterative optimization in the polyhedral model: part II, multidimensional time, in: PLDI '08: Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation, New York, NY, USA, ACM, 2008, p. 90–100.

    http://doi.acm.org/10.1145/1375581.1375594
  • 64L.-N. Pouchet, C. Bastoul, A. Cohen, N. Vasilache.

    Iterative Optimization in the Polyhedral Model: part I, One-Dimensional Time, in: CGO '07: Proceedings of the International Symposium on Code Generation and Optimization, Washington, DC, USA, IEEE Computer Society, 2007, p. 144–156.

    http://dx.doi.org/10.1109/CGO.2007.21
  • 65G. D. Price, J. Giacomoni, M. Vachharajani.

    Visualizing potential parallelism in sequential programs, in: PACT '08: Proceedings of the 17th international conference on Parallel architectures and compilation techniques, New York, NY, USA, ACM, 2008, p. 82–90.

    http://doi.acm.org/10.1145/1454115.1454129
  • 66E. Raman, N. Vachharajani, R. Rangan, D. I. August.

    Spice: speculative parallel iteration chunk execution, in: CGO '08: Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization, New York, NY, USA, ACM, 2008, p. 175–184.

    http://doi.acm.org/10.1145/1356058.1356082
  • 67L. Rauchwerger, D. Padua.

    The LRPD Test: Speculative Run-Time Parallelization of Loops with Privatization and Reduction Parallelization, in: IEEE Trans. Parallel Distrib. Syst., 1999, vol. 10, no 2, p. 160–180.

    http://dx.doi.org/10.1109/71.752782
  • 68R. Seghir.

    Méthodes de dénombrement de points entiers de polyèdres et applications à l'optimisation de programmes, Université de Strasbourg, December 2006.
  • 69T. Sherwood, E. Perelman, G. Hamerly, B. Calder.

    Automatically characterizing large scale program behavior, in: ASPLOS-X: Proceedings of the 10th international conference on Architectural support for programming languages and operating systems, New York, NY, USA, ACM, 2002, p. 45–57.

    http://doi.acm.org/10.1145/605397.605403
  • 70J. Smith, R. Nair.

    Virtual Machines: Versatile Platforms for Systems and Processes (The Morgan Kaufmann Series in Computer Architecture and Design), Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 2005.
  • 71C. Tian, M. Feng, V. Nagarajan, R. Gupta.

    Copy or Discard Execution Model For Speculative Parallelization On Multicores, in: IEEE/ACM 41st International Symposium on Microarchitecture, MICRO 41, November 2008, p. 330-341.
  • 72J.-B. Tristan, X. Leroy.

    Formal verification of translation validators: a case study on instruction scheduling optimizations, in: SIGPLAN Not., 2008, vol. 43, no 1, p. 17–27.

    http://dx.doi.org/10.1145/1328897.1328444
  • 73J.-B. Tristan, X. Leroy.

    Verified Validation of Lazy Code Motion, in: Programming Language Design and Implementation 2009, ACM Press, 2009, To appear.
  • 74É. Violard, S. Genaud, G.-R. Perrin.

    Refinement of Data Parallel Programs in PEI, in: Proceedings of the IFIP TC 2 WG 2.1 international workshop on Algorithmic languages and calculi, London, UK, UK, Chapman & Hall, Ltd., 1997, p. 107–131.
  • 75A. Welc, S. Jagannathan, A. Hosking.

    Safe futures for Java, in: OOPSLA '05: Proceedings of the 20th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, New York, NY, USA, ACM, 2005, p. 439–453.

    http://doi.acm.org/10.1145/1094811.1094845
  • 76Q. Wu, O. Mencer.

    Evaluating Sampling Based Hotspot Detection, in: International Conference on Architecture of Computing Systems, ARCS, March 2009.
  • 77B. Xin, W. N. Sumner, X. Zhang.

    Efficient program execution indexing, in: PLDI '08: Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation, New York, NY, USA, ACM, 2008, p. 238–248.

    http://doi.acm.org/10.1145/1375581.1375611