M. Amini, C. Ancourt, F. Coelho, B. Creusillet, S. Guelton et al., PIPS is not (just) polyhedral software, 1st International Workshop on Polyhedral Compilation Techniques, IMPACT, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00744312

C. Ancourt, F. Coelho, F. Irigoin, and R. Keryell, A Linear Algebra Framework for Static High Performance Fortran Code Distribution, Scientific Programming, p.327, 1997.
DOI : 10.1155/1997/195689

D. Aubert, M. Amini, and R. David, A Particle-Mesh Integrator for Galactic Dynamics Powered by GPGPUs, International Conference on Computational Science: Part I, ICCS '09, 2009.
DOI : 10.1007/978-3-642-01970-8_88

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures, Concurrency and Computation: Practice and Experience, Special Issue: Euro-Par, p.187-198, 2009.

F. Bodin and S. Bihan, Heterogeneous Multicore Parallel Programming for Graphics Processing Units, Scientific Programming, vol.17, issue.4, p.325-336, 2009.
DOI : 10.1155/2009/784893

S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer et al., Rodinia: A benchmark suite for heterogeneous computing, 2009 IEEE International Symposium on Workload Characterization (IISWC), 2009.
DOI : 10.1109/IISWC.2009.5306797

Y. Chen, X. Cui, and H. Mei, Large-scale FFT on GPU clusters, Proceedings of the 24th ACM International Conference on Supercomputing, ICS '10, 2010.
DOI : 10.1145/1810085.1810128

K. Datta, M. Murphy, V. Volkov, S. Williams, J. Carter et al., Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, 2008.
DOI : 10.1109/SC.2008.5222004

W. Fang, B. He, and Q. Luo, Database compression on graphics processors, Proceedings of the VLDB Endowment, vol.3, issue.1-2, p.670-680, 2010.
DOI : 10.14778/1920841.1920927

P. Feautrier, Parametric integer programming, RAIRO - Operations Research, vol.22, issue.3, 1988.
DOI : 10.1051/ro/1988220302431

H. M. Gerndt and H. P. Zima, Optimizing communication in SUPERB, Proceedings of the joint international conference on Vector and parallel processing, CONPAR 90-VAPP IV, 1990.

C. Gong, R. Gupta, and R. Melhem, Compilation Techniques for Optimizing Communication on Distributed-Memory Systems, 1993 International Conference on Parallel Processing, ICPP'93 Vol2, 1993.
DOI : 10.1109/ICPP.1993.58

T. D. Han and T. S. Abdelrahman, hiCUDA: a high-level directive-based language for GPU programming, Proceedings of GPGPU-2, 2009.

HPC Project, Par4All automatic parallelization.

W. Huang, S. Ghosh, S. Velusamy, K. Sankaranarayanan, K. Skadron et al., HotSpot: a compact thermal modeling methodology for early-stage VLSI design, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol.14, issue.5, 2006.
DOI : 10.1109/TVLSI.2006.876103

F. Irigoin, P. Jouvelot, and R. Triolet, Semantical interprocedural parallelization: an overview of the PIPS project, ICS '91, p.244-251, 1991.
URL : https://hal.archives-ouvertes.fr/hal-00984684

T. B. Jablin, P. Prabhu, J. A. Jablin, N. P. Johnson, S. R. Beard et al., Automatic CPU-GPU communication management and optimization, Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, PLDI '11, p.142-151, 2011.

S. Lee and R. Eigenmann, OpenMPC: Extended OpenMP programming and tuning for GPUs, SC '10, p.1-11, 2010.

S. Lee, S. Min, and R. Eigenmann, OpenMP to GPGPU: a compiler framework for automatic translation and optimization, PPoPP, 2009.

S. Ohshima, S. Hirasawa, and H. Honda, OMPCUDA: OpenMP Execution Framework for CUDA Based on Omni OpenMP Compiler, Beyond Loop Level Parallelism in OpenMP: Accelerators, Tasking and More, p.161-173, 2010.
DOI : 10.1007/978-3-642-13217-9_13

L. Pouchet, The Polyhedral Benchmark suite 2, 2011.

M. Wolfe, Implementing the PGI Accelerator model, Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, GPGPU '10, 2010.
DOI : 10.1145/1735688.1735697

Y. Yan, M. Grossman, and V. Sarkar, JCUDA: A Programmer-Friendly Interface for Accelerating Java Programs with CUDA, Proceedings of the 15th International Euro-Par Conference on Parallel Processing, 2009.
DOI : 10.1007/978-3-540-85261-2_6