Pips is not (just) polyhedral software, 1st International Workshop on Polyhedral Compilation Techniques, Impact, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00744312
A Linear Algebra Framework for Static High Performance Fortran Code Distribution, Scientic Programming, p.327, 1997. ,
DOI : 10.1155/1997/195689
A Particle-Mesh Integrator for Galactic Dynamics Powered by GPGPUs, International Conference on Computational Science: Part I, ICCS '09, 2009. ,
DOI : 10.1007/978-3-642-01970-8_88
StarPU: A Unied Platform for Task Scheduling on Heterogeneous Multicore Architectures. Concurrency and Computation: Practice and Experience, Special Issue: Euro-Par, p.187198, 2009. ,
Heterogeneous Multicore Parallel Programming for Graphics Processing Units, Scientific Programming, vol.17, issue.4, p.325336, 2009. ,
DOI : 10.1155/2009/784893
Rodinia: A benchmark suite for heterogeneous computing, 2009 IEEE International Symposium on Workload Characterization (IISWC), 2009. ,
DOI : 10.1109/IISWC.2009.5306797
Large-scale FFT on GPU clusters, Proceedings of the 24th ACM International Conference on Supercomputing, ICS '10, 2010. ,
DOI : 10.1145/1810085.1810128
Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis ,
DOI : 10.1109/SC.2008.5222004
Database compression on graphics processors, Proceedings of the VLDB Endowment, vol.3, issue.1-2, p.670680, 2010. ,
DOI : 10.14778/1920841.1920927
Parametric integer programming, RAIRO - Operations Research, vol.22, issue.3, 1988. ,
DOI : 10.1051/ro/1988220302431
Optimizing communication in SU- PERB, Proceedings of the joint international conference on Vector and parallel processing, CONPAR 90-VAPP IV, 1990. ,
Compilation Techniques for Optimizing Communication on Distributed-Memory Systems, 1993 International Conference on Parallel Processing, ICPP'93 Vol2, 1993. ,
DOI : 10.1109/ICPP.1993.58
hiCUDA: a high-level directivebased language for GPU programming, Proceedings of GPGPU-2, 2009. ,
Par4All automatic parallelization ,
HotSpot: a compact thermal modeling methodology for early-stage VLSI design, IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol.14, issue.5, 2006. ,
DOI : 10.1109/TVLSI.2006.876103
Semantical interprocedural parallelization: an overview of the PIPS project, ICS '91, p.244251, 1991. ,
URL : https://hal.archives-ouvertes.fr/hal-00984684
Automatic cpu-gpu communication management and optimization, Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, PLDI '11, p.142151, 2011. ,
OpenMPC: Extended OpenMP programming and tuning for GPUs, SC '10, p.111, 2010. ,
OpenMP to GPGPU: a compiler framework for automatic translation and optimization, PPoPP, 2009. ,
OMPCUDA : OpenMP Execution Framework for CUDA Based on Omni OpenMP Compiler, Beyond Loop Level Parallelism in OpenMP: Accelerators, Tasking and More, p.161173, 2010. ,
DOI : 10.1007/978-3-642-13217-9_13
The Polyhedral Benchmark suite 2, 2011. ,
Implementing the PGI Accelerator model, Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, GPGPU '10, 2010. ,
DOI : 10.1145/1735688.1735697
JCUDA: A Programmer-Friendly Interface for Accelerating Java Programs with CUDA, Proceedings of the 15th International Euro-Par Conference on Parallel Processing, 2009. ,
DOI : 10.1007/978-3-540-85261-2_6