A Linear Algebra Framework for Static High Performance Fortran Code Distribution, Scientific Programming, pp.3-27, 1997. ,
DOI : 10.1155/1997/195689
A Particle-Mesh Integrator for Galactic Dynamics Powered by GPGPUs, International Conference on Computational Science : Part I, ICCS '09 ,
DOI : 10.1007/978-3-642-01970-8_88
StarPU : A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. Concurrency and Computation : Practice and Experience, Special Issue : Euro-Par, pp.187-198, 2009. ,
URL : https://hal.archives-ouvertes.fr/inria-00384363
Heterogeneous Multicore Parallel Programming for Graphics Processing Units, Scientific Programming, vol.17, issue.4, pp.325-336, 2009. ,
DOI : 10.1155/2009/784893
Large-scale FFT on GPU clusters, Proceedings of the 24th ACM International Conference on Supercomputing, ICS '10, pp.315-324, 2010. ,
DOI : 10.1145/1810085.1810128
Interprocedural Array Region Analyses, International Journal of Parallel Programming, vol.2, issue.3, pp.513-546, 1996. ,
DOI : 10.1007/BF03356758
URL : https://hal.archives-ouvertes.fr/hal-00752611
Database compression on graphics processors, Proc. VLDB Endow, pp.670-680, 2010. ,
DOI : 10.14778/1920841.1920927
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.172.8125
Parametric integer programming, RAIRO - Operations Research, vol.22, issue.3, 1988. ,
DOI : 10.1051/ro/1988220302431
Optimizing communication in SUPERB, Proceedings of the joint international conference on Vector and parallel processing, 1990. ,
Compilation Techniques for Optimizing Communication on Distributed-Memory Systems, 1993 International Conference on Parallel Processing, ICPP'93 Vol2, 1993. ,
DOI : 10.1109/ICPP.1993.58
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.17.759
hiCUDA : a high-level directive-based language for GPU programming, Proceedings of GPGPU-2, 2009. ,
Par4All initiative for automatic parallelization ,
Semantical interprocedural parallelization : an overview of the PIPS project, ICS '91, pp.244-251, 1991. ,
URL : https://hal.archives-ouvertes.fr/hal-00984684
OpenMPC: Extended OpenMP Programming and Tuning for GPUs, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-11, 2010. ,
DOI : 10.1109/SC.2010.36
OpenMP to GPGPU : a compiler framework for automatic translation and optimization, pp.101-110, 2009. ,
OMPCUDA : OpenMP Execution Framework for CUDA Based on Omni OpenMP Compiler, Beyond Loop Level Parallelism in OpenMP : Accelerators , Tasking and More, pp.161-173, 2010. ,
DOI : 10.1007/978-3-642-13217-9_13
Exploring data streaming to improve 3d FFT implementation on multiple GPUs, International Symposium on Computer Architecture and High Performance Computing Workshops, 2010. ,
Implementing the PGI Accelerator model, Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics Processing Units, GPGPU '10, pp.43-50, 2010. ,
DOI : 10.1145/1735688.1735697