D. Barthou, G. Grosdidier, M. Kruse, O. Pene, and C. Tadonki, QIRAL: A High Level Language for Lattice QCD Code Generation, ETAPS, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00666885

M. A. Clark, R. Babich, K. Barros, R. C. Brower, and C. Rebbi, Solving lattice QCD systems of equations using mixed precision solvers on GPUs, Computer Physics Communications, vol.181, issue.9, pp.1517-1528, 2010.
DOI : 10.1016/j.cpc.2010.05.002

R. G. Edwards and B. Joó, The Chroma Software System for Lattice QCD.

G. Grosdidier, Scaling stories, PetaQCD Final Review Meeting, 2012.

K. Z. Ibrahim and F. Bodin, Implementing Wilson-Dirac operator on the cell broadband engine, Proceedings of the 22nd annual international conference on Supercomputing, ICS '08, pp.4-14, 2008.
DOI : 10.1145/1375527.1375532
URL : https://hal.archives-ouvertes.fr/inria-00203478

K. Jansen and C. Urbach, tmLQCD: A program suite to simulate Wilson twisted mass lattice QCD, Computer Physics Communications, vol.180, issue.12, pp.2717-2738, 2009.
DOI : 10.1016/j.cpc.2009.05.016

Y. Li, I. Pandis, R. Mueller, V. Raman, and G. Lohman, NUMA-aware algorithms: the case of data shuffling.

M. Lüscher, Implementation of the lattice Dirac operator, 2006.

D. Pleiter, QPACE: Power-efficient parallel architecture based on IBM PowerXCell 8i, 2010.

M. Smelyanskiy, K. Vaidyanathan, J. Choi, B. Joó, J. Chhugani et al., High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach, Proceedings of the 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC '11, p.11, 2011.
DOI : 10.1145/2063384.2063477

C. Tadonki, G. Grosdidier, and O. Pene, An efficient CELL library for Lattice Quantum Chromodynamics, International Workshop on Highly Efficient Accelerators and Reconfigurable Technologies (HEART), in conjunction with the 24th ACM International Conference on Supercomputing (ICS), pp.67-71, 2010.

C. Urbach, K. Jansen, A. Shindler, and U. Wenger, HMC algorithm with multiple time scale integration and mass preconditioning, Computer Physics Communications, vol.174, issue.2, p.87, 2006.
DOI : 10.1016/j.cpc.2005.08.006
URL : http://arxiv.org/abs/hep-lat/0506011

C. Van Loan, Computational Frameworks for the Fast Fourier Transform, 1992.
DOI : 10.1137/1.9781611970999

F. Wilczek, What QCD Tells Us About Nature and Why We Should Listen, Nuclear Physics A, 2000.
DOI : 10.1016/s0375-9474(99)00567-9
URL : http://arxiv.org/abs/hep-ph/9907340