Wilson-Dirac Operator Revisited on Multicore Vector Computers - Mines Paris Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2016

Wilson-Dirac Operator Revisited on Multicore Vector Computers


We revisit the Wilson-Dirac operator, also refered as Dslash, on multicore vector machines. The Wilson-Dirac operator is the major computing kernel in Lattice Quantum ChromoDynamics (LQCD), which is the canonical discrete formalism for Quantum ChromoDynamics (QCD) investigations. QCD is the theory of sub-nuclear particles physics, aiming at modeling the strong nuclear force, which is responsible for the interactions of nuclear particles. Based on LQCD formalism, intensive simulations are performed following the Monte Carlo paradigm. Informative observations are expected from large-scale and numerically sensitive LQCD simulations. The corresponding computing demand is therefore tremendous, thus the serious consideration for powerful supercomputers. Designing efficient LQCD codes on modern (mostly hybrid) supercomputers requires to efficiently exploit all available levels of parallelism including accelerators. Since the Wilson-Dirac operator is a coarse-grain stencil computation performed on huge volume of data, any performance and scalability related investigation should skillfully address memory accesses and interprocessor communication overheads. In order the lower the latter, an explicit shared memory implementation should be considered at the node level, since this will lead to a less complex data communication graph. This the main focus of the current paper, where we provide, explain, and discuss a multi-threaded vector implementation, whose experimental results in double precision on the recently released INTEL BROADWELL based machine show a competitive absolute efficiency and a good scalability on one of its four NUMA nodes. An extension to all available nodes is currently under investigation through NUMA-awareness consideration. 545 cuba050new.aketitle
Fichier principal
Vignette du fichier
E-396.pdf (374.48 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01433283 , version 1 (12-01-2017)


  • HAL Id : hal-01433283 , version 1


Claude Tadonki. Wilson-Dirac Operator Revisited on Multicore Vector Computers. [Research Report] Mines ParisTech - PSL Research University - Centre de Recherche en Informatique (CRI). 2016. ⟨hal-01433283⟩
79 Consultations
60 Téléchargements


Gmail Facebook Twitter LinkedIn More