Performance of OpenMP loop transformations for the acoustic wave stencil on GPUs - Archive ouverte HAL Accéder directement au contenu
Poster Année :

Performance of OpenMP loop transformations for the acoustic wave stencil on GPUs

(1, 2) , (1) , (2) , (3, 4) , (5) , (1, 2)
1
2
3
4
5

Résumé

Main Findings : • As a general remark, both loop transformations, unroll and tiling can yield significant improvements to the performance of the kernel evaluated on all GPUs evaluated. • Performance gains ranged from 1.13x to 2.93x. In most scenarios, the best performance was achieved by combining unroll and tiling. • The performance of tiling is highly sensitive to the choice of block size.
Fichier principal
Vignette du fichier
A-789.pdf (681.63 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03888100 , version 1 (29-12-2022)

Identifiants

  • HAL Id : hal-03888100 , version 1

Citer

Jaime Freire de Souza, Leticia Suellen Farias Machado, Edson Satoshi Gomi, Claude Tadonki, Simon Mcintosh-Smith, et al.. Performance of OpenMP loop transformations for the acoustic wave stencil on GPUs. SC22 The International Conference for High Performance Computing, Networking, Storage, and Analysis, Nov 2022, Dallas, United States. . ⟨hal-03888100⟩
0 Consultations
0 Téléchargements

Partager

Gmail Facebook Twitter LinkedIn More