Accéder directement au contenu Accéder directement à la navigation
Article dans une revue

API Compilation for Image Hardware Accelerators

Abstract : We present an API-based compilation strategy to optimize image applications, developed using a high-level image processing library, onto three different image processing hardware accelerators. We demonstrate that such a strategy is profitable for both development cost and overall performance, especially as it takes advantage of optimization opportunities across library calls otherwise beyond reach. The library API provides the semantics of the image computations. The three image accelerator targets are quite distinct: the first one uses a vector architecture; the second one presents a SIMD architecture; the last one runs both on GPGPU and multi-cores through OpenCL. We have adapted standard compilation techniques to perform these compilation and code generation tasks automatically. Our strategy is implemented in\IfAnon{ an existing}{ PIPS, a} source-to-source compiler which greatly reduces the development cost as standard phases are reused and parameterized. We carried out experiments with applications on hardware functional simulators and GPUs. Our contributions include: 1)~a general low-cost compilation strategy for image processing applications, based on the semantics provided by library calls, which improves locality by an order of magnitude; 2)~specific heuristics to minimize execution time on the target accelerators; 3)~numerous experiments that show the effectiveness of our strategies. We also discuss the conditions required to extend this approach to other application domains.
Type de document :
Article dans une revue
Liste complète des métadonnées

https://hal-mines-paristech.archives-ouvertes.fr/hal-00829650
Contributeur : Claire Medrala <>
Soumis le : lundi 3 juin 2013 - 16:11:41
Dernière modification le : jeudi 24 septembre 2020 - 16:36:01

Lien texte intégral

Identifiants

Citation

Fabien Coelho, François Irigoin. API Compilation for Image Hardware Accelerators. ACM Transactions on Architecture and Code Optimization, Association for Computing Machinery, 2013, Volume 9 (Issue 4), pp.1-25. ⟨10.1145/2400682.2400708⟩. ⟨hal-00829650⟩

Partager

Métriques

Consultations de la notice

226