Impact of vectorization and multithreading on performance and energy consumption on Jetson boards

Abstract : ARM processors are well known for their energy efficiency and are consequently widely used in embedded platforms. Like other processor architectures, they are built with different levels of parallelism, from Instruction Level Parallelism (out-of-order and superscalar capabilities) to Thread Level Parallelism (multicore), to increase their performance levels. These processors are now also targeting the HPC domain and will equip the Fujitsu Post-K supercomputer. Some ARM processors from the Cortex-A series, which equip smartphones and tablets, also provide Data Level Parallelism through SIMD units called NEON. These units are able to process 128-bit of data at a time, for example four 32-bit floating point values. Taking advantage of these units requires code vectorization which may be performed automatically by the compiler or explicitly by using NEON intrinsics. Exploiting all these levels of parallelism may lead to better performance as well as a higher energy consumption. This is not an issue in the HPC domain where application development is driven by finding the best performance. However, developing for embedded applications is driven by finding the best trade-off between energy consumption and performance. In this paper, we propose to study the impact of vectorization and multithreading on both performance and energy consumption on some Nvidia Jetson boards. Results show that depending on the algorithm and on its implementation, vectorization may bring a similar speedup as an OpenMP scalar implementation but with a lower energy consumption. However, combining vectorization and multithreading may lead close to both the best performance level and the lowest energy consumption but not when running cores at their maximum frequencies.
Type de document :
Communication dans un congrès
Liste complète des métadonnées

Littérature citée [16 références]  Voir  Masquer  Télécharger

https://hal-brgm.archives-ouvertes.fr/hal-01795146
Contributeur : Gauthier Sornet <>
Soumis le : mardi 21 août 2018 - 11:25:28
Dernière modification le : mardi 2 avril 2019 - 01:39:34
Document(s) archivé(s) le : jeudi 22 novembre 2018 - 13:13:25

Fichier

main-IEEE.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01795146, version 2

Citation

Sylvain Jubertie, Emmanuel Melin, Naly Raliravaka, Emmanuel Bodèle, Pablo Escot Bocanegra. Impact of vectorization and multithreading on performance and energy consumption on Jetson boards. The 2018 International Conference on High Performance Computing & Simulation (HPCS 2018) - HPCS 2018, Jul 2018, Orléans, France. ⟨hal-01795146v2⟩

Partager

Métriques

Consultations de la notice

75

Téléchargements de fichiers

24