Impact of vectorization and multithreading on performance and energy consumption on Jetson boards - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année :

Impact of vectorization and multithreading on performance and energy consumption on Jetson boards

(1) , (1) , (2) , (3) , (4)
1
2
3
4

Résumé

ARM processors are well known for their energy efficiency and are consequently widely used in embedded platforms. Like other processor architectures, they are built with different levels of parallelism, from Instruction Level Parallelism (out-of-order and superscalar capabilities) to Thread Level Parallelism (multicore), to increase their performance levels. These processors are now also targeting the HPC domain and will equip the Fujitsu Post-K supercomputer. Some ARM processors from the Cortex-A series, which equip smartphones and tablets, also provide Data Level Parallelism through SIMD units called NEON. These units are able to process 128-bit of data at a time, for example four 32-bit floating point values. Taking advantage of these units requires code vectorization which may be performed automatically by the compiler or explicitly by using NEON intrinsics. Exploiting all these levels of parallelism may lead to better performance as well as a higher energy consumption. This is not an issue in the HPC domain where application development is driven by finding the best performance. However, developing for embedded applications is driven by finding the best trade-off between energy consumption and performance. In this paper, we propose to study the impact of vectorization and multithreading on both performance and energy consumption on some Nvidia Jetson boards. Results show that depending on the algorithm and on its implementation, vectorization may bring a similar speedup as an OpenMP scalar implementation but with a lower energy consumption. However, combining vectorization and multithreading may lead close to both the best performance level and the lowest energy consumption but not when running cores at their maximum frequencies.
Fichier principal
Vignette du fichier
HPCS_energy_jubertie.pdf (1.04 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01795146 , version 1 (18-05-2018)
hal-01795146 , version 2 (21-08-2018)
hal-01795146 , version 3 (09-04-2020)

Identifiants

  • HAL Id : hal-01795146 , version 3

Citer

Sylvain Jubertie, Emmanuel Melin, Naly Raliravaka, Emmanuel Bodèle, Pablo Escot Bocanegra. Impact of vectorization and multithreading on performance and energy consumption on Jetson boards. The 2018 International Conference on High Performance Computing & Simulation (HPCS 2018) - HPCS 2018, Jul 2018, Orléans, France. ⟨hal-01795146v3⟩
329 Consultations
603 Téléchargements

Partager

Gmail Facebook Twitter LinkedIn More