S. Williams, J. Carter, L. Oliker, J. Shalf, and K. Yelick, Optimization of a lattice Boltzmann computation on state-of-the-art multicore platforms, Best Paper Awards and Panel Summary: 22nd International Parallel and Distributed Processing Symposium (IPDPS), vol.69, pp.762-777, 2008.

K. Kim, K. Kim, and Q. Park, Performance analysis and optimization of three-dimensional FDTD on GPU using roofline model, Computer Physics Communications, vol.182, issue.6, pp.1201-1207, 2011.

D. Marques, H. Duarte, A. Ilic, L. Sousa, R. Belenov et al., Performance analysis with cache-aware roofline model in Intel Advisor, 2017 International Conference on High Performance Computing & Simulation (HPCS), pp.898-907, 2017.

N. Denoyelle, B. Goglin, A. Ilic, E. Jeannot, and L. Sousa, Modeling Large Compute Nodes with Heterogeneous Memories with Cache-Aware Roofline Model, High Performance Computing systems-Performance Modeling, Benchmarking, and Simulation-8th International Workshop, PMBS 2017, vol.10724, pp.91-113, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01622582

A. Ilic, F. Pratas, and L. Sousa, Cache-aware roofline model: Upgrading the loft, IEEE Computer Architecture Letters, vol.13, pp.21-24, 2014.

N. Möller, E. Petit, L. Thébault, and Q. Dinh, A case study on using a proto-application as a proxy for code modernization, International Conference On Computational Science, vol.51, pp.1433-1442, 2015.

D. Komatitsch, D. Michéa, and G. Erlebacher, Porting a high-order finite-element earthquake modeling application to NVIDIA graphics cards using CUDA, Journal of Parallel and Distributed Computing, vol.69, issue.5, pp.451-460, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00436426

T. Ichimura, K. Fujita, P. E. Quinay, L. Maddegedara, M. Hori et al., Implicit nonlinear wave simulation with 1.08T DOF and 0.270T unstructured finite elements to enhance comprehensive earthquake simulation, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC '15, 2015.

F. Krużel and K. Banaś, Vectorized OpenCL implementation of numerical integration for higher order finite elements, Computers & Mathematics with Applications, vol.66, issue.10, pp.2030-2044, 2012.

F. D. Martin, Verification of a spectral-element method code for the Southern California Earthquake Center LOH.3 viscoelastic case, Bull. Seism. Soc. Am, vol.101, issue.6, pp.2855-2865, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00660332

S. Jubertie, F. Dupros, and F. D. Martin, Vectorization of a spectral finite-element numerical kernel, Proceedings of the 4th Workshop on Programming Models for SIMD/Vector Processing, vol.8, pp.1-8, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01835745

G. Sornet, S. Jubertie, F. Dupros, F. De Martin, P. Thierry et al., Data-layout reorganization for an efficient intra-node assembly of a Spectral Finite-Element Method, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01680058

S. Williams, A. Waterman, and D. Patterson, Roofline: An insightful visual performance model for multicore architectures, Commun. ACM, vol.52, pp.65-76, 2008.

D. Roten, Y. Cui, K. B. Olsen, S. M. Day, K. Withers et al., High-frequency nonlinear earthquake simulations on petascale heterogeneous supercomputers, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp.957-968, 2016.

S. Tsuboi, K. Ando, T. Miyoshi, D. Peter, D. Komatitsch et al., A 1.8 trillion degrees-of-freedom, 1.24 petaflops global seismic wave simulation on the K computer, IJHPCA, vol.30, issue.4, pp.411-422, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01265154

J. Tobin, A. Breuer, A. Heinecke, C. Yount, and Y. Cui, Accelerating seismic simulations using the Intel Xeon Phi Knights Landing processor, High Performance Computing-32nd International Conference, ISC High Performance, pp.139-157, 2017.

D. Göddeke, D. Komatitsch, M. Geveler, D. Ribbrock, N. Rajovic et al., Energy efficiency vs. performance of the numerical solution of PDEs: An application study on a low-power ARM-based cluster, J. Comput. Physics, vol.237, pp.132-150, 2013.

M. Castro, E. Francesquini, F. Dupros, H. Aochi, P. O. Navaux et al., Seismic wave propagation simulations on low-power and performance-centric manycores, Parallel Computing, vol.54, pp.108-120, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01273153

P. Souza, L. Borges, C. Andreolli, and P. Thierry, Chapter 24: Portable explicit vectorization intrinsics, High Performance Parallelism Pearls, pp.463-485, 2015.

F. Martínez-Martínez, M. J. Rupérez-Moreno, M. Martínez-Sober, J. A. Llorens, D. Lorente et al., A finite element-based machine learning approach for modeling the mechanical behavior of the breast tissues under compression in real-time, Comp. in Bio. and Med, vol.90, pp.116-124, 2017.

J. Bolz, I. Farmer, E. Grinspun, and P. Schröder, Sparse matrix solvers on the GPU: Conjugate gradients and multigrid, ACM Trans. Graph, vol.22, pp.917-924, 2003.

C. Farhat and L. Crivelli, A general approach to nonlinear FE computations on shared-memory multiprocessors, Computer Methods in Applied Mechanics and Engineering, vol.72, issue.2, pp.153-171, 1989.

L. Thébault, E. Petit, M. Tchiboukdjian, Q. Dinh, and W. Jalby, Divide and conquer parallelization of finite element method assembly, Parallel Computing: Accelerating Computational Science and Engineering (CSE), Proceedings of the International Conference on Parallel Computing, pp.753-762, 2013.

C. Cecka, A. J. Lew, and E. Darve, Assembly of finite element methods on graphics processors, International Journal for Numerical Methods in Engineering, vol.85, issue.5, pp.640-669, 2011.

G. R. Markall, A. Slemmer, D. A. Ham, P. H. Kelly, C. D. Cantwell et al., Finite element assembly strategies on multi-core and many-core architectures, International Journal for Numerical Methods in Fluids, vol.71, issue.1, pp.80-97

A. T. Patera, A spectral element method for fluid dynamics: laminar flow in a channel expansion, J. Comput. Phys, vol.54, pp.468-488, 1984.

Y. Maday and A. T. Patera, Spectral element methods for the incompressible Navier-Stokes equations, pp.71-143, 1989.

P. F. Fischer and E. M. Rønquist, Spectral-element methods for large scale parallel Navier-Stokes calculations, Comput. Methods Appl. Mech. Engrg, vol.116, pp.69-76, 1994.

D. Komatitsch and J. Tromp, Spectral-element simulations of global seismic wave propagation-I. Validation, Geophys. J. Int, vol.149, issue.2, pp.390-412, 2002.
URL : https://hal.archives-ouvertes.fr/hal-00669061

D. Komatitsch, J. Labarta, and D. Michéa, A Simulation of Seismic Wave Propagation at High Resolution in the Inner Core of the Earth on 2166 Processors of MareNostrum, pp.364-377, 2008.

J. McCalpin, Sustainable memory bandwidth in high-performance computers, 1995.