Optimization of a lattice Boltzmann computation on state-of-the-art multicore platforms, Best Paper Awards and Panel Summary: 22nd International Parallel and Distributed Processing Symposium (IPDPS), vol.69, pp.762-777, 2008. ,
Performance analysis and optimization of three-dimensional FDTD on GPU using roofline model, Computer Physics Communications, vol.182, issue.6, pp.1201-1207, 2011. ,
Performance analysis with cache-aware roofline model in Intel Advisor, 2017 International Conference on High Performance Computing & Simulation (HPCS), pp.898-907, 2017. ,
Modeling Large Compute Nodes with Heterogeneous Memories with Cache-Aware Roofline Model, High Performance Computing systems-Performance Modeling, Benchmarking, and Simulation-8th International Workshop, PMBS 2017, vol.10724, pp.91-113, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01622582
Cache-aware roofline model: Upgrading the loft, IEEE Computer Architecture Letters, vol.13, pp.21-24, 2014. ,
A case study on using a proto-application as a proxy for code modernization, International Conference on Computational Science, vol.51, pp.1433-1442, 2015. ,
Porting a high-order finite-element earthquake modeling application to NVIDIA graphics cards using CUDA, Journal of Parallel and Distributed Computing, vol.69, issue.5, pp.451-460, 2009. ,
URL : https://hal.archives-ouvertes.fr/inria-00436426
Implicit nonlinear wave simulation with 1.08T DOF and 0.270T unstructured finite elements to enhance comprehensive earthquake simulation, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC '15, 2015. ,
Vectorized OpenCL implementation of numerical integration for higher order finite elements, Computers & Mathematics with Applications, vol.66, issue.10, pp.2030-2044, 2012. ,
Verification of a spectral-element method code for the Southern California Earthquake Center LOH.3 viscoelastic case, Bull. Seism. Soc. Am, vol.101, issue.6, pp.2855-2865, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00660332
Vectorization of a spectral finite-element numerical kernel, Proceedings of the 4th Workshop on Programming Models for SIMD/Vector Processing, vol.8, pp.1-8, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01835745
Data-layout reorganization for an efficient intra-node assembly of a Spectral Finite-Element Method, p.2018, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01680058
Roofline: An insightful visual performance model for multicore architectures, Commun. ACM, vol.52, pp.65-76, 2008. ,
High-frequency nonlinear earthquake simulations on petascale heterogeneous supercomputers, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp.957-968, 2016. ,
A 1.8 trillion degrees-of-freedom, 1.24 petaflops global seismic wave simulation on the K computer, IJHPCA, vol.30, issue.4, pp.411-422, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01265154
Accelerating seismic simulations using the Intel Xeon Phi Knights Landing processor, High Performance Computing-32nd International Conference, ISC High Performance, pp.139-157, 2017. ,
Energy efficiency vs. performance of the numerical solution of PDEs: An application study on a low-power ARM-based cluster, J. Comput. Physics, vol.237, pp.132-150, 2013. ,
Seismic wave propagation simulations on low-power and performance-centric manycores, Parallel Computing, vol.54, pp.108-120, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01273153
Chapter 24: Portable explicit vectorization intrinsics, High Performance Parallelism Pearls, pp.463-485, 2015. ,
A finite element-based machine learning approach for modeling the mechanical behavior of the breast tissues under compression in real-time, Comp. in Bio. and Med, vol.90, pp.116-124, 2017. ,
Sparse matrix solvers on the GPU: Conjugate gradients and multigrid, ACM Trans. Graph, vol.22, pp.917-924, 2003. ,
A general approach to nonlinear FE computations on shared-memory multiprocessors, Computer Methods in Applied Mechanics and Engineering, vol.72, issue.2, pp.153-171, 1989. ,
Divide and conquer parallelization of finite element method assembly, Parallel Computing: Accelerating Computational Science and Engineering (CSE), Proceedings of the International Conference on Parallel Computing, pp.753-762, 2013. ,
Assembly of finite element methods on graphics processors, International Journal for Numerical Methods in Engineering, vol.85, issue.5, pp.640-669, 2011. ,
Finite element assembly strategies on multi-core and many-core architectures, International Journal for Numerical Methods in Fluids, vol.71, issue.1, pp.80-97 ,
A spectral element method for fluid dynamics: laminar flow in a channel expansion, J. Comput. Phys, vol.54, pp.468-488, 1984. ,
Spectral element methods for the incompressible Navier-Stokes equations, pp.71-143, 1989. ,
Spectral-element methods for large scale parallel Navier-Stokes calculations, Comput. Methods Appl. Mech. Engrg, vol.116, pp.69-76, 1994. ,
Spectral-element simulations of global seismic wave propagation-I. Validation, Geophys. J. Int, vol.149, issue.2, pp.390-412, 2002. ,
URL : https://hal.archives-ouvertes.fr/hal-00669061
A Simulation of Seismic Wave Propagation at High Resolution in the Inner Core of the Earth on 2166 Processors of MareNostrum, pp.364-377, 2008. ,
Sustainable memory bandwidth in high performance computers, 1995. ,