https://hal-brgm.archives-ouvertes.fr/hal-02475700Rohmer, JeremyJeremyRohmerBRGM - Bureau de Recherches Géologiques et Minières (BRGM)Idier, DéborahDéborahIdierBRGM - Bureau de Recherches Géologiques et Minières (BRGM)Lecacheux, SophieSophieLecacheuxBRGM - Bureau de Recherches Géologiques et Minières (BRGM)Pedreros, RodrigoRodrigoPedrerosBRGM - Bureau de Recherches Géologiques et Minières (BRGM)Deepening analysis of uncertain categorical inputs using Gaussian processes -application to marine floodingHAL CCSD2020UncertaintyGaussian ProcessMarine flooding[INFO.INFO-MO] Computer Science [cs]/Modeling and Simulation[SDE.ES] Environmental Sciences/Environmental and Society[SDE.IE] Environmental Sciences/Environmental Engineering[STAT.ML] Statistics [stat]/Machine Learning [stat.ML][STAT.AP] Statistics [stat]/Applications [stat.AP][SDU.STU.OC] Sciences of the Universe [physics]/Earth Sciences/Oceanography[SHS.STAT] Humanities and Social Sciences/Methods and statisticsRohmer, Jérémy2020-02-12 11:20:272022-07-23 03:57:502020-05-26 10:00:14enPreprints, Working Papers, ...application/pdf1Marine flooding assessment relies on the use of high-resolution hydrodynamic numerical models to understand the relationships between inputs (e.g. offshore sea conditions) and the output variables (e.g. flood spatial extent), and to characterize the uncertainty. The input-output mapping is, however, hindered by two aspects: 1. the high computation time cost of the numerical model limits the number of model runs (usually a few 100s); 2. some of the input variables are often categorical in nature, because different modelling assumptions are equally appropriate. In this communication, we propose to explore how recent advances in Gaussian Processes with mixed continuous/categorical inputs can bring valuable insights into the influence of such categorical inputs. Depending on how they are modelled (nominal, ordinal) and integrated in the kernel (level-related heteroscedasticity, compound symmetry, group structure and interactions, etc), different viewpoints can be brought. Three application cases with gradual complexity are investigated: i. nominal input to analyse the scenarios of rupture geometry for tsunamis; ii. ordinal variable to reveal the latent influence of the angle of approach for cyclone tracks; iii. group kernels to reveal the interactions between artificial and natural coastal defences' failure scenarios for storm tides. Finally, the problem of selecting the most appropriate kernel structure using exploratory analysis and cross-validation procedures is discussed.