Matches in SemOpenAlex for { <https://semopenalex.org/work/W4322011181> ?p ?o ?g. }
- W4322011181 abstract "The latest release of the NEMO ocean model, v4.2, includes many modifications that have a significant impact on model performance. The goal of this work is to assess the NEMO performance resulting from the optimizations carried out during the last four years within the IMMERSE and IS-ENES3 projects. The computational analysis was conducted using Extrae and Paraver, performance tools developed at the Barcelona Supercomputing Center. Extrae provides a trace rich in information about the model's use of computational resources, including measurements related to the memory subsystem, instruction cycles, vectorization level, communications among parallel processes, and many others. Paraver provides visual inspection of the trace and insight into the computational features of the NEMO model; this makes it easy to define a detailed quantitative evaluation of performance issues. The performance analysis carried out on NEMO is based on the evaluation of different metrics, each related to a different aspect of the computational resources. The main aspects analyzed are the execution time, the communication time, the number of instructions per cycle, and the cache hit rate.
In addition, we combined these metrics to evaluate the parallel scalability and the global efficiency of the model as the number of cores increases. Our investigation focused on evaluating the impact of the latest HPC changes, namely: the use of the neighbor collective communication pattern, available in MPI3, for the halo exchange; the use of the loop fusion technique to improve data locality; the impact of the extended halo; and the impact of the MPI+OpenMP version of NEMO obtained by means of PSyclone, a DSL compiler developed at STFC. The analysis was carried out on the MareNostrum4 supercomputer at BSC with the NEMO source code available @commit 1d9676ff (a.k.a. 68-summer-body-2022 branch) and using the Bench Test configured for ORCA12-like resolution. The evaluation of MPI+OpenMP was carried out using NEMO 4.0 in the ORCA025 configuration, kindly provided by STFC as an outcome of the PSyclone DSL compiler. The use of the extended halo with 2 points provides a significant performance improvement of 13%, due to a reduction in the number of exchanged messages. The use of MPI3 communications does not bring much benefit: the lower number of MPI point-to-point exchanges is offset by the larger message size of MPI3 neighbor collective communications. Loop fusion does not bring much benefit either: only a few routines use loop fusion, and the small improvement registered in cache misses is offset by the increase in the number of instructions due to the fusion of the loops. The analysis of the traces of the hybrid MPI/OpenMP NEMO version processed by PSyclone does not show much benefit as the number of OpenMP threads increases, because part of the code is not parallelized. Finally, one of the most important HPC developments, tiling, has not been analyzed yet, since the latest modifications were merged recently and the resulting code is still under revision." @default.
- W4322011181 created "2023-02-26" @default.
- W4322011181 creator A5035035680 @default.
- W4322011181 creator A5036972405 @default.
- W4322011181 creator A5061419770 @default.
- W4322011181 creator A5079559754 @default.
- W4322011181 date "2023-05-15" @default.
- W4322011181 modified "2023-09-27" @default.
- W4322011181 title "Performance evaluation of NEMO4.2 with Paraver" @default.
- W4322011181 doi "https://doi.org/10.5194/egusphere-egu23-12858" @default.
- W4322011181 hasPublicationYear "2023" @default.
- W4322011181 type Work @default.
- W4322011181 citedByCount "0" @default.
- W4322011181 crossrefType "posted-content" @default.
- W4322011181 hasAuthorship W4322011181A5035035680 @default.
- W4322011181 hasAuthorship W4322011181A5036972405 @default.
- W4322011181 hasAuthorship W4322011181A5061419770 @default.
- W4322011181 hasAuthorship W4322011181A5079559754 @default.
- W4322011181 hasConcept C115537543 @default.
- W4322011181 hasConcept C120314980 @default.
- W4322011181 hasConcept C138885662 @default.
- W4322011181 hasConcept C173608175 @default.
- W4322011181 hasConcept C206345919 @default.
- W4322011181 hasConcept C27602214 @default.
- W4322011181 hasConcept C2777115002 @default.
- W4322011181 hasConcept C2779808786 @default.
- W4322011181 hasConcept C31258907 @default.
- W4322011181 hasConcept C41008148 @default.
- W4322011181 hasConcept C41681595 @default.
- W4322011181 hasConcept C41895202 @default.
- W4322011181 hasConcept C44154836 @default.
- W4322011181 hasConcept C48044578 @default.
- W4322011181 hasConcept C66024118 @default.
- W4322011181 hasConcept C75291252 @default.
- W4322011181 hasConcept C77088390 @default.
- W4322011181 hasConcept C83283714 @default.
- W4322011181 hasConceptScore W4322011181C115537543 @default.
- W4322011181 hasConceptScore W4322011181C120314980 @default.
- W4322011181 hasConceptScore W4322011181C138885662 @default.
- W4322011181 hasConceptScore W4322011181C173608175 @default.
- W4322011181 hasConceptScore W4322011181C206345919 @default.
- W4322011181 hasConceptScore W4322011181C27602214 @default.
- W4322011181 hasConceptScore W4322011181C2777115002 @default.
- W4322011181 hasConceptScore W4322011181C2779808786 @default.
- W4322011181 hasConceptScore W4322011181C31258907 @default.
- W4322011181 hasConceptScore W4322011181C41008148 @default.
- W4322011181 hasConceptScore W4322011181C41681595 @default.
- W4322011181 hasConceptScore W4322011181C41895202 @default.
- W4322011181 hasConceptScore W4322011181C44154836 @default.
- W4322011181 hasConceptScore W4322011181C48044578 @default.
- W4322011181 hasConceptScore W4322011181C66024118 @default.
- W4322011181 hasConceptScore W4322011181C75291252 @default.
- W4322011181 hasConceptScore W4322011181C77088390 @default.
- W4322011181 hasConceptScore W4322011181C83283714 @default.
- W4322011181 hasLocation W43220111811 @default.
- W4322011181 hasOpenAccess W4322011181 @default.
- W4322011181 hasPrimaryLocation W43220111811 @default.
- W4322011181 hasRelatedWork W1509211761 @default.
- W4322011181 hasRelatedWork W1516360963 @default.
- W4322011181 hasRelatedWork W1534227216 @default.
- W4322011181 hasRelatedWork W1548317368 @default.
- W4322011181 hasRelatedWork W1572318402 @default.
- W4322011181 hasRelatedWork W2125347099 @default.
- W4322011181 hasRelatedWork W2372170743 @default.
- W4322011181 hasRelatedWork W2519373338 @default.
- W4322011181 hasRelatedWork W2582456645 @default.
- W4322011181 hasRelatedWork W4322011181 @default.
- W4322011181 isParatext "false" @default.
- W4322011181 isRetracted "false" @default.
- W4322011181 workType "article" @default.