Matches in SemOpenAlex for { <https://semopenalex.org/work/W2979810166> ?p ?o ?g. }
Showing items 1 to 72 of
72
with 100 items per page.
- W2979810166 abstract "This dissertation studies the sources of poor performance in scientific computing codes based on partial differential equations (PDEs), which typically perform at a computational rate well below other scientific simulations (e.g., those with dense linear algebra or N-body kernels) on modern architectures with deep memory hierarchies. We identify that the primary factors responsible for this relatively poor performance are: insufficient available memory bandwidth, low ratio of work to data size (good algorithmic efficiency), and nonscaling cost of synchronization and gather/scatter operations (for a fixed problem size scaling). This dissertation also illustrates how to reuse the legacy scientific and engineering software within a library framework. Specifically, a three-dimensional unstructured grid incompressible Euler code from NASA has been parallelized with the Portable Extensible Toolkit for Scientific Computing (PETSc) library for distributed memory architectures. Using this newly instrumented code (called PETSc-FUN3D) as an example of a typical PDE solver, we demonstrate some strategies that are effective in tolerating the latencies arising from the hierarchical memory system and the network. Even on a single processor from each of the major contemporary architectural families, the PETSc-FUN3D code runs from 2.5 to 7.5 times faster than the legacy code on a medium-sized data set (with approximately 105 degrees of freedom). The major source of performance improvement is the increased locality in data reference patterns achieved through blocking, interlacing, and edge reordering. To explain these performance gains, we provide simple performance models based on memory bandwidth and instruction issue rates. Experimental evidence, in terms of translation lookaside buffer (TLB) and data cache miss rates, achieved memory bandwidth, and graduated floating point instructions per memory reference, is provided through accurate measurements with hardware counters. The performance models and experimental results motivate algorithmic and software practices that lead to improvements in both parallel scalability and per-node performance. We identify the bottlenecks to scalability (algorithmic as well as implementation) for a fixed-size problem when the number of processors grows to several thousands (the expected level of concurrency on terascale architectures). We also evaluate the hybrid programming model (mixed distributed/shared) from a performance standpoint." @default.
- W2979810166 created "2019-10-18" @default.
- W2979810166 creator A5036730865 @default.
- W2979810166 date "2002-01-01" @default.
- W2979810166 modified "2023-09-23" @default.
- W2979810166 title "Performance Modeling and Prediction for the Scalable Solution of Partial Differential Equations on Unstructured Grids" @default.
- W2979810166 doi "https://doi.org/10.25777/armz-kn08" @default.
- W2979810166 hasPublicationYear "2002" @default.
- W2979810166 type Work @default.
- W2979810166 sameAs 2979810166 @default.
- W2979810166 citedByCount "0" @default.
- W2979810166 crossrefType "journal-article" @default.
- W2979810166 hasAuthorship W2979810166A5036730865 @default.
- W2979810166 hasConcept C113775141 @default.
- W2979810166 hasConcept C121332964 @default.
- W2979810166 hasConcept C133875982 @default.
- W2979810166 hasConcept C173608175 @default.
- W2979810166 hasConcept C187691185 @default.
- W2979810166 hasConcept C188045654 @default.
- W2979810166 hasConcept C2524010 @default.
- W2979810166 hasConcept C33923547 @default.
- W2979810166 hasConcept C41008148 @default.
- W2979810166 hasConcept C459310 @default.
- W2979810166 hasConcept C48044578 @default.
- W2979810166 hasConcept C62520636 @default.
- W2979810166 hasConcept C77088390 @default.
- W2979810166 hasConcept C80444323 @default.
- W2979810166 hasConcept C91481028 @default.
- W2979810166 hasConcept C93779851 @default.
- W2979810166 hasConceptScore W2979810166C113775141 @default.
- W2979810166 hasConceptScore W2979810166C121332964 @default.
- W2979810166 hasConceptScore W2979810166C133875982 @default.
- W2979810166 hasConceptScore W2979810166C173608175 @default.
- W2979810166 hasConceptScore W2979810166C187691185 @default.
- W2979810166 hasConceptScore W2979810166C188045654 @default.
- W2979810166 hasConceptScore W2979810166C2524010 @default.
- W2979810166 hasConceptScore W2979810166C33923547 @default.
- W2979810166 hasConceptScore W2979810166C41008148 @default.
- W2979810166 hasConceptScore W2979810166C459310 @default.
- W2979810166 hasConceptScore W2979810166C48044578 @default.
- W2979810166 hasConceptScore W2979810166C62520636 @default.
- W2979810166 hasConceptScore W2979810166C77088390 @default.
- W2979810166 hasConceptScore W2979810166C80444323 @default.
- W2979810166 hasConceptScore W2979810166C91481028 @default.
- W2979810166 hasConceptScore W2979810166C93779851 @default.
- W2979810166 hasLocation W29798101661 @default.
- W2979810166 hasOpenAccess W2979810166 @default.
- W2979810166 hasPrimaryLocation W29798101661 @default.
- W2979810166 hasRelatedWork W1493815697 @default.
- W2979810166 hasRelatedWork W2005634731 @default.
- W2979810166 hasRelatedWork W2029014465 @default.
- W2979810166 hasRelatedWork W2048350991 @default.
- W2979810166 hasRelatedWork W2107709480 @default.
- W2979810166 hasRelatedWork W2115704527 @default.
- W2979810166 hasRelatedWork W2240204032 @default.
- W2979810166 hasRelatedWork W2302356120 @default.
- W2979810166 hasRelatedWork W2346028389 @default.
- W2979810166 hasRelatedWork W2529389297 @default.
- W2979810166 hasRelatedWork W2560781234 @default.
- W2979810166 hasRelatedWork W2608177516 @default.
- W2979810166 hasRelatedWork W2753015639 @default.
- W2979810166 hasRelatedWork W2899353149 @default.
- W2979810166 hasRelatedWork W2967495484 @default.
- W2979810166 hasRelatedWork W3023841047 @default.
- W2979810166 hasRelatedWork W3032127535 @default.
- W2979810166 hasRelatedWork W3091752332 @default.
- W2979810166 hasRelatedWork W3119799675 @default.
- W2979810166 hasRelatedWork W53560269 @default.
- W2979810166 isParatext "false" @default.
- W2979810166 isRetracted "false" @default.
- W2979810166 magId "2979810166" @default.
- W2979810166 workType "article" @default.