Matches in SemOpenAlex for { <https://semopenalex.org/work/W2887331837> ?p ?o ?g. }
- W2887331837 abstract "GPU performance of the lattice Boltzmann method (LBM) depends heavily on memory access patterns. When LBM is advanced with GPUs on complex computational domains, geometric data is typically accessed indirectly, and lattice data is typically accessed lexicographically in the Structure of Array (SoA) layout. Although there are a variety of existing access patterns beyond the typical choices, no study has yet examined the relative efficacy between them. Here, we compare a suite of memory access schemes via empirical testing and performance modeling. We find strong evidence that semi-direct addressing is the superior addressing scheme for the majority of cases examined: Semi-direct addressing increases computational speed and often reduces memory consumption. For lattice layout, we find that the Collected Structure of Arrays (CSoA) layout outperforms the SoA layout. When compared to state-of-the-art practices, our recommended addressing modifications lead to performance gains between 10-40% across different complex geometries, fluid volume fractions, and resolutions. The modifications also lead to a decrease in memory consumption by as much as 17%. Having discovered these improvements, we examine a highly resolved arterial geometry on a leadership class system. On this system we present the first near-optimal strong results for LBM with arterial geometries run on GPUs. We also demonstrate that the above recommendations remain valid for large scale, many device simulations, which leads to an increased computational speed and average memory usage reductions. To understand these observations, we employ performance modeling which reveals that semi-direct methods outperform indirect methods due to a reduced number of total loads/stores in memory, and that CSoA outperforms SoA and bundling due to improved caching behavior." @default.
- W2887331837 created "2018-08-22" @default.
- W2887331837 creator A5039335591 @default.
- W2887331837 creator A5056497188 @default.
- W2887331837 creator A5061838490 @default.
- W2887331837 creator A5090962911 @default.
- W2887331837 date "2018-05-01" @default.
- W2887331837 modified "2023-10-16" @default.
- W2887331837 title "GPU Data Access on Complex Geometries for D3Q19 Lattice Boltzmann Method" @default.
- W2887331837 cites W1599770531 @default.
- W2887331837 cites W1925088345 @default.
- W2887331837 cites W1964428959 @default.
- W2887331837 cites W1966277182 @default.
- W2887331837 cites W1976346389 @default.
- W2887331837 cites W1982008063 @default.
- W2887331837 cites W1986967511 @default.
- W2887331837 cites W1996820070 @default.
- W2887331837 cites W2000691406 @default.
- W2887331837 cites W2010386705 @default.
- W2887331837 cites W2012250964 @default.
- W2887331837 cites W2020395610 @default.
- W2887331837 cites W2023972518 @default.
- W2887331837 cites W2037561415 @default.
- W2887331837 cites W2043616378 @default.
- W2887331837 cites W2049800247 @default.
- W2887331837 cites W2058916022 @default.
- W2887331837 cites W2059088502 @default.
- W2887331837 cites W2071296827 @default.
- W2887331837 cites W2071642149 @default.
- W2887331837 cites W2073875590 @default.
- W2887331837 cites W2074573067 @default.
- W2887331837 cites W2076361081 @default.
- W2887331837 cites W2081233300 @default.
- W2887331837 cites W2089096153 @default.
- W2887331837 cites W2092687851 @default.
- W2887331837 cites W2099916305 @default.
- W2887331837 cites W2106821446 @default.
- W2887331837 cites W2109907633 @default.
- W2887331837 cites W2112053550 @default.
- W2887331837 cites W2117242079 @default.
- W2887331837 cites W2118784321 @default.
- W2887331837 cites W2147468445 @default.
- W2887331837 cites W2153730006 @default.
- W2887331837 cites W2195479374 @default.
- W2887331837 cites W3099053820 @default.
- W2887331837 cites W4240916550 @default.
- W2887331837 cites W4251386884 @default.
- W2887331837 doi "https://doi.org/10.1109/ipdps.2018.00092" @default.
- W2887331837 hasPublicationYear "2018" @default.
- W2887331837 type Work @default.
- W2887331837 sameAs 2887331837 @default.
- W2887331837 citedByCount "18" @default.
- W2887331837 countsByYear W28873318372019 @default.
- W2887331837 countsByYear W28873318372020 @default.
- W2887331837 countsByYear W28873318372021 @default.
- W2887331837 countsByYear W28873318372022 @default.
- W2887331837 countsByYear W28873318372023 @default.
- W2887331837 crossrefType "proceedings-article" @default.
- W2887331837 hasAuthorship W2887331837A5039335591 @default.
- W2887331837 hasAuthorship W2887331837A5056497188 @default.
- W2887331837 hasAuthorship W2887331837A5061838490 @default.
- W2887331837 hasAuthorship W2887331837A5090962911 @default.
- W2887331837 hasBestOaLocation W28873318372 @default.
- W2887331837 hasConcept C113775141 @default.
- W2887331837 hasConcept C114614502 @default.
- W2887331837 hasConcept C120314980 @default.
- W2887331837 hasConcept C121332964 @default.
- W2887331837 hasConcept C159254197 @default.
- W2887331837 hasConcept C166957645 @default.
- W2887331837 hasConcept C173608175 @default.
- W2887331837 hasConcept C21821499 @default.
- W2887331837 hasConcept C24890656 @default.
- W2887331837 hasConcept C2781204021 @default.
- W2887331837 hasConcept C33923547 @default.
- W2887331837 hasConcept C41008148 @default.
- W2887331837 hasConcept C459310 @default.
- W2887331837 hasConcept C47487241 @default.
- W2887331837 hasConcept C62520636 @default.
- W2887331837 hasConcept C68339613 @default.
- W2887331837 hasConcept C77088390 @default.
- W2887331837 hasConcept C79581498 @default.
- W2887331837 hasConcept C95457728 @default.
- W2887331837 hasConceptScore W2887331837C113775141 @default.
- W2887331837 hasConceptScore W2887331837C114614502 @default.
- W2887331837 hasConceptScore W2887331837C120314980 @default.
- W2887331837 hasConceptScore W2887331837C121332964 @default.
- W2887331837 hasConceptScore W2887331837C159254197 @default.
- W2887331837 hasConceptScore W2887331837C166957645 @default.
- W2887331837 hasConceptScore W2887331837C173608175 @default.
- W2887331837 hasConceptScore W2887331837C21821499 @default.
- W2887331837 hasConceptScore W2887331837C24890656 @default.
- W2887331837 hasConceptScore W2887331837C2781204021 @default.
- W2887331837 hasConceptScore W2887331837C33923547 @default.
- W2887331837 hasConceptScore W2887331837C41008148 @default.
- W2887331837 hasConceptScore W2887331837C459310 @default.
- W2887331837 hasConceptScore W2887331837C47487241 @default.
- W2887331837 hasConceptScore W2887331837C62520636 @default.
- W2887331837 hasConceptScore W2887331837C68339613 @default.
- W2887331837 hasConceptScore W2887331837C77088390 @default.
- W2887331837 hasConceptScore W2887331837C79581498 @default.