Matches in SemOpenAlex for { <https://semopenalex.org/work/W2037720554> ?p ?o ?g. }
Showing items 1 to 100 of
100
with 100 items per page.
- W2037720554 endingPage "549" @default.
- W2037720554 startingPage "537" @default.
- W2037720554 abstract "A GPU implementation of the discontinuous Galerkin lattice-Boltzmann method with square spectral elements, and highly optimised for speed and precision of calculations is presented. An extensive analysis of the numerous variants of the fluid solver unveils that best performance is obtained by maximising CUDA kernel fusion and by arranging the resulting kernel tasks so as to trigger memory coherent and scattered loads in a specific manner, albeit at the cost of introducing cross-thread load unbalancing. Surprisingly, any attempt to vanish this, to maximise thread occupancy and to adopt conventional work tiling or distinct custom kernels highly tuned via ad hoc data and computation layouts invariably deteriorate performance. As such, this work sheds light into the possibility to hide fetch latencies of workloads involving heterogeneous loads in a way that is more effective than what is achieved with frequently suggested techniques. When simulating the lid-driven cavity on a NVIDIA GeForce GTX 480 via a 5-stage 4th-order Runge–Kutta (RK) scheme, the first four digits of the obtained centreline velocity values, or more, converge to those of the state-of-the-art literature data at a simulation speed of 7.0G primitive variable updates per second during the collision stage and 4.4G ones during each RK step of the advection by employing double-precision arithmetic (DPA) and a computational grid of 642 4×4-point elements only. The new programming engine leads to about 2× performance w.r.t. the best programming guidelines in the field. The new fluid solver on the above GPU is also 20–30 times faster than a highly optimised version running on a single core of a Intel Xeon X5650 2.66 GHz." @default.
- W2037720554 created "2016-06-24" @default.
- W2037720554 creator A5057425464 @default.
- W2037720554 date "2013-03-01" @default.
- W2037720554 modified "2023-09-27" @default.
- W2037720554 title "Fast discontinuous Galerkin lattice-Boltzmann simulations on GPUs via maximal kernel fusion" @default.
- W2037720554 cites W1583515859 @default.
- W2037720554 cites W1966345649 @default.
- W2037720554 cites W1971144721 @default.
- W2037720554 cites W1993320379 @default.
- W2037720554 cites W2010386705 @default.
- W2037720554 cites W2026689794 @default.
- W2037720554 cites W2049875313 @default.
- W2037720554 cites W2055844022 @default.
- W2037720554 cites W2061728722 @default.
- W2037720554 cites W2062636826 @default.
- W2037720554 cites W2065080695 @default.
- W2037720554 cites W2066509732 @default.
- W2037720554 cites W2078961060 @default.
- W2037720554 cites W2082844573 @default.
- W2037720554 cites W2096661534 @default.
- W2037720554 cites W2120919211 @default.
- W2037720554 cites W2140640428 @default.
- W2037720554 cites W2156652224 @default.
- W2037720554 cites W2165831368 @default.
- W2037720554 cites W2315982789 @default.
- W2037720554 cites W2333521678 @default.
- W2037720554 cites W2931102813 @default.
- W2037720554 doi "https://doi.org/10.1016/j.cpc.2012.10.005" @default.
- W2037720554 hasPublicationYear "2013" @default.
- W2037720554 type Work @default.
- W2037720554 sameAs 2037720554 @default.
- W2037720554 citedByCount "2" @default.
- W2037720554 countsByYear W20377205542016 @default.
- W2037720554 countsByYear W20377205542019 @default.
- W2037720554 crossrefType "journal-article" @default.
- W2037720554 hasAuthorship W2037720554A5057425464 @default.
- W2037720554 hasConcept C111919701 @default.
- W2037720554 hasConcept C11413529 @default.
- W2037720554 hasConcept C114614502 @default.
- W2037720554 hasConcept C121332964 @default.
- W2037720554 hasConcept C138101251 @default.
- W2037720554 hasConcept C173608175 @default.
- W2037720554 hasConcept C187691185 @default.
- W2037720554 hasConcept C188045654 @default.
- W2037720554 hasConcept C199360897 @default.
- W2037720554 hasConcept C21821499 @default.
- W2037720554 hasConcept C2524010 @default.
- W2037720554 hasConcept C2778119891 @default.
- W2037720554 hasConcept C2778770139 @default.
- W2037720554 hasConcept C33923547 @default.
- W2037720554 hasConcept C41008148 @default.
- W2037720554 hasConcept C45374587 @default.
- W2037720554 hasConcept C459310 @default.
- W2037720554 hasConcept C57879066 @default.
- W2037720554 hasConcept C68339613 @default.
- W2037720554 hasConcept C74193536 @default.
- W2037720554 hasConcept C83283714 @default.
- W2037720554 hasConceptScore W2037720554C111919701 @default.
- W2037720554 hasConceptScore W2037720554C11413529 @default.
- W2037720554 hasConceptScore W2037720554C114614502 @default.
- W2037720554 hasConceptScore W2037720554C121332964 @default.
- W2037720554 hasConceptScore W2037720554C138101251 @default.
- W2037720554 hasConceptScore W2037720554C173608175 @default.
- W2037720554 hasConceptScore W2037720554C187691185 @default.
- W2037720554 hasConceptScore W2037720554C188045654 @default.
- W2037720554 hasConceptScore W2037720554C199360897 @default.
- W2037720554 hasConceptScore W2037720554C21821499 @default.
- W2037720554 hasConceptScore W2037720554C2524010 @default.
- W2037720554 hasConceptScore W2037720554C2778119891 @default.
- W2037720554 hasConceptScore W2037720554C2778770139 @default.
- W2037720554 hasConceptScore W2037720554C33923547 @default.
- W2037720554 hasConceptScore W2037720554C41008148 @default.
- W2037720554 hasConceptScore W2037720554C45374587 @default.
- W2037720554 hasConceptScore W2037720554C459310 @default.
- W2037720554 hasConceptScore W2037720554C57879066 @default.
- W2037720554 hasConceptScore W2037720554C68339613 @default.
- W2037720554 hasConceptScore W2037720554C74193536 @default.
- W2037720554 hasConceptScore W2037720554C83283714 @default.
- W2037720554 hasIssue "3" @default.
- W2037720554 hasLocation W20377205541 @default.
- W2037720554 hasOpenAccess W2037720554 @default.
- W2037720554 hasPrimaryLocation W20377205541 @default.
- W2037720554 hasRelatedWork W2037720554 @default.
- W2037720554 hasRelatedWork W2043616378 @default.
- W2037720554 hasRelatedWork W2076361081 @default.
- W2037720554 hasRelatedWork W2116951845 @default.
- W2037720554 hasRelatedWork W2791534362 @default.
- W2037720554 hasRelatedWork W4310279396 @default.
- W2037720554 hasRelatedWork W4323043748 @default.
- W2037720554 hasRelatedWork W4365601148 @default.
- W2037720554 hasRelatedWork W2582864838 @default.
- W2037720554 hasRelatedWork W2599858257 @default.
- W2037720554 hasVolume "184" @default.
- W2037720554 isParatext "false" @default.
- W2037720554 isRetracted "false" @default.
- W2037720554 magId "2037720554" @default.
- W2037720554 workType "article" @default.