Matches in SemOpenAlex for { <https://semopenalex.org/work/W2885049097> ?p ?o ?g. }
Showing items 1 to 78 of
78
with 100 items per page.
- W2885049097 abstract "An existing hybrid MPI-OpenMP scheme is augmented with a CUDA-based fine grain parallelization approach for multidimensional distributed Fourier transforms, in a well-characterized pseudospectral fluid turbulence code. Basics of the hybrid scheme are reviewed, and heuristics provided to show a potential benefit of the CUDA implementation. The method draws heavily on the CUDA runtime library to handle memory management, and on the cuFFT library for computing local FFTs. The manner in which the interfaces are constructed to these libraries, and ISO bindings utilized to facilitate platform portability, are discussed. CUDA streams are implemented to overlap data transfer with cuFFT computation. Testing with a baseline solver demonstrates significant aggregate speed-up over the hybrid MPI-OpenMP solver by offloading to GPUs on an NVLink-based test system. While the batch streamed approach provides little benefit with NVLink, we see a performance gain of 30% when tuned for the optimal number of streams on a PCIe-based system. It is found that strong GPU scaling is ideal, or slightly better than ideal, in all cases. In addition to speed-up measurements for the fiducial solver, we also consider several other solvers with different numbers of transform operations and find that aggregate speed-ups are nearly constant for all solvers." @default.
- W2885049097 created "2018-08-22" @default.
- W2885049097 creator A5002367451 @default.
- W2885049097 creator A5007082915 @default.
- W2885049097 creator A5011958615 @default.
- W2885049097 creator A5080617716 @default.
- W2885049097 date "2018-08-03" @default.
- W2885049097 modified "2023-09-27" @default.
- W2885049097 title "GPU parallelization of a hybrid pseudospectral fluid turbulence framework using CUDA" @default.
- W2885049097 cites W1577287242 @default.
- W2885049097 cites W2026689794 @default.
- W2885049097 cites W2062723280 @default.
- W2885049097 cites W2073446627 @default.
- W2885049097 cites W2083108197 @default.
- W2885049097 cites W2102182691 @default.
- W2885049097 cites W2129152507 @default.
- W2885049097 cites W2131778920 @default.
- W2885049097 cites W2156105398 @default.
- W2885049097 cites W2249326244 @default.
- W2885049097 cites W2337481013 @default.
- W2885049097 cites W2559823574 @default.
- W2885049097 cites W8849792 @default.
- W2885049097 hasPublicationYear "2018" @default.
- W2885049097 type Work @default.
- W2885049097 sameAs 2885049097 @default.
- W2885049097 citedByCount "0" @default.
- W2885049097 crossrefType "posted-content" @default.
- W2885049097 hasAuthorship W2885049097A5002367451 @default.
- W2885049097 hasAuthorship W2885049097A5007082915 @default.
- W2885049097 hasAuthorship W2885049097A5011958615 @default.
- W2885049097 hasAuthorship W2885049097A5080617716 @default.
- W2885049097 hasConcept C11413529 @default.
- W2885049097 hasConcept C173608175 @default.
- W2885049097 hasConcept C199360897 @default.
- W2885049097 hasConcept C2778119891 @default.
- W2885049097 hasConcept C2778770139 @default.
- W2885049097 hasConcept C41008148 @default.
- W2885049097 hasConcept C459310 @default.
- W2885049097 hasConcept C63000827 @default.
- W2885049097 hasConcept C68339613 @default.
- W2885049097 hasConcept C75172450 @default.
- W2885049097 hasConceptScore W2885049097C11413529 @default.
- W2885049097 hasConceptScore W2885049097C173608175 @default.
- W2885049097 hasConceptScore W2885049097C199360897 @default.
- W2885049097 hasConceptScore W2885049097C2778119891 @default.
- W2885049097 hasConceptScore W2885049097C2778770139 @default.
- W2885049097 hasConceptScore W2885049097C41008148 @default.
- W2885049097 hasConceptScore W2885049097C459310 @default.
- W2885049097 hasConceptScore W2885049097C63000827 @default.
- W2885049097 hasConceptScore W2885049097C68339613 @default.
- W2885049097 hasConceptScore W2885049097C75172450 @default.
- W2885049097 hasLocation W28850490971 @default.
- W2885049097 hasOpenAccess W2885049097 @default.
- W2885049097 hasPrimaryLocation W28850490971 @default.
- W2885049097 hasRelatedWork W1189940818 @default.
- W2885049097 hasRelatedWork W1434889832 @default.
- W2885049097 hasRelatedWork W1445273741 @default.
- W2885049097 hasRelatedWork W1559718310 @default.
- W2885049097 hasRelatedWork W1885254713 @default.
- W2885049097 hasRelatedWork W1974622208 @default.
- W2885049097 hasRelatedWork W2012928711 @default.
- W2885049097 hasRelatedWork W2018237354 @default.
- W2885049097 hasRelatedWork W2052356157 @default.
- W2885049097 hasRelatedWork W2067902980 @default.
- W2885049097 hasRelatedWork W2152447737 @default.
- W2885049097 hasRelatedWork W2348121177 @default.
- W2885049097 hasRelatedWork W2808583214 @default.
- W2885049097 hasRelatedWork W2810140143 @default.
- W2885049097 hasRelatedWork W2952903795 @default.
- W2885049097 hasRelatedWork W2978242189 @default.
- W2885049097 hasRelatedWork W2997275534 @default.
- W2885049097 hasRelatedWork W3000104217 @default.
- W2885049097 hasRelatedWork W3095350088 @default.
- W2885049097 hasRelatedWork W3206983896 @default.
- W2885049097 isParatext "false" @default.
- W2885049097 isRetracted "false" @default.
- W2885049097 magId "2885049097" @default.
- W2885049097 workType "article" @default.