Matches in SemOpenAlex for { <https://semopenalex.org/work/W4386033380> ?p ?o ?g. }
Showing items 1 to 94 of 94 with 100 items per page.
- W4386033380 abstract "This article evaluates the current support for heterogeneous OpenMP 5.2 applications regarding the simultaneous activation of host and device computing units (e.g., CPUs, GPUs, or FPGAs). The article identifies limitations in the current OpenMP specification and describes the design and implementation of novel OpenMP extensions and runtime support for heterogeneous parallel programming. The Compute Unit (CU) abstraction is introduced in the OpenMP programming model. The Compute Unit abstraction is defined in terms of an aggregation of computing elements (e.g., CPUs, GPUs, FPGAs). On top of CUs, the article describes dynamic work sharing constructs and schedulers that address the inherent differences in compute power of host and device CUs. New constructs and the corresponding runtime support are described for the new abstractions. The article evaluates the case of a hybrid multilevel parallelization of the NPB‐MZ benchmark suite. The implementation exploits both coarse‐grain and fine‐grain parallelism, mapped to CUs of different nature (GPUs and CPUs). All CUs are activated using the new extensions and runtime support. We compare hybrid and nonhybrid executions under two state‐of‐the‐art work‐distribution schemes (Static and Dynamic Task schedulers). On a computing node composed of one AMD EPYC 7742 @ 2.25 GHz (64 cores and 2 threads/core, totalling 128 threads per node) and two AMD Radeon Instinct MI50 GPUs with 32GB, hybrid executions present speedups from 1.08 up to 3.18 with respect to a nonhybrid GPU implementation, depending on the number of activated CUs." @default.
- W4386033380 created "2023-08-22" @default.
- W4386033380 creator A5000717476 @default.
- W4386033380 creator A5068543588 @default.
- W4386033380 date "2023-08-17" @default.
- W4386033380 modified "2023-09-25" @default.
- W4386033380 title "Compute units in OpenMP: Extensions for heterogeneous parallel programming" @default.
- W4386033380 cites W153259801 @default.
- W4386033380 cites W1543313205 @default.
- W4386033380 cites W1720285298 @default.
- W4386033380 cites W1893534197 @default.
- W4386033380 cites W2024639384 @default.
- W4386033380 cites W2031906600 @default.
- W4386033380 cites W2093883529 @default.
- W4386033380 cites W2097686029 @default.
- W4386033380 cites W2112121929 @default.
- W4386033380 cites W2119010809 @default.
- W4386033380 cites W2122653654 @default.
- W4386033380 cites W2131836561 @default.
- W4386033380 cites W2142421493 @default.
- W4386033380 cites W2144740242 @default.
- W4386033380 cites W2171473263 @default.
- W4386033380 cites W2769126660 @default.
- W4386033380 cites W2918924644 @default.
- W4386033380 cites W2935041335 @default.
- W4386033380 cites W2969858646 @default.
- W4386033380 cites W2999733275 @default.
- W4386033380 cites W3003944395 @default.
- W4386033380 cites W3082942023 @default.
- W4386033380 cites W3167401864 @default.
- W4386033380 cites W518915 @default.
- W4386033380 doi "https://doi.org/10.1002/cpe.7885" @default.
- W4386033380 hasPublicationYear "2023" @default.
- W4386033380 type Work @default.
- W4386033380 citedByCount "0" @default.
- W4386033380 crossrefType "journal-article" @default.
- W4386033380 hasAuthorship W4386033380A5000717476 @default.
- W4386033380 hasAuthorship W4386033380A5068543588 @default.
- W4386033380 hasBestOaLocation W43860333801 @default.
- W4386033380 hasConcept C111472728 @default.
- W4386033380 hasConcept C124304363 @default.
- W4386033380 hasConcept C126831891 @default.
- W4386033380 hasConcept C127413603 @default.
- W4386033380 hasConcept C13280743 @default.
- W4386033380 hasConcept C138885662 @default.
- W4386033380 hasConcept C165696696 @default.
- W4386033380 hasConcept C173608175 @default.
- W4386033380 hasConcept C185798385 @default.
- W4386033380 hasConcept C18903297 @default.
- W4386033380 hasConcept C205649164 @default.
- W4386033380 hasConcept C2778119891 @default.
- W4386033380 hasConcept C2780870223 @default.
- W4386033380 hasConcept C38652104 @default.
- W4386033380 hasConcept C41008148 @default.
- W4386033380 hasConcept C62611344 @default.
- W4386033380 hasConcept C66938386 @default.
- W4386033380 hasConcept C78766204 @default.
- W4386033380 hasConcept C86803240 @default.
- W4386033380 hasConceptScore W4386033380C111472728 @default.
- W4386033380 hasConceptScore W4386033380C124304363 @default.
- W4386033380 hasConceptScore W4386033380C126831891 @default.
- W4386033380 hasConceptScore W4386033380C127413603 @default.
- W4386033380 hasConceptScore W4386033380C13280743 @default.
- W4386033380 hasConceptScore W4386033380C138885662 @default.
- W4386033380 hasConceptScore W4386033380C165696696 @default.
- W4386033380 hasConceptScore W4386033380C173608175 @default.
- W4386033380 hasConceptScore W4386033380C185798385 @default.
- W4386033380 hasConceptScore W4386033380C18903297 @default.
- W4386033380 hasConceptScore W4386033380C205649164 @default.
- W4386033380 hasConceptScore W4386033380C2778119891 @default.
- W4386033380 hasConceptScore W4386033380C2780870223 @default.
- W4386033380 hasConceptScore W4386033380C38652104 @default.
- W4386033380 hasConceptScore W4386033380C41008148 @default.
- W4386033380 hasConceptScore W4386033380C62611344 @default.
- W4386033380 hasConceptScore W4386033380C66938386 @default.
- W4386033380 hasConceptScore W4386033380C78766204 @default.
- W4386033380 hasConceptScore W4386033380C86803240 @default.
- W4386033380 hasFunder F4320322930 @default.
- W4386033380 hasLocation W43860333801 @default.
- W4386033380 hasOpenAccess W4386033380 @default.
- W4386033380 hasPrimaryLocation W43860333801 @default.
- W4386033380 hasRelatedWork W1591147808 @default.
- W4386033380 hasRelatedWork W164750744 @default.
- W4386033380 hasRelatedWork W2023938924 @default.
- W4386033380 hasRelatedWork W2070468128 @default.
- W4386033380 hasRelatedWork W2161462353 @default.
- W4386033380 hasRelatedWork W2170268965 @default.
- W4386033380 hasRelatedWork W2323476605 @default.
- W4386033380 hasRelatedWork W2488897859 @default.
- W4386033380 hasRelatedWork W320786 @default.
- W4386033380 hasRelatedWork W3213381848 @default.
- W4386033380 isParatext "false" @default.
- W4386033380 isRetracted "false" @default.
- W4386033380 workType "article" @default.