Matches in SemOpenAlex for { <https://semopenalex.org/work/W4224223904> ?p ?o ?g. }
Showing items 1 to 81 of 81 with 100 items per page.
- W4224223904 endingPage "359" @default.
- W4224223904 startingPage "341" @default.
- W4224223904 abstract "Providing convenient APIs and notations for data parallelism which remain accessible for programmers while still providing good performance has been a long-term goal of researchers as well as language and library designers. C++20 introduces ranges and views, as well as the composition of operations on them using a concise syntax, but the efficient implementation of these library features is restricted to CPUs. We present the Celerity High-level API, which makes similarly concise mechanisms applicable to GPUs and accelerators, and even distributed memory clusters of GPUs. Crucially, we achieve this very high level of abstraction without a significant negative impact on performance compared to a lower-level implementation, and without introducing any non-standard toolchain components or compilers, by implementing a C++ library infrastructure on top of the Celerity system. This is made possible by two central API design and implementation strategies, which form the core of our contribution. Firstly, gathering as much information as possible at compile-time and using metaprogramming techniques to automatically fuse several distinctly formulated processing steps into a single accelerator kernel invocation. And secondly, leveraging C++20 “Concepts” in order to avoid type erasure, allowing for highly efficient code generation. We have evaluated our approach quantitatively in a comparison to lower-level manual implementations of several benchmarks, demonstrating its low overhead. Additionally, we investigated the individual performance impact of our specific optimizations and design choices, illustrating the advantages afforded by a Concepts-based approach." @default.
- W4224223904 created "2022-04-26" @default.
- W4224223904 creator A5000119692 @default.
- W4224223904 creator A5011445440 @default.
- W4224223904 creator A5037323768 @default.
- W4224223904 creator A5043309914 @default.
- W4224223904 date "2022-04-22" @default.
- W4224223904 modified "2023-10-18" @default.
- W4224223904 title "The Celerity High-level API: C++20 for Accelerator Clusters" @default.
- W4224223904 cites W1520071471 @default.
- W4224223904 cites W1981450409 @default.
- W4224223904 cites W1988888548 @default.
- W4224223904 cites W1990090459 @default.
- W4224223904 cites W1998284499 @default.
- W4224223904 cites W2038850241 @default.
- W4224223904 cites W2046430708 @default.
- W4224223904 cites W2090409324 @default.
- W4224223904 cites W2114192207 @default.
- W4224223904 cites W2142677441 @default.
- W4224223904 cites W2168732909 @default.
- W4224223904 cites W22188748 @default.
- W4224223904 cites W2506485539 @default.
- W4224223904 cites W2613829240 @default.
- W4224223904 cites W2878447367 @default.
- W4224223904 cites W2897298975 @default.
- W4224223904 cites W2968104784 @default.
- W4224223904 cites W3018185245 @default.
- W4224223904 cites W3099775292 @default.
- W4224223904 cites W3187688612 @default.
- W4224223904 cites W4205190765 @default.
- W4224223904 cites W4241553019 @default.
- W4224223904 doi "https://doi.org/10.1007/s10766-022-00731-8" @default.
- W4224223904 hasPublicationYear "2022" @default.
- W4224223904 type Work @default.
- W4224223904 citedByCount "3" @default.
- W4224223904 countsByYear W42242239042022 @default.
- W4224223904 countsByYear W42242239042023 @default.
- W4224223904 crossrefType "journal-article" @default.
- W4224223904 hasAuthorship W4224223904A5000119692 @default.
- W4224223904 hasAuthorship W4224223904A5011445440 @default.
- W4224223904 hasAuthorship W4224223904A5037323768 @default.
- W4224223904 hasAuthorship W4224223904A5043309914 @default.
- W4224223904 hasBestOaLocation W42242239041 @default.
- W4224223904 hasConcept C169590947 @default.
- W4224223904 hasConcept C173608175 @default.
- W4224223904 hasConcept C199360897 @default.
- W4224223904 hasConcept C2777062904 @default.
- W4224223904 hasConcept C2777904410 @default.
- W4224223904 hasConcept C2779960059 @default.
- W4224223904 hasConcept C35390924 @default.
- W4224223904 hasConcept C41008148 @default.
- W4224223904 hasConcept C76782552 @default.
- W4224223904 hasConceptScore W4224223904C169590947 @default.
- W4224223904 hasConceptScore W4224223904C173608175 @default.
- W4224223904 hasConceptScore W4224223904C199360897 @default.
- W4224223904 hasConceptScore W4224223904C2777062904 @default.
- W4224223904 hasConceptScore W4224223904C2777904410 @default.
- W4224223904 hasConceptScore W4224223904C2779960059 @default.
- W4224223904 hasConceptScore W4224223904C35390924 @default.
- W4224223904 hasConceptScore W4224223904C41008148 @default.
- W4224223904 hasConceptScore W4224223904C76782552 @default.
- W4224223904 hasIssue "3-4" @default.
- W4224223904 hasLocation W42242239041 @default.
- W4224223904 hasOpenAccess W4224223904 @default.
- W4224223904 hasPrimaryLocation W42242239041 @default.
- W4224223904 hasRelatedWork W1497385637 @default.
- W4224223904 hasRelatedWork W1583465708 @default.
- W4224223904 hasRelatedWork W2088006200 @default.
- W4224223904 hasRelatedWork W2192862863 @default.
- W4224223904 hasRelatedWork W2369288331 @default.
- W4224223904 hasRelatedWork W3162240892 @default.
- W4224223904 hasRelatedWork W4235530921 @default.
- W4224223904 hasRelatedWork W4248277950 @default.
- W4224223904 hasRelatedWork W4296694382 @default.
- W4224223904 hasRelatedWork W2479014312 @default.
- W4224223904 hasVolume "50" @default.
- W4224223904 isParatext "false" @default.
- W4224223904 isRetracted "false" @default.
- W4224223904 workType "article" @default.