Matches in SemOpenAlex for { <https://semopenalex.org/work/W4367147522> ?p ?o ?g. }
- W4367147522 abstract "Programming applications on heterogeneous systems with hardware accelerators is challenging due to the disjoint address spaces between the host (CPU) and the device (GPU). The limited device memory further exacerbates the challenges as most data-intensive applications will not fit in the limited device memory. CUDA Unified Memory (UM) was introduced to mitigate such challenges. UM improves GPU programmability by supporting oversubscription, on-demand paging, and migration. However, when the working set of an application exceeds the device memory capacity, the resulting data movement can cause significant performance losses. We propose a tiling-based task-parallel framework, named DeepSparseGPU, to accelerate sparse eigensolvers on GPUs by minimizing data movement between the host and device. To this end, we tile all operations in a sparse solver and express the entire computation as a directed acyclic graph (DAG). We design and develop a memory manager (MM) to execute larger inputs that do not fit into GPU memory. MM keeps track of the data on CPU and GPU, and automatically moves data between them as needed. We use OpenMP target offload in our implementation to achieve portability beyond NVIDIA hardware. Performance evaluations show that DeepSparseGPU transfers 1.39x-2.18x less host to device (H2D) and device to host (D2H) data, while executing up to 2.93x faster than the UM-based baseline version." @default.
- W4367147522 created "2023-04-28" @default.
- W4367147522 creator A5026046737 @default.
- W4367147522 creator A5057965288 @default.
- W4367147522 creator A5072718512 @default.
- W4367147522 creator A5077383014 @default.
- W4367147522 date "2022-12-01" @default.
- W4367147522 modified "2023-09-26" @default.
- W4367147522 title "A Portable Sparse Solver Framework for Large Matrices on Heterogeneous Architectures" @default.
- W4367147522 doi "https://doi.org/10.1109/hipc56025.2022.00030" @default.
- W4367147522 hasPublicationYear "2022" @default.
- W4367147522 type Work @default.
- W4367147522 citedByCount "0" @default.
- W4367147522 crossrefType "proceedings-article" @default.
- W4367147522 hasAuthorship W4367147522A5026046737 @default.
- W4367147522 hasAuthorship W4367147522A5057965288 @default.
- W4367147522 hasAuthorship W4367147522A5072718512 @default.
- W4367147522 hasAuthorship W4367147522A5077383014 @default.
- W4367147522 hasConcept C111919701 @default.
- W4367147522 hasConcept C121332964 @default.
- W4367147522 hasConcept C126831891 @default.
- W4367147522 hasConcept C149635348 @default.
- W4367147522 hasConcept C163716315 @default.
- W4367147522 hasConcept C173608175 @default.
- W4367147522 hasConcept C18903297 @default.
- W4367147522 hasConcept C199360897 @default.
- W4367147522 hasConcept C2778119891 @default.
- W4367147522 hasConcept C2778770139 @default.
- W4367147522 hasConcept C41008148 @default.
- W4367147522 hasConcept C50954386 @default.
- W4367147522 hasConcept C56372850 @default.
- W4367147522 hasConcept C62520636 @default.
- W4367147522 hasConcept C63000827 @default.
- W4367147522 hasConcept C86803240 @default.
- W4367147522 hasConceptScore W4367147522C111919701 @default.
- W4367147522 hasConceptScore W4367147522C121332964 @default.
- W4367147522 hasConceptScore W4367147522C126831891 @default.
- W4367147522 hasConceptScore W4367147522C149635348 @default.
- W4367147522 hasConceptScore W4367147522C163716315 @default.
- W4367147522 hasConceptScore W4367147522C173608175 @default.
- W4367147522 hasConceptScore W4367147522C18903297 @default.
- W4367147522 hasConceptScore W4367147522C199360897 @default.
- W4367147522 hasConceptScore W4367147522C2778119891 @default.
- W4367147522 hasConceptScore W4367147522C2778770139 @default.
- W4367147522 hasConceptScore W4367147522C41008148 @default.
- W4367147522 hasConceptScore W4367147522C50954386 @default.
- W4367147522 hasConceptScore W4367147522C56372850 @default.
- W4367147522 hasConceptScore W4367147522C62520636 @default.
- W4367147522 hasConceptScore W4367147522C63000827 @default.
- W4367147522 hasConceptScore W4367147522C86803240 @default.
- W4367147522 hasFunder F4320306084 @default.
- W4367147522 hasFunder F4320332359 @default.
- W4367147522 hasLocation W43671475221 @default.
- W4367147522 hasOpenAccess W4367147522 @default.
- W4367147522 hasPrimaryLocation W43671475221 @default.
- W4367147522 hasRelatedWork W1832263773 @default.
- W4367147522 hasRelatedWork W2006005151 @default.
- W4367147522 hasRelatedWork W2167567393 @default.
- W4367147522 hasRelatedWork W2343404593 @default.
- W4367147522 hasRelatedWork W2730282969 @default.
- W4367147522 hasRelatedWork W2892892150 @default.
- W4367147522 hasRelatedWork W2903351638 @default.
- W4367147522 hasRelatedWork W3019218318 @default.
- W4367147522 hasRelatedWork W46965433 @default.
- W4367147522 hasRelatedWork W2559346022 @default.
- W4367147522 isParatext "false" @default.
- W4367147522 isRetracted "false" @default.
- W4367147522 workType "article" @default.