Matches in SemOpenAlex for { <https://semopenalex.org/work/W4289693739> ?p ?o ?g. }
Showing items 1 to 71 of 71 with 100 items per page.
- W4289693739 abstract "Hardware accelerators such as Graphics Processing Units (GPUs), Intel Xeon Phi co-processors (PHIs), and Field-Programmable Gate Arrays (FPGAs) are now ubiquitous in extreme-scale high performance computing (HPC), cloud, and Big data platforms to facilitate execution of workloads that demand high energy efficiency. They present unique interfaces and programming models therefore posing several limitations, which must be addressed to facilitate execution of large workloads. There is no library providing a unifying interface that allows programmers to write reusable out-of-core implementations of their data-parallel kernels that can run efficiently on different mainstream accelerators such as GPUs, PHIs, and FPGAs. We address this shortage in this paper. We present a library called libhclooc, which provides a unifying interface facilitating out-of-core implementations for data parallel kernels on the three different mainstream accelerators (GPUs, Intel Xeon Phis, FPGAs). We implement out-of-core matrix-matrix multiplication (MMOOC) using the libhclooc API and demonstrate its superior performance over vendor implementations. We show that it suffers from a maximum overhead of 10%, 4%, and 8% (due to abstraction) compared to the state-of-the-art optimised implementations for Nvidia K40c GPU, Nvidia P100 PCIe GPU, and Intel Xeon Phi 3120P respectively. We also show that using libhclooc API reduces the number of lines of code (LOC) by 75% thereby drastically improving programmer productivity." @default.
- W4289693739 created "2022-08-04" @default.
- W4289693739 creator A5051613452 @default.
- W4289693739 creator A5078230040 @default.
- W4289693739 creator A5084068586 @default.
- W4289693739 creator A5091910946 @default.
- W4289693739 date "2018-08-15" @default.
- W4289693739 modified "2023-09-27" @default.
- W4289693739 title "libhclooc: Software Library Facilitating Out-of-core Implementations of Accelerator Kernels on Hybrid Computing Platforms" @default.
- W4289693739 doi "https://doi.org/10.48550/arxiv.1808.05056" @default.
- W4289693739 hasPublicationYear "2018" @default.
- W4289693739 type Work @default.
- W4289693739 citedByCount "0" @default.
- W4289693739 crossrefType "posted-content" @default.
- W4289693739 hasAuthorship W4289693739A5051613452 @default.
- W4289693739 hasAuthorship W4289693739A5078230040 @default.
- W4289693739 hasAuthorship W4289693739A5084068586 @default.
- W4289693739 hasAuthorship W4289693739A5091910946 @default.
- W4289693739 hasBestOaLocation W42896937391 @default.
- W4289693739 hasConcept C106251023 @default.
- W4289693739 hasConcept C111919701 @default.
- W4289693739 hasConcept C118524514 @default.
- W4289693739 hasConcept C145108525 @default.
- W4289693739 hasConcept C172430144 @default.
- W4289693739 hasConcept C173608175 @default.
- W4289693739 hasConcept C199360897 @default.
- W4289693739 hasConcept C26713055 @default.
- W4289693739 hasConcept C2777904410 @default.
- W4289693739 hasConcept C2778119891 @default.
- W4289693739 hasConcept C2778514511 @default.
- W4289693739 hasConcept C2779960059 @default.
- W4289693739 hasConcept C41008148 @default.
- W4289693739 hasConcept C42935608 @default.
- W4289693739 hasConcept C63000827 @default.
- W4289693739 hasConcept C64270927 @default.
- W4289693739 hasConcept C78766204 @default.
- W4289693739 hasConcept C96972482 @default.
- W4289693739 hasConceptScore W4289693739C106251023 @default.
- W4289693739 hasConceptScore W4289693739C111919701 @default.
- W4289693739 hasConceptScore W4289693739C118524514 @default.
- W4289693739 hasConceptScore W4289693739C145108525 @default.
- W4289693739 hasConceptScore W4289693739C172430144 @default.
- W4289693739 hasConceptScore W4289693739C173608175 @default.
- W4289693739 hasConceptScore W4289693739C199360897 @default.
- W4289693739 hasConceptScore W4289693739C26713055 @default.
- W4289693739 hasConceptScore W4289693739C2777904410 @default.
- W4289693739 hasConceptScore W4289693739C2778119891 @default.
- W4289693739 hasConceptScore W4289693739C2778514511 @default.
- W4289693739 hasConceptScore W4289693739C2779960059 @default.
- W4289693739 hasConceptScore W4289693739C41008148 @default.
- W4289693739 hasConceptScore W4289693739C42935608 @default.
- W4289693739 hasConceptScore W4289693739C63000827 @default.
- W4289693739 hasConceptScore W4289693739C64270927 @default.
- W4289693739 hasConceptScore W4289693739C78766204 @default.
- W4289693739 hasConceptScore W4289693739C96972482 @default.
- W4289693739 hasLocation W42896937391 @default.
- W4289693739 hasOpenAccess W4289693739 @default.
- W4289693739 hasPrimaryLocation W42896937391 @default.
- W4289693739 hasRelatedWork W1577137152 @default.
- W4289693739 hasRelatedWork W2133803493 @default.
- W4289693739 hasRelatedWork W2170268965 @default.
- W4289693739 hasRelatedWork W2259024840 @default.
- W4289693739 hasRelatedWork W2508073476 @default.
- W4289693739 hasRelatedWork W2562983784 @default.
- W4289693739 hasRelatedWork W2885904495 @default.
- W4289693739 hasRelatedWork W2896337188 @default.
- W4289693739 hasRelatedWork W3004176791 @default.
- W4289693739 hasRelatedWork W4318710947 @default.
- W4289693739 isParatext "false" @default.
- W4289693739 isRetracted "false" @default.
- W4289693739 workType "article" @default.
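
The abstract in the record above refers to out-of-core matrix-matrix multiplication (MMOOC), where the operands are too large to sit in accelerator memory at once. As a rough illustration of that general idea, the following minimal sketch processes the result in row panels sized to a memory budget. It is not the libhclooc API and not the authors' partitioning scheme: the function names, the `capacity` parameter (counted in matrix elements), and the choice to keep all of B resident are assumptions made for this example.

```python
# Illustrative sketch only: this is NOT the libhclooc API.
# Assumption: B fits in "device" memory and stays resident, while A and C
# are streamed through in row panels that fit in the remaining space.

def matmul(a, b):
    """Plain in-core product of two list-of-lists matrices."""
    inner = len(b)
    cols = len(b[0])
    return [[sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for row in a]


def rows_per_panel(m, k, n, capacity):
    """Largest row count h such that an h x k panel of A, all of B (k x n),
    and an h x n panel of C fit within `capacity` elements (hypothetical budget)."""
    free = capacity - k * n      # B occupies k * n elements for the whole run
    h = free // (k + n)          # each panel row costs k (A) + n (C) elements
    if h < 1:
        raise ValueError("capacity too small for even one row panel")
    return min(h, m)


def matmul_out_of_core(a, b, capacity):
    """Compute C = A * B one row panel at a time."""
    m, k, n = len(a), len(b), len(b[0])
    h = rows_per_panel(m, k, n, capacity)
    c = []
    for start in range(0, m, h):
        panel = a[start:start + h]   # stands in for copying a panel of A in
        c.extend(matmul(panel, b))   # compute, then "copy out" the C panel
    return c


if __name__ == "__main__":
    a = [[1, 2], [3, 4], [5, 6]]
    b = [[7, 8], [9, 10]]
    # With capacity 8, only one row of A and C fits next to B at a time.
    print(matmul_out_of_core(a, b, capacity=8))
```

A real implementation would additionally overlap transfers with computation and handle the case where B itself does not fit; the sketch leaves both out to keep the panel arithmetic visible.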