Matches in SemOpenAlex for { <https://semopenalex.org/work/W2171473263> ?p ?o ?g. }
Showing items 1 to 93 of
93
with 100 items per page.
- W2171473263 abstract "General-Purpose computing on Graphics Processing Units (GPGPU) is becoming popular in HPC because of its high peak performance. However, in spite of the potential performance improvements as well as recent promising results in scientific computing applications, its real performance is not necessarily higher than that of the current high-performance CPUs, especially with recent trends towards increasing the number of cores on a single die. This is because the GPU performance can be severely limited by such restrictions as memory size and bandwidth and programming using graphics-specific APIs. To overcome this problem, we propose a model-based, adaptive library for 2D FFT that automatically achieves optimal performance using available heterogeneous CPU-GPU computing resources. To find optimal load distribution ratios between CPUs and GPUs, we construct a performance model that captures the respective contributions of CPU vs. GPU, and predicts the total execution time of 2D-FFT for arbitrary problem sizes and load distribution. The performance model divides the FFT computation into several small sub steps, and predicts the execution time of each step using profiling results. Preliminary evaluation with our prototype shows that the performance model can predict the execution time of problem sizes that are 16 times as large as the profile runs with less than 20% error, and that the predicted optimal load distribution ratios have less than 1% error. We show that the resulting performance improvement using both CPUs and GPUs can be as high as 50% compared to using either a CPU core or a GPU." @default.
- W2171473263 created "2016-06-24" @default.
- W2171473263 creator A5011254074 @default.
- W2171473263 creator A5017240065 @default.
- W2171473263 creator A5035025604 @default.
- W2171473263 creator A5063607235 @default.
- W2171473263 date "2008-04-01" @default.
- W2171473263 modified "2023-09-30" @default.
- W2171473263 title "An efficient, model-based CPU-GPU heterogeneous FFT library" @default.
- W2171473263 cites W2032309817 @default.
- W2171473263 cites W2079524266 @default.
- W2171473263 cites W2102182691 @default.
- W2171473263 cites W2136834900 @default.
- W2171473263 cites W2139578306 @default.
- W2171473263 cites W2150606860 @default.
- W2171473263 cites W3150025736 @default.
- W2171473263 doi "https://doi.org/10.1109/ipdps.2008.4536163" @default.
- W2171473263 hasPublicationYear "2008" @default.
- W2171473263 type Work @default.
- W2171473263 sameAs 2171473263 @default.
- W2171473263 citedByCount "35" @default.
- W2171473263 countsByYear W21714732632012 @default.
- W2171473263 countsByYear W21714732632013 @default.
- W2171473263 countsByYear W21714732632014 @default.
- W2171473263 countsByYear W21714732632015 @default.
- W2171473263 countsByYear W21714732632016 @default.
- W2171473263 countsByYear W21714732632017 @default.
- W2171473263 countsByYear W21714732632018 @default.
- W2171473263 countsByYear W21714732632021 @default.
- W2171473263 countsByYear W21714732632022 @default.
- W2171473263 countsByYear W21714732632023 @default.
- W2171473263 crossrefType "proceedings-article" @default.
- W2171473263 hasAuthorship W2171473263A5011254074 @default.
- W2171473263 hasAuthorship W2171473263A5017240065 @default.
- W2171473263 hasAuthorship W2171473263A5035025604 @default.
- W2171473263 hasAuthorship W2171473263A5063607235 @default.
- W2171473263 hasConcept C111919701 @default.
- W2171473263 hasConcept C11413529 @default.
- W2171473263 hasConcept C162324750 @default.
- W2171473263 hasConcept C173608175 @default.
- W2171473263 hasConcept C187191949 @default.
- W2171473263 hasConcept C188045654 @default.
- W2171473263 hasConcept C21442007 @default.
- W2171473263 hasConcept C21547014 @default.
- W2171473263 hasConcept C2778119891 @default.
- W2171473263 hasConcept C2778915421 @default.
- W2171473263 hasConcept C2779851693 @default.
- W2171473263 hasConcept C2781335571 @default.
- W2171473263 hasConcept C41008148 @default.
- W2171473263 hasConcept C45374587 @default.
- W2171473263 hasConcept C49154492 @default.
- W2171473263 hasConcept C50630238 @default.
- W2171473263 hasConcept C75172450 @default.
- W2171473263 hasConcept C78766204 @default.
- W2171473263 hasConcept C83283714 @default.
- W2171473263 hasConcept C9390403 @default.
- W2171473263 hasConceptScore W2171473263C111919701 @default.
- W2171473263 hasConceptScore W2171473263C11413529 @default.
- W2171473263 hasConceptScore W2171473263C162324750 @default.
- W2171473263 hasConceptScore W2171473263C173608175 @default.
- W2171473263 hasConceptScore W2171473263C187191949 @default.
- W2171473263 hasConceptScore W2171473263C188045654 @default.
- W2171473263 hasConceptScore W2171473263C21442007 @default.
- W2171473263 hasConceptScore W2171473263C21547014 @default.
- W2171473263 hasConceptScore W2171473263C2778119891 @default.
- W2171473263 hasConceptScore W2171473263C2778915421 @default.
- W2171473263 hasConceptScore W2171473263C2779851693 @default.
- W2171473263 hasConceptScore W2171473263C2781335571 @default.
- W2171473263 hasConceptScore W2171473263C41008148 @default.
- W2171473263 hasConceptScore W2171473263C45374587 @default.
- W2171473263 hasConceptScore W2171473263C49154492 @default.
- W2171473263 hasConceptScore W2171473263C50630238 @default.
- W2171473263 hasConceptScore W2171473263C75172450 @default.
- W2171473263 hasConceptScore W2171473263C78766204 @default.
- W2171473263 hasConceptScore W2171473263C83283714 @default.
- W2171473263 hasConceptScore W2171473263C9390403 @default.
- W2171473263 hasLocation W21714732631 @default.
- W2171473263 hasOpenAccess W2171473263 @default.
- W2171473263 hasPrimaryLocation W21714732631 @default.
- W2171473263 hasRelatedWork W1464113540 @default.
- W2171473263 hasRelatedWork W2008492897 @default.
- W2171473263 hasRelatedWork W2075046026 @default.
- W2171473263 hasRelatedWork W2126502368 @default.
- W2171473263 hasRelatedWork W2171473263 @default.
- W2171473263 hasRelatedWork W2191246539 @default.
- W2171473263 hasRelatedWork W2320652536 @default.
- W2171473263 hasRelatedWork W2379607069 @default.
- W2171473263 hasRelatedWork W3193974638 @default.
- W2171473263 hasRelatedWork W2719498961 @default.
- W2171473263 isParatext "false" @default.
- W2171473263 isRetracted "false" @default.
- W2171473263 magId "2171473263" @default.
- W2171473263 workType "article" @default.