Matches in SemOpenAlex for { <https://semopenalex.org/work/W2899387068> ?p ?o ?g. }
- W2899387068 endingPage "64224" @default.
- W2899387068 startingPage "64202" @default.
- W2899387068 abstract "Fast Fourier transform (FFT) is a key routine employed in application domains such as molecular dynamics, computational fluid dynamics, signal processing, image processing, and condition monitoring systems. Its performance on modern multicore platforms is therefore of paramount concern to the high-performance computing community. The inherent complexities in these platforms such as severe resource contention and non-uniform memory access, however, pose formidable challenges. We study the performance profiles of multithreaded 2D FFTs provided in three highly optimized packages, FFTW-2.1.5, FFTW-3.3.7, and Intel Math Kernel Library (Intel MKL) FFT, on a modern Intel Haswell multicore processor consisting of 36 cores. We show that all three routines exhibit drastic performance variations, and hence, their average performances are considerably lower than their peak performances. The ratios of average-to-peak performance for the 2D FFT routines from the three packages are 40%, 30%, and 24%. We conclude that improving the average performance of 2D FFT on modern multicore processors by the removal of performance variations constitutes a tremendous research challenge. To address this challenge, we propose two novel optimization methods, PFFT-FPM and PFFT-FPM-PAD, specifically designed and implemented for 2D FFT. The methods employ model-based parallel computing using a load-imbalancing technique. They take as inputs the discrete 3D functions of the performance of the processors against problem size, compute 2D DFT of a complex signal matrix of size N × N using p abstract processors, and output the transformed signal matrix. Based on our experiments on a modern Intel Haswell multicore server consisting of 36 physical cores, the average and maximum speedups observed for PFFT-FPM using FFTW-3.3.7 are 1.9× and 6.8×, and the average and maximum speedups observed using Intel MKL FFT are 1.3× and 2×. The average and maximum speedups observed for PFFT-FPM-PAD using FFTW-3.3.7 are 2× and 9.4×, and the average and maximum speedups observed using Intel MKL FFT are 1.4× and 5.9×." @default.
- W2899387068 created "2018-11-09" @default.
- W2899387068 creator A5047980649 @default.
- W2899387068 creator A5078230040 @default.
- W2899387068 creator A5084068586 @default.
- W2899387068 date "2018-01-01" @default.
- W2899387068 modified "2023-10-18" @default.
- W2899387068 title "Performance Optimization of Multithreaded 2D Fast Fourier Transform on Multicore Processors Using Load Imbalancing Parallel Computing Method" @default.
- W2899387068 cites W1647609603 @default.
- W2899387068 cites W1810866258 @default.
- W2899387068 cites W1966416130 @default.
- W2899387068 cites W1967802285 @default.
- W2899387068 cites W1984005572 @default.
- W2899387068 cites W2003094427 @default.
- W2899387068 cites W2005659209 @default.
- W2899387068 cites W2028899550 @default.
- W2899387068 cites W2030129803 @default.
- W2899387068 cites W2042478655 @default.
- W2899387068 cites W2043591914 @default.
- W2899387068 cites W2072375667 @default.
- W2899387068 cites W2074570088 @default.
- W2899387068 cites W2097999418 @default.
- W2899387068 cites W2102624126 @default.
- W2899387068 cites W2111320011 @default.
- W2899387068 cites W2115400927 @default.
- W2899387068 cites W2116367043 @default.
- W2899387068 cites W2120119701 @default.
- W2899387068 cites W2122405433 @default.
- W2899387068 cites W2125980577 @default.
- W2899387068 cites W2137789117 @default.
- W2899387068 cites W2166417226 @default.
- W2899387068 cites W2272797324 @default.
- W2899387068 cites W2415800180 @default.
- W2899387068 cites W2508492666 @default.
- W2899387068 cites W2519627041 @default.
- W2899387068 cites W2562411185 @default.
- W2899387068 cites W2618868084 @default.
- W2899387068 cites W2801415324 @default.
- W2899387068 cites W3103952756 @default.
- W2899387068 cites W4244789640 @default.
- W2899387068 cites W4251173935 @default.
- W2899387068 cites W2011544816 @default.
- W2899387068 doi "https://doi.org/10.1109/access.2018.2878271" @default.
- W2899387068 hasPublicationYear "2018" @default.
- W2899387068 type Work @default.
- W2899387068 sameAs 2899387068 @default.
- W2899387068 citedByCount "6" @default.
- W2899387068 countsByYear W28993870682019 @default.
- W2899387068 countsByYear W28993870682020 @default.
- W2899387068 countsByYear W28993870682021 @default.
- W2899387068 countsByYear W28993870682022 @default.
- W2899387068 countsByYear W28993870682023 @default.
- W2899387068 crossrefType "journal-article" @default.
- W2899387068 hasAuthorship W2899387068A5047980649 @default.
- W2899387068 hasAuthorship W2899387068A5078230040 @default.
- W2899387068 hasAuthorship W2899387068A5084068586 @default.
- W2899387068 hasBestOaLocation W28993870681 @default.
- W2899387068 hasConcept C104267543 @default.
- W2899387068 hasConcept C106487976 @default.
- W2899387068 hasConcept C11413529 @default.
- W2899387068 hasConcept C114614502 @default.
- W2899387068 hasConcept C159985019 @default.
- W2899387068 hasConcept C162324750 @default.
- W2899387068 hasConcept C173608175 @default.
- W2899387068 hasConcept C192562407 @default.
- W2899387068 hasConcept C21547014 @default.
- W2899387068 hasConcept C2778915421 @default.
- W2899387068 hasConcept C33923547 @default.
- W2899387068 hasConcept C41008148 @default.
- W2899387068 hasConcept C459310 @default.
- W2899387068 hasConcept C74193536 @default.
- W2899387068 hasConcept C75172450 @default.
- W2899387068 hasConcept C78766204 @default.
- W2899387068 hasConcept C83283714 @default.
- W2899387068 hasConcept C84462506 @default.
- W2899387068 hasConcept C9390403 @default.
- W2899387068 hasConceptScore W2899387068C104267543 @default.
- W2899387068 hasConceptScore W2899387068C106487976 @default.
- W2899387068 hasConceptScore W2899387068C11413529 @default.
- W2899387068 hasConceptScore W2899387068C114614502 @default.
- W2899387068 hasConceptScore W2899387068C159985019 @default.
- W2899387068 hasConceptScore W2899387068C162324750 @default.
- W2899387068 hasConceptScore W2899387068C173608175 @default.
- W2899387068 hasConceptScore W2899387068C192562407 @default.
- W2899387068 hasConceptScore W2899387068C21547014 @default.
- W2899387068 hasConceptScore W2899387068C2778915421 @default.
- W2899387068 hasConceptScore W2899387068C33923547 @default.
- W2899387068 hasConceptScore W2899387068C41008148 @default.
- W2899387068 hasConceptScore W2899387068C459310 @default.
- W2899387068 hasConceptScore W2899387068C74193536 @default.
- W2899387068 hasConceptScore W2899387068C75172450 @default.
- W2899387068 hasConceptScore W2899387068C78766204 @default.
- W2899387068 hasConceptScore W2899387068C83283714 @default.
- W2899387068 hasConceptScore W2899387068C84462506 @default.
- W2899387068 hasConceptScore W2899387068C9390403 @default.
- W2899387068 hasFunder F4320320847 @default.
- W2899387068 hasLocation W28993870681 @default.
- W2899387068 hasLocation W28993870682 @default.