Matches in SemOpenAlex for { <https://semopenalex.org/work/W2022750656> ?p ?o ?g. }
Showing items 1 to 71 of
71
with 100 items per page.
- W2022750656 endingPage "72" @default.
- W2022750656 startingPage "66" @default.
- W2022750656 abstract "Branch predication is a program transformation technique that combines instructions of multiple branches of an if statement into a straight-line sequence and associates each instruction of the sequence with a predicate. The branch predication improves the execution of branch statements on processors that support predicated execution of instruction, e.g., Intel IA-64, because such transformation improves the instruction scheduling and might help cache performance. This paper proposes a novel software-based branch predication technique for GPU. The main motivation is that branch instructions can easily become a performance bottleneck for a GPU program because of the cost of branch instructions compared to ALU instructions and the possibility of low ALU utilization due to separation of ALU instructions within control flow blocks. Due to the SIMD nature and massive multi-threading architecture of the GPU, branching can be costly if more than one path is taken by a set of concurrent threads in a kernel. In this paper we reveal that branch predication can enable instruction packing, a VLIW-like GPU feature that is designed to increase the parallel execution of independent instructions, and can also decrease the number of control flow instructions thereby improving the performance of GPU kernels with both single and multiple branch paths. The key of our novel branch predication technique is a set of transformation rules that takes into consideration the specialties of the GPU architecture and implements software-based predicated execution of instruction on the GPU with little to no overhead. Furthermore, we identify architectural and program factors that affect the effectiveness of our technique and build a benefit analysis model for the transformation. The implementation of our technique on synthetic benchmarks and real-world application proves its effectiveness." @default.
- W2022750656 created "2016-06-24" @default.
- W2022750656 creator A5007820323 @default.
- W2022750656 creator A5084461414 @default.
- W2022750656 date "2010-09-14" @default.
- W2022750656 modified "2023-09-26" @default.
- W2022750656 title "Software-based branch predication for AMD GPUs" @default.
- W2022750656 cites W1992851788 @default.
- W2022750656 cites W2040701288 @default.
- W2022750656 cites W2089482246 @default.
- W2022750656 cites W2153492376 @default.
- W2022750656 cites W2163599246 @default.
- W2022750656 cites W2169810797 @default.
- W2022750656 cites W2295946175 @default.
- W2022750656 doi "https://doi.org/10.1145/1926367.1926379" @default.
- W2022750656 hasPublicationYear "2010" @default.
- W2022750656 type Work @default.
- W2022750656 sameAs 2022750656 @default.
- W2022750656 citedByCount "8" @default.
- W2022750656 countsByYear W20227506562013 @default.
- W2022750656 countsByYear W20227506562015 @default.
- W2022750656 countsByYear W20227506562016 @default.
- W2022750656 countsByYear W20227506562017 @default.
- W2022750656 crossrefType "journal-article" @default.
- W2022750656 hasAuthorship W2022750656A5007820323 @default.
- W2022750656 hasAuthorship W2022750656A5084461414 @default.
- W2022750656 hasConcept C140763907 @default.
- W2022750656 hasConcept C150552126 @default.
- W2022750656 hasConcept C160191386 @default.
- W2022750656 hasConcept C168522837 @default.
- W2022750656 hasConcept C170595534 @default.
- W2022750656 hasConcept C173608175 @default.
- W2022750656 hasConcept C188854837 @default.
- W2022750656 hasConcept C199360897 @default.
- W2022750656 hasConcept C202491316 @default.
- W2022750656 hasConcept C2777904410 @default.
- W2022750656 hasConcept C2781172179 @default.
- W2022750656 hasConcept C41008148 @default.
- W2022750656 hasConceptScore W2022750656C140763907 @default.
- W2022750656 hasConceptScore W2022750656C150552126 @default.
- W2022750656 hasConceptScore W2022750656C160191386 @default.
- W2022750656 hasConceptScore W2022750656C168522837 @default.
- W2022750656 hasConceptScore W2022750656C170595534 @default.
- W2022750656 hasConceptScore W2022750656C173608175 @default.
- W2022750656 hasConceptScore W2022750656C188854837 @default.
- W2022750656 hasConceptScore W2022750656C199360897 @default.
- W2022750656 hasConceptScore W2022750656C202491316 @default.
- W2022750656 hasConceptScore W2022750656C2777904410 @default.
- W2022750656 hasConceptScore W2022750656C2781172179 @default.
- W2022750656 hasConceptScore W2022750656C41008148 @default.
- W2022750656 hasIssue "4" @default.
- W2022750656 hasLocation W20227506561 @default.
- W2022750656 hasOpenAccess W2022750656 @default.
- W2022750656 hasPrimaryLocation W20227506561 @default.
- W2022750656 hasRelatedWork W1535590766 @default.
- W2022750656 hasRelatedWork W1602521801 @default.
- W2022750656 hasRelatedWork W1606391442 @default.
- W2022750656 hasRelatedWork W2022515236 @default.
- W2022750656 hasRelatedWork W2022750656 @default.
- W2022750656 hasRelatedWork W2096621412 @default.
- W2022750656 hasRelatedWork W2115688358 @default.
- W2022750656 hasRelatedWork W2993981457 @default.
- W2022750656 hasRelatedWork W3150560774 @default.
- W2022750656 hasRelatedWork W2109400628 @default.
- W2022750656 hasVolume "38" @default.
- W2022750656 isParatext "false" @default.
- W2022750656 isRetracted "false" @default.
- W2022750656 magId "2022750656" @default.
- W2022750656 workType "article" @default.