Matches in SemOpenAlex for { <https://semopenalex.org/work/W2007742815> ?p ?o ?g. }
Showing items 1 to 76 of
76
with 100 items per page.
- W2007742815 abstract "We present TRISH, a 256-bin histogram method for byte data that runs up to 50% faster than previous GPU methods for random data and 2-4× faster for image data. The performance gains come from reducing total cycle counts. Reducing cycles comes from improving 1) thread level parallelism (TLP), 2) instruction level parallelism (ILP) and 3) software vector parallelism (VP). TLP is improved by increasing occupancy from 2 to 3 thread blocks, achieved by compacting “per thread” histograms in shared memory, and by using register arrays. ILP is improved by increasing independent instructions via loop unrolling by a factor of k= [1..63] and batching operations into groups of four. VP is supported by compacting bin counts into four 8-bit quads per 32-bit element and reducing binning & accumulating instructions by working with 32-bit elements as overlapping 16-bit pairs instead of 4 individual bytes. Note that TRISH is a deterministic algorithm that avoids atomic operations and gives performance that is data independent." @default.
- W2007742815 created "2016-06-24" @default.
- W2007742815 creator A5046373592 @default.
- W2007742815 creator A5074061303 @default.
- W2007742815 date "2012-05-01" @default.
- W2007742815 modified "2023-10-17" @default.
- W2007742815 title "Modestly faster histogram computations on GPUs" @default.
- W2007742815 cites W1979593580 @default.
- W2007742815 cites W2017086619 @default.
- W2007742815 cites W2145455679 @default.
- W2007742815 cites W2148869717 @default.
- W2007742815 doi "https://doi.org/10.1109/inpar.2012.6339589" @default.
- W2007742815 hasPublicationYear "2012" @default.
- W2007742815 type Work @default.
- W2007742815 sameAs 2007742815 @default.
- W2007742815 citedByCount "9" @default.
- W2007742815 countsByYear W20077428152014 @default.
- W2007742815 countsByYear W20077428152016 @default.
- W2007742815 countsByYear W20077428152017 @default.
- W2007742815 countsByYear W20077428152020 @default.
- W2007742815 crossrefType "proceedings-article" @default.
- W2007742815 hasAuthorship W2007742815A5046373592 @default.
- W2007742815 hasAuthorship W2007742815A5074061303 @default.
- W2007742815 hasConcept C11413529 @default.
- W2007742815 hasConcept C115961682 @default.
- W2007742815 hasConcept C138101251 @default.
- W2007742815 hasConcept C154945302 @default.
- W2007742815 hasConcept C156273044 @default.
- W2007742815 hasConcept C169590947 @default.
- W2007742815 hasConcept C173608175 @default.
- W2007742815 hasConcept C199360897 @default.
- W2007742815 hasConcept C2777904410 @default.
- W2007742815 hasConcept C2781172179 @default.
- W2007742815 hasConcept C41008148 @default.
- W2007742815 hasConcept C42992933 @default.
- W2007742815 hasConcept C43364308 @default.
- W2007742815 hasConcept C45374587 @default.
- W2007742815 hasConcept C53533937 @default.
- W2007742815 hasConcept C61483411 @default.
- W2007742815 hasConcept C76970557 @default.
- W2007742815 hasConcept C9390403 @default.
- W2007742815 hasConceptScore W2007742815C11413529 @default.
- W2007742815 hasConceptScore W2007742815C115961682 @default.
- W2007742815 hasConceptScore W2007742815C138101251 @default.
- W2007742815 hasConceptScore W2007742815C154945302 @default.
- W2007742815 hasConceptScore W2007742815C156273044 @default.
- W2007742815 hasConceptScore W2007742815C169590947 @default.
- W2007742815 hasConceptScore W2007742815C173608175 @default.
- W2007742815 hasConceptScore W2007742815C199360897 @default.
- W2007742815 hasConceptScore W2007742815C2777904410 @default.
- W2007742815 hasConceptScore W2007742815C2781172179 @default.
- W2007742815 hasConceptScore W2007742815C41008148 @default.
- W2007742815 hasConceptScore W2007742815C42992933 @default.
- W2007742815 hasConceptScore W2007742815C43364308 @default.
- W2007742815 hasConceptScore W2007742815C45374587 @default.
- W2007742815 hasConceptScore W2007742815C53533937 @default.
- W2007742815 hasConceptScore W2007742815C61483411 @default.
- W2007742815 hasConceptScore W2007742815C76970557 @default.
- W2007742815 hasConceptScore W2007742815C9390403 @default.
- W2007742815 hasLocation W20077428151 @default.
- W2007742815 hasOpenAccess W2007742815 @default.
- W2007742815 hasPrimaryLocation W20077428151 @default.
- W2007742815 hasRelatedWork W1608806855 @default.
- W2007742815 hasRelatedWork W1850053445 @default.
- W2007742815 hasRelatedWork W1972912085 @default.
- W2007742815 hasRelatedWork W2023505575 @default.
- W2007742815 hasRelatedWork W2082701182 @default.
- W2007742815 hasRelatedWork W2164693448 @default.
- W2007742815 hasRelatedWork W2313503008 @default.
- W2007742815 hasRelatedWork W2366027386 @default.
- W2007742815 hasRelatedWork W2378666660 @default.
- W2007742815 hasRelatedWork W99192079 @default.
- W2007742815 isParatext "false" @default.
- W2007742815 isRetracted "false" @default.
- W2007742815 magId "2007742815" @default.
- W2007742815 workType "article" @default.