Matches in SemOpenAlex for { <https://semopenalex.org/work/W4245019076> ?p ?o ?g. }
- W4245019076 endingPage "508" @default.
- W4245019076 startingPage "493" @default.
- W4245019076 abstract "Given a user-specified minimum correlation threshold /spl theta/ and a market-basket database with N items and T transactions, an all-strong-pairs correlation query finds all item pairs with correlations above the threshold /spl theta/. However, when the number of items and transactions are large, the computation cost of this query can be very high. The goal of this paper is to provide computationally efficient algorithms to answer the all-strong-pairs correlation query. Indeed, we identify an upper bound of Pearson's correlation coefficient for binary variables. This upper bound is not only much cheaper to compute than Pearson's correlation coefficient, but also exhibits special monotone properties which allow pruning of many item pairs even without computing their upper bounds. A two-step all-strong-pairs correlation query (TAPER) algorithm is proposed to exploit these properties in a filter-and-refine manner. Furthermore, we provide an algebraic cost model which shows that the computation savings from pruning is independent of or improves when the number of items is increased in data sets with Zipf-like or linear rank-support distributions. Experimental results from synthetic and real-world data sets exhibit similar trends and show that the TAPER algorithm can be an order of magnitude faster than brute-force alternatives. Finally, we demonstrate that the algorithmic ideas developed in the TAPER algorithm can be extended to efficiently compute negative correlation and uncentered Pearson's correlation coefficient." @default.
- W4245019076 created "2022-05-12" @default.
- W4245019076 creator A5022878003 @default.
- W4245019076 creator A5037233397 @default.
- W4245019076 creator A5067731925 @default.
- W4245019076 creator A5071546444 @default.
- W4245019076 date "2006-04-01" @default.
- W4245019076 modified "2023-09-29" @default.
- W4245019076 title "TAPER: a two-step approach for all-strong-pairs correlation query in large databases" @default.
- W4245019076 cites W1575391561 @default.
- W4245019076 cites W1888464276 @default.
- W4245019076 cites W1969842215 @default.
- W4245019076 cites W1988332933 @default.
- W4245019076 cites W2000106226 @default.
- W4245019076 cites W2026562765 @default.
- W4245019076 cites W2043066010 @default.
- W4245019076 cites W2083991698 @default.
- W4245019076 cites W2102489964 @default.
- W4245019076 cites W2108560469 @default.
- W4245019076 cites W2131020804 @default.
- W4245019076 cites W2140129471 @default.
- W4245019076 cites W2156031219 @default.
- W4245019076 cites W2166559705 @default.
- W4245019076 cites W2167482307 @default.
- W4245019076 cites W2210278139 @default.
- W4245019076 cites W3023428462 @default.
- W4245019076 cites W4245913787 @default.
- W4245019076 cites W4252403066 @default.
- W4245019076 doi "https://doi.org/10.1109/tkde.2006.1599388" @default.
- W4245019076 hasPublicationYear "2006" @default.
- W4245019076 type Work @default.
- W4245019076 citedByCount "19" @default.
- W4245019076 countsByYear W42450190762012 @default.
- W4245019076 countsByYear W42450190762013 @default.
- W4245019076 countsByYear W42450190762015 @default.
- W4245019076 countsByYear W42450190762016 @default.
- W4245019076 countsByYear W42450190762018 @default.
- W4245019076 countsByYear W42450190762021 @default.
- W4245019076 crossrefType "journal-article" @default.
- W4245019076 hasAuthorship W4245019076A5022878003 @default.
- W4245019076 hasAuthorship W4245019076A5037233397 @default.
- W4245019076 hasAuthorship W4245019076A5067731925 @default.
- W4245019076 hasAuthorship W4245019076A5071546444 @default.
- W4245019076 hasBestOaLocation W42450190762 @default.
- W4245019076 hasConcept C105795698 @default.
- W4245019076 hasConcept C106131492 @default.
- W4245019076 hasConcept C108010975 @default.
- W4245019076 hasConcept C11413529 @default.
- W4245019076 hasConcept C117220453 @default.
- W4245019076 hasConcept C119857082 @default.
- W4245019076 hasConcept C134306372 @default.
- W4245019076 hasConcept C165696696 @default.
- W4245019076 hasConcept C2524010 @default.
- W4245019076 hasConcept C2780092901 @default.
- W4245019076 hasConcept C2834757 @default.
- W4245019076 hasConcept C31972630 @default.
- W4245019076 hasConcept C33923547 @default.
- W4245019076 hasConcept C38652104 @default.
- W4245019076 hasConcept C41008148 @default.
- W4245019076 hasConcept C45374587 @default.
- W4245019076 hasConcept C55078378 @default.
- W4245019076 hasConcept C6557445 @default.
- W4245019076 hasConcept C77553402 @default.
- W4245019076 hasConcept C86803240 @default.
- W4245019076 hasConceptScore W4245019076C105795698 @default.
- W4245019076 hasConceptScore W4245019076C106131492 @default.
- W4245019076 hasConceptScore W4245019076C108010975 @default.
- W4245019076 hasConceptScore W4245019076C11413529 @default.
- W4245019076 hasConceptScore W4245019076C117220453 @default.
- W4245019076 hasConceptScore W4245019076C119857082 @default.
- W4245019076 hasConceptScore W4245019076C134306372 @default.
- W4245019076 hasConceptScore W4245019076C165696696 @default.
- W4245019076 hasConceptScore W4245019076C2524010 @default.
- W4245019076 hasConceptScore W4245019076C2780092901 @default.
- W4245019076 hasConceptScore W4245019076C2834757 @default.
- W4245019076 hasConceptScore W4245019076C31972630 @default.
- W4245019076 hasConceptScore W4245019076C33923547 @default.
- W4245019076 hasConceptScore W4245019076C38652104 @default.
- W4245019076 hasConceptScore W4245019076C41008148 @default.
- W4245019076 hasConceptScore W4245019076C45374587 @default.
- W4245019076 hasConceptScore W4245019076C55078378 @default.
- W4245019076 hasConceptScore W4245019076C6557445 @default.
- W4245019076 hasConceptScore W4245019076C77553402 @default.
- W4245019076 hasConceptScore W4245019076C86803240 @default.
- W4245019076 hasIssue "4" @default.
- W4245019076 hasLocation W42450190761 @default.
- W4245019076 hasLocation W42450190762 @default.
- W4245019076 hasOpenAccess W4245019076 @default.
- W4245019076 hasPrimaryLocation W42450190761 @default.
- W4245019076 hasRelatedWork W2081116272 @default.
- W4245019076 hasRelatedWork W2150672947 @default.
- W4245019076 hasRelatedWork W2243585625 @default.
- W4245019076 hasRelatedWork W2893067056 @default.
- W4245019076 hasRelatedWork W3013810674 @default.
- W4245019076 hasRelatedWork W3036722656 @default.
- W4245019076 hasRelatedWork W3081640970 @default.
- W4245019076 hasRelatedWork W3096637473 @default.
- W4245019076 hasRelatedWork W3109425891 @default.