Matches in SemOpenAlex for { <https://semopenalex.org/work/W105364189> ?p ?o ?g. }
Showing items 1 to 86 of 86 with 100 items per page.
- W105364189 endingPage "693" @default.
- W105364189 startingPage "687" @default.
- W105364189 abstract "All Pairs Similarity Search (APSS) is the problem of finding all pairs of records with similarity scores above a specified threshold. Incremental All Pairs Similarity Search (IAPSS) is the problem of performing APSS multiple times over the same dataset by varying the similarity threshold. This problem is ubiquitous in many real-world systems like search engines, online social networks, and digital libraries. A significant part of the computation is redundant across multiple invocations of APSS. Our solution to the IAPSS problem avoids these redundant computations by storing the history of previous APSS invocations and splitting the inverted index that maps each dimension into a list of records that have non-zero projections along that dimension. The size of the computation history increases quadratically with the number of records in the dataset. We introduce the concept of a similarity floor to store partial computation history, resulting in reduced I/O overhead. We empirically evaluate the effectiveness of our techniques using four real-world large-scale datasets. Our IAPSS solution achieves speed-ups in the order of 2X to over 10^5X over the state-of-the-art APSS algorithm, while reducing the size of the computation history by at least an order of magnitude." @default.
- W105364189 created "2016-06-24" @default.
- W105364189 creator A5000346770 @default.
- W105364189 creator A5002863516 @default.
- W105364189 creator A5040551107 @default.
- W105364189 date "2009-01-01" @default.
- W105364189 modified "2023-09-24" @default.
- W105364189 title "Incremental All Pairs Similarity Search for Varying Similarity Thresholds with Reduced I/O Overhead." @default.
- W105364189 cites W1671906456 @default.
- W105364189 cites W2080128271 @default.
- W105364189 cites W2089923519 @default.
- W105364189 cites W2096598900 @default.
- W105364189 cites W2097184821 @default.
- W105364189 cites W2097776316 @default.
- W105364189 cites W2105436061 @default.
- W105364189 cites W2108620170 @default.
- W105364189 cites W2114353347 @default.
- W105364189 cites W2115022330 @default.
- W105364189 cites W2117350857 @default.
- W105364189 cites W2123060058 @default.
- W105364189 cites W2165611133 @default.
- W105364189 hasPublicationYear "2009" @default.
- W105364189 type Work @default.
- W105364189 sameAs 105364189 @default.
- W105364189 citedByCount "2" @default.
- W105364189 crossrefType "journal-article" @default.
- W105364189 hasAuthorship W105364189A5000346770 @default.
- W105364189 hasAuthorship W105364189A5002863516 @default.
- W105364189 hasAuthorship W105364189A5040551107 @default.
- W105364189 hasConcept C103278499 @default.
- W105364189 hasConcept C111919701 @default.
- W105364189 hasConcept C11413529 @default.
- W105364189 hasConcept C114614502 @default.
- W105364189 hasConcept C115961682 @default.
- W105364189 hasConcept C116738811 @default.
- W105364189 hasConcept C124101348 @default.
- W105364189 hasConcept C154945302 @default.
- W105364189 hasConcept C2779960059 @default.
- W105364189 hasConcept C33676613 @default.
- W105364189 hasConcept C33923547 @default.
- W105364189 hasConcept C41008148 @default.
- W105364189 hasConcept C45374587 @default.
- W105364189 hasConcept C80444323 @default.
- W105364189 hasConceptScore W105364189C103278499 @default.
- W105364189 hasConceptScore W105364189C111919701 @default.
- W105364189 hasConceptScore W105364189C11413529 @default.
- W105364189 hasConceptScore W105364189C114614502 @default.
- W105364189 hasConceptScore W105364189C115961682 @default.
- W105364189 hasConceptScore W105364189C116738811 @default.
- W105364189 hasConceptScore W105364189C124101348 @default.
- W105364189 hasConceptScore W105364189C154945302 @default.
- W105364189 hasConceptScore W105364189C2779960059 @default.
- W105364189 hasConceptScore W105364189C33676613 @default.
- W105364189 hasConceptScore W105364189C33923547 @default.
- W105364189 hasConceptScore W105364189C41008148 @default.
- W105364189 hasConceptScore W105364189C45374587 @default.
- W105364189 hasConceptScore W105364189C80444323 @default.
- W105364189 hasLocation W1053641891 @default.
- W105364189 hasOpenAccess W105364189 @default.
- W105364189 hasPrimaryLocation W1053641891 @default.
- W105364189 hasRelatedWork W1520663999 @default.
- W105364189 hasRelatedWork W2050653561 @default.
- W105364189 hasRelatedWork W2065110301 @default.
- W105364189 hasRelatedWork W2129799397 @default.
- W105364189 hasRelatedWork W2135390578 @default.
- W105364189 hasRelatedWork W2491319876 @default.
- W105364189 hasRelatedWork W2492705872 @default.
- W105364189 hasRelatedWork W2756522268 @default.
- W105364189 hasRelatedWork W2765818088 @default.
- W105364189 hasRelatedWork W2898318433 @default.
- W105364189 hasRelatedWork W2945594983 @default.
- W105364189 hasRelatedWork W2951208214 @default.
- W105364189 hasRelatedWork W2979341317 @default.
- W105364189 hasRelatedWork W2990912288 @default.
- W105364189 hasRelatedWork W2995786143 @default.
- W105364189 hasRelatedWork W3027211216 @default.
- W105364189 hasRelatedWork W3080440842 @default.
- W105364189 hasRelatedWork W3128603515 @default.
- W105364189 hasRelatedWork W36574333 @default.
- W105364189 hasRelatedWork W68877885 @default.
- W105364189 isParatext "false" @default.
- W105364189 isRetracted "false" @default.
- W105364189 magId "105364189" @default.
- W105364189 workType "article" @default.