Matches in SemOpenAlex for { <https://semopenalex.org/work/W1672725576> ?p ?o ?g. }
- W1672725576 abstract "Let D be a collection of D documents, which are strings over an alphabet of size σ, of total length n. We describe a data structure that uses linear space and and reports k most relevant documents that contain a query pattern P, which is a string of length p, in time O(p/logn+k), which is optimal in the RAM model in the general case where lgD = �(logn), and involves a novel RAM-optimal suffix tree search. Our construction supports an ample set of important relevance measures, such as the number of times P appears in a document (called term frequency), a fixed document importance, and the minimal distance between two occurrences of P in a document. When lgD = o(logn), we show how to reduce the space of the data structure from O(nlogn) to O(n(logσ+logD+loglogn)) bits, and to O(n(logσ+logD)) bits in the case of the popular term frequency measure of relevance, at the price of an additive term O(log nlogσ) in the query time, for any constant e > 0. We also consider the dynamic scenario, where documents can be inserted and deleted from the collection. We obtain linear space and query time O(p(loglogn) 2 /logn+logn+kloglogk), whereas insertions and deletions require O(log 1+ n) time per symbol, for any constant e > 0. Finally, we consider an extended static scenario where an extra parameter par(P,d) is de- fined, and the query must retrieve only documents d such that par(P,d) ∈ (τ1,τ2), where this range is specified at query time. We solve these queries using linear space and O(p/logn + log 1+ n + k log n) time, for any constant e > 0. Our technique is to translate these top-k problems into multidimensional geometric search problems. As an additional bonus, we describe some improvements to those problems." @default.
- W1672725576 created "2016-06-24" @default.
- W1672725576 creator A5050513868 @default.
- W1672725576 creator A5080743153 @default.
- W1672725576 date "2013-07-25" @default.
- W1672725576 modified "2023-09-27" @default.
- W1672725576 title "Optimal Top-k Document Retrieval ∗" @default.
- W1672725576 cites W142567295 @default.
- W1672725576 cites W1485516007 @default.
- W1672725576 cites W1496038746 @default.
- W1672725576 cites W1504477191 @default.
- W1672725576 cites W1520568851 @default.
- W1672725576 cites W1562034888 @default.
- W1672725576 cites W1571941879 @default.
- W1672725576 cites W1575350389 @default.
- W1672725576 cites W1660390307 @default.
- W1672725576 cites W166139681 @default.
- W1672725576 cites W1672848638 @default.
- W1672725576 cites W1676865579 @default.
- W1672725576 cites W1752316941 @default.
- W1672725576 cites W1876495223 @default.
- W1672725576 cites W1909754222 @default.
- W1672725576 cites W1970194312 @default.
- W1672725576 cites W1973520416 @default.
- W1672725576 cites W1979109797 @default.
- W1672725576 cites W1989682699 @default.
- W1672725576 cites W1990061958 @default.
- W1672725576 cites W2006131099 @default.
- W1672725576 cites W2007791040 @default.
- W1672725576 cites W2014318353 @default.
- W1672725576 cites W2026511056 @default.
- W1672725576 cites W2027252317 @default.
- W1672725576 cites W2029132631 @default.
- W1672725576 cites W2030839740 @default.
- W1672725576 cites W2049204576 @default.
- W1672725576 cites W2051158076 @default.
- W1672725576 cites W2059513841 @default.
- W1672725576 cites W2066362074 @default.
- W1672725576 cites W2073921136 @default.
- W1672725576 cites W2080106004 @default.
- W1672725576 cites W2080990114 @default.
- W1672725576 cites W2085933841 @default.
- W1672725576 cites W2093918274 @default.
- W1672725576 cites W2103850023 @default.
- W1672725576 cites W2107079154 @default.
- W1672725576 cites W2107082304 @default.
- W1672725576 cites W2118274795 @default.
- W1672725576 cites W2121252285 @default.
- W1672725576 cites W2129805272 @default.
- W1672725576 cites W2130080588 @default.
- W1672725576 cites W2134696992 @default.
- W1672725576 cites W2135050452 @default.
- W1672725576 cites W2135208303 @default.
- W1672725576 cites W2135639194 @default.
- W1672725576 cites W2137120608 @default.
- W1672725576 cites W2138662031 @default.
- W1672725576 cites W2141957180 @default.
- W1672725576 cites W2144759920 @default.
- W1672725576 cites W2149710566 @default.
- W1672725576 cites W2151453116 @default.
- W1672725576 cites W2165621523 @default.
- W1672725576 cites W2173123188 @default.
- W1672725576 cites W227213435 @default.
- W1672725576 cites W2283054559 @default.
- W1672725576 cites W2533248932 @default.
- W1672725576 cites W2951188822 @default.
- W1672725576 cites W2953015733 @default.
- W1672725576 cites W297061308 @default.
- W1672725576 cites W3198160809 @default.
- W1672725576 cites W346857011 @default.
- W1672725576 cites W92500321 @default.
- W1672725576 hasPublicationYear "2013" @default.
- W1672725576 type Work @default.
- W1672725576 sameAs 1672725576 @default.
- W1672725576 citedByCount "2" @default.
- W1672725576 countsByYear W16727255762013 @default.
- W1672725576 countsByYear W16727255762015 @default.
- W1672725576 crossrefType "posted-content" @default.
- W1672725576 hasAuthorship W1672725576A5050513868 @default.
- W1672725576 hasAuthorship W1672725576A5080743153 @default.
- W1672725576 hasConcept C111919701 @default.
- W1672725576 hasConcept C112876837 @default.
- W1672725576 hasConcept C114614502 @default.
- W1672725576 hasConcept C121332964 @default.
- W1672725576 hasConcept C138885662 @default.
- W1672725576 hasConcept C157486923 @default.
- W1672725576 hasConcept C158154518 @default.
- W1672725576 hasConcept C162319229 @default.
- W1672725576 hasConcept C176370821 @default.
- W1672725576 hasConcept C177264268 @default.
- W1672725576 hasConcept C17744445 @default.
- W1672725576 hasConcept C199360897 @default.
- W1672725576 hasConcept C199539241 @default.
- W1672725576 hasConcept C2778572836 @default.
- W1672725576 hasConcept C2779804580 @default.
- W1672725576 hasConcept C2781166958 @default.
- W1672725576 hasConcept C33923547 @default.
- W1672725576 hasConcept C37914503 @default.
- W1672725576 hasConcept C41008148 @default.
- W1672725576 hasConcept C41895202 @default.