Matches in SemOpenAlex for { <https://semopenalex.org/work/W2187383495> ?p ?o ?g. }
- W2187383495 abstract "Databases and information retrieval systems storing data from the World Wide Web require new optimization strategies to deal with the different types of data (textual, semi-structured and relational), the massiveness of the data, and the dynamic nature of the data. This dissertation investigates on-line methods for the optimization of databases storing web data. On-line methods avoid access to the underlying data, avoid the rebuilding of data structures from scratch, and adapt to changes in the data and query workload characteristics. On-line techniques are therefore especially suited to web data. In this dissertation, we present on-line techniques for two general problems: how to update inverted indexes and how to estimate the selectivity or result size of queries in database systems. For the index update problem, we present the landmark-diff method for updating the inverted index in response to changes in previously indexed documents. The landmark-diff method allows indexes to be updated incrementally without a complete rebuild. For selectivity estimation, we propose three on-line techniques that address the selectivity estimation problem in three different context. In the context of relational databases, we present SASH, a Self-Adapting Set of Histograms, that uses probabilistic graphical models to address the following issues in an on-line manner: which sets of attributes to build histograms on, how to build these histograms without looking at data, and how much memory should be allocated to these histograms. For XML native databases, we present XPathLearner, an on-line workload-aware method for estimating the result size of given XPath queries. XPathLearner learns the path statistics from query feedback (past query answers) in an on-line manner, and adapts itself to changes in the data and the query workload characteristics. In a more general context, we present CXHist, an on-line classification based histogram for estimating the selectivity of a broad class of queries. CXHist models queries instead of data and stores the mapping between queries and their selectivity using a naive Bayesian classifier. We show that CXHist is very accurate in estimating the selectivity of exact match and substring predicates on leaf values reachable by a given XPath in XML databases." @default.
- W2187383495 created "2016-06-24" @default.
- W2187383495 creator A5003340402 @default.
- W2187383495 creator A5005957197 @default.
- W2187383495 date "2004-01-01" @default.
- W2187383495 modified "2023-09-28" @default.
- W2187383495 title "On-line methods for database optimization" @default.
- W2187383495 cites W119813091 @default.
- W2187383495 cites W1480794755 @default.
- W2187383495 cites W1507029541 @default.
- W2187383495 cites W1508914815 @default.
- W2187383495 cites W1537369822 @default.
- W2187383495 cites W1544843123 @default.
- W2187383495 cites W1566984846 @default.
- W2187383495 cites W1578539484 @default.
- W2187383495 cites W1581547316 @default.
- W2187383495 cites W1591473008 @default.
- W2187383495 cites W1593538391 @default.
- W2187383495 cites W1625057832 @default.
- W2187383495 cites W1659541576 @default.
- W2187383495 cites W1660390307 @default.
- W2187383495 cites W1669131436 @default.
- W2187383495 cites W1721621563 @default.
- W2187383495 cites W1965555277 @default.
- W2187383495 cites W1975887898 @default.
- W2187383495 cites W1976624301 @default.
- W2187383495 cites W1984629602 @default.
- W2187383495 cites W1985108724 @default.
- W2187383495 cites W1991271936 @default.
- W2187383495 cites W1994694191 @default.
- W2187383495 cites W1997841190 @default.
- W2187383495 cites W1998244781 @default.
- W2187383495 cites W2002722920 @default.
- W2187383495 cites W2011632873 @default.
- W2187383495 cites W2017392697 @default.
- W2187383495 cites W2021850646 @default.
- W2187383495 cites W2023695725 @default.
- W2187383495 cites W2034385326 @default.
- W2187383495 cites W2041851430 @default.
- W2187383495 cites W2046983134 @default.
- W2187383495 cites W2049342105 @default.
- W2187383495 cites W2053049304 @default.
- W2187383495 cites W2059387258 @default.
- W2187383495 cites W2066636486 @default.
- W2187383495 cites W2075833184 @default.
- W2187383495 cites W2079145130 @default.
- W2187383495 cites W2095374884 @default.
- W2187383495 cites W2099126271 @default.
- W2187383495 cites W2099132625 @default.
- W2187383495 cites W2102098892 @default.
- W2187383495 cites W2107628111 @default.
- W2187383495 cites W2109464129 @default.
- W2187383495 cites W2109808436 @default.
- W2187383495 cites W2110459974 @default.
- W2187383495 cites W2112056262 @default.
- W2187383495 cites W2120108467 @default.
- W2187383495 cites W2121604804 @default.
- W2187383495 cites W2122416857 @default.
- W2187383495 cites W2132560950 @default.
- W2187383495 cites W2134826720 @default.
- W2187383495 cites W2138793904 @default.
- W2187383495 cites W2139242647 @default.
- W2187383495 cites W2144621469 @default.
- W2187383495 cites W2147440220 @default.
- W2187383495 cites W2148832586 @default.
- W2187383495 cites W2150145391 @default.
- W2187383495 cites W2150593711 @default.
- W2187383495 cites W2151310484 @default.
- W2187383495 cites W2153329411 @default.
- W2187383495 cites W2156210689 @default.
- W2187383495 cites W2160484851 @default.
- W2187383495 cites W2167439683 @default.
- W2187383495 cites W2168865746 @default.
- W2187383495 cites W2171903035 @default.
- W2187383495 cites W2203835818 @default.
- W2187383495 cites W2212047990 @default.
- W2187383495 cites W2230830246 @default.
- W2187383495 cites W2293896416 @default.
- W2187383495 cites W2321470647 @default.
- W2187383495 cites W2766736793 @default.
- W2187383495 cites W44987379 @default.
- W2187383495 hasPublicationYear "2004" @default.
- W2187383495 type Work @default.
- W2187383495 sameAs 2187383495 @default.
- W2187383495 citedByCount "0" @default.
- W2187383495 crossrefType "journal-article" @default.
- W2187383495 hasAuthorship W2187383495A5003340402 @default.
- W2187383495 hasAuthorship W2187383495A5005957197 @default.
- W2187383495 hasConcept C115961682 @default.
- W2187383495 hasConcept C124101348 @default.
- W2187383495 hasConcept C151730666 @default.
- W2187383495 hasConcept C154945302 @default.
- W2187383495 hasConcept C177264268 @default.
- W2187383495 hasConcept C199360897 @default.
- W2187383495 hasConcept C23123220 @default.
- W2187383495 hasConcept C2779343474 @default.
- W2187383495 hasConcept C41008148 @default.
- W2187383495 hasConcept C53533937 @default.
- W2187383495 hasConcept C5655090 @default.
- W2187383495 hasConcept C59276292 @default.