Matches in SemOpenAlex for { <https://semopenalex.org/work/W636149895> ?p ?o ?g. }
- W636149895 abstract "The Extensible Markup Language (XML) has become an increasingly popular format for representing and exchanging data. Its flexible and extensible syntax makes it suitable for representing structured data, textual information, or a mixture of both. The popularization of XML has led to the development of a new database type. XML databases serve as repositories of large collections of XML documents, and seek to provide the same benefits for XML data as relational databases do for relational data: indexing, transactional processing, failsafe physical storage, querying collections, etc. There are two standardized query languages for XML, XQuery and XPath, which are both powerful for querying and navigating the structure of XML. However, they offer limited support for full-text search, and cannot be used alone for typical Information Retrieval (IR) applications. To address IR-related issues in XML, a new standard is emerging as an extension to XPath and XQuery: XQuery and XPath Full Text 1.0 (XQFT). XQFT is carefully investigated to determine how well-known IR techniques apply to XML, and the characteristics of full-text search and indexing in existing XML databases are described in a state-of-the-art study. Based on findings from literature and source code review, the design and implementation of XQFT are discussed, first in general terms, then in the context of Oracle Berkeley DB XML (BDB XML). Experimental support for XQFT is enabled in BDB XML, and a few experiments are conducted in order to evaluate functional aspects of the XQFT implementation. A scheme for full-text indexing in BDB XML is proposed. The full-text index acts as an augmented version of an inverted list, and is implemented on top of an Oracle Berkeley DB database. Tokens are used as keys, with data tuples for each distinct (document, path) combination the token occurs in. Lookups in the index are based on keywords, and should allow answering various queries without materializing data. Investigation shows that XML-based IR with XQFT is not fundamentally different from traditional text-based IR. Full-text queries rely on linguistic tokens, which, in XQFT, are derived from nodes without considering the XML structure. Further, it is discovered that full-text indexing is crucial for query efficiency in large document collections. In summary, common issues with full-text search are present in XML-based IR, and are addressed in the same manner as in text-based IR." @default.
- W636149895 created "2016-06-24" @default.
- W636149895 creator A5007879219 @default.
- W636149895 date "2009-01-01" @default.
- W636149895 modified "2023-09-24" @default.
- W636149895 title "Full-Text Search in XML Databases" @default.
- W636149895 cites W1568367777 @default.
- W636149895 cites W2101413593 @default.
- W636149895 cites W2135432866 @default.
- W636149895 hasPublicationYear "2009" @default.
- W636149895 type Work @default.
- W636149895 sameAs 636149895 @default.
- W636149895 citedByCount "0" @default.
- W636149895 crossrefType "dissertation" @default.
- W636149895 hasAuthorship W636149895A5007879219 @default.
- W636149895 hasConcept C11508877 @default.
- W636149895 hasConcept C136764020 @default.
- W636149895 hasConcept C173242113 @default.
- W636149895 hasConcept C183068750 @default.
- W636149895 hasConcept C199360897 @default.
- W636149895 hasConcept C23123220 @default.
- W636149895 hasConcept C2780213375 @default.
- W636149895 hasConcept C2780512708 @default.
- W636149895 hasConcept C34330436 @default.
- W636149895 hasConcept C34716815 @default.
- W636149895 hasConcept C40713593 @default.
- W636149895 hasConcept C41008148 @default.
- W636149895 hasConcept C44883583 @default.
- W636149895 hasConcept C55348073 @default.
- W636149895 hasConcept C68699486 @default.
- W636149895 hasConcept C77088390 @default.
- W636149895 hasConcept C8797682 @default.
- W636149895 hasConceptScore W636149895C11508877 @default.
- W636149895 hasConceptScore W636149895C136764020 @default.
- W636149895 hasConceptScore W636149895C173242113 @default.
- W636149895 hasConceptScore W636149895C183068750 @default.
- W636149895 hasConceptScore W636149895C199360897 @default.
- W636149895 hasConceptScore W636149895C23123220 @default.
- W636149895 hasConceptScore W636149895C2780213375 @default.
- W636149895 hasConceptScore W636149895C2780512708 @default.
- W636149895 hasConceptScore W636149895C34330436 @default.
- W636149895 hasConceptScore W636149895C34716815 @default.
- W636149895 hasConceptScore W636149895C40713593 @default.
- W636149895 hasConceptScore W636149895C41008148 @default.
- W636149895 hasConceptScore W636149895C44883583 @default.
- W636149895 hasConceptScore W636149895C55348073 @default.
- W636149895 hasConceptScore W636149895C68699486 @default.
- W636149895 hasConceptScore W636149895C77088390 @default.
- W636149895 hasConceptScore W636149895C8797682 @default.
- W636149895 hasLocation W6361498951 @default.
- W636149895 hasOpenAccess W636149895 @default.
- W636149895 hasPrimaryLocation W6361498951 @default.
- W636149895 hasRelatedWork W1489079028 @default.
- W636149895 hasRelatedWork W1497580234 @default.
- W636149895 hasRelatedWork W1555726118 @default.
- W636149895 hasRelatedWork W1579422622 @default.
- W636149895 hasRelatedWork W1981009354 @default.
- W636149895 hasRelatedWork W2009332535 @default.
- W636149895 hasRelatedWork W2036954656 @default.
- W636149895 hasRelatedWork W2074863013 @default.
- W636149895 hasRelatedWork W2080346608 @default.
- W636149895 hasRelatedWork W2106024890 @default.
- W636149895 hasRelatedWork W2134210615 @default.
- W636149895 hasRelatedWork W2146673488 @default.
- W636149895 hasRelatedWork W2149169606 @default.
- W636149895 hasRelatedWork W2391832737 @default.
- W636149895 hasRelatedWork W2465015636 @default.
- W636149895 hasRelatedWork W2530482074 @default.
- W636149895 hasRelatedWork W3087905464 @default.
- W636149895 hasRelatedWork W754413898 @default.
- W636149895 hasRelatedWork W89090514 @default.
- W636149895 hasRelatedWork W2585354504 @default.
- W636149895 isParatext "false" @default.
- W636149895 isRetracted "false" @default.
- W636149895 magId "636149895" @default.
- W636149895 workType "dissertation" @default.
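
The abstract describes a full-text index in which tokens are keys and each key holds data tuples for every distinct (document, path) combination the token occurs in. The following is a minimal sketch of that idea, not the dissertation's implementation: it uses an in-memory Python dictionary where the abstract uses an Oracle Berkeley DB database, and the tokenizer, the occurrence count stored per tuple, the simple slash-separated paths, and the names `tokenize`, `index_document`, and `lookup` are all assumptions made for illustration.

```python
import re
import xml.etree.ElementTree as ET
from collections import defaultdict


def tokenize(text):
    """Split text into lowercase word tokens (a simplifying assumption)."""
    return re.findall(r"\w+", text.lower())


def index_document(index, doc_id, xml_text):
    """Add every token of one XML document to the index.

    index maps token -> {(doc_id, path): occurrence count}.
    The count is a hypothetical payload; the abstract does not say
    what each data tuple contains beyond (document, path).
    """
    root = ET.fromstring(xml_text)

    def walk(elem, path):
        # Text directly inside this element, plus text trailing each child.
        pieces = [elem.text or ""]
        for child in elem:
            walk(child, f"{path}/{child.tag}")
            pieces.append(child.tail or "")
        for token in tokenize(" ".join(pieces)):
            index[token][(doc_id, path)] += 1

    walk(root, f"/{root.tag}")


def lookup(index, keyword):
    """Return sorted (doc_id, path) pairs whose text contains the keyword."""
    return sorted(index.get(keyword.lower(), {}))


# Hypothetical sample documents, invented for this sketch.
index = defaultdict(lambda: defaultdict(int))
index_document(
    index,
    "doc1",
    "<book><title>XML databases</title><body>Full text search in XML</body></book>",
)
index_document(index, "doc2", "<note><body>Relational databases</body></note>")

print(lookup(index, "xml"))
print(lookup(index, "databases"))
```

Because `lookup` reads only the index, a keyword query is answered without parsing or materializing the stored documents, which is the property the abstract attributes to the proposed scheme.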