Matches in SemOpenAlex for { <https://semopenalex.org/work/W2210347907> ?p ?o ?g. }
- W2210347907 abstract "_____________________________________________________________________________ The Web is comprised of a vast quantity of text. Modern search engines struggle to index it independent of the structure of queries and type of Web data, and commonly use indexing based on Web‘s graph structure to identify high-quality relevant pages. However, despite the apparent widespread use of these algorithms, Web indexing based on human feedback and document content is controversial. There are many fundamental questions that need to be addressed, including: How many types of domains/websites are there in the Web? What type of data is in each type of domain? For each type, which segments/HTML fields in the documents are most useful? What are the relationships between the segments? How can web content be indexed efficiently in all forms of document configurations? Our investigation of these questions has led to a novel way to use Wikipedia to find the relationships between the query structures and document configurations throughout the document indexing process and to use them to build an efficient index that allows fast indexing and searching, and optimizes the retrieval of highly relevant results. We consider the top page on the ranked list to be highly important in determining the types of queries. Our aim is to design a powerful search engine with a strong focus on how to make the first page highly relevant to the user, and on how to retrieve other pages based on that first page. Through processing the user query using the Wikipedia index and determining the type of the query, our approach could trace the path of a query in our index, and retrieve specific results for each type. We use two kinds of data to increase the relevancy and efficiency of the ranked results: offline and real-time. Traditional search engines find it difficult to use these two kinds of data together, because building a real-time index from social data and integrating it with the index for the offline data is difficult in a traditional distributed index. As a source of offline data, we use data from the Text Retrieval Conference (TREC) evaluation campaign. The web track at TREC offers researchers chance to investigate different retrieval approaches for web indexing and searching. The crawled offline dataset makes it possible to design powerful search engines that extends current methods and to evaluate and compare them. We propose a new indexing method, based on the structures of the queries and the content of documents. Our search engine uses a core index for offline data and a hash index for real-time" @default.
- W2210347907 created "2016-06-24" @default.
- W2210347907 creator A5004764627 @default.
- W2210347907 creator A5072218899 @default.
- W2210347907 date "2014-01-01" @default.
- W2210347907 modified "2023-09-25" @default.
- W2210347907 title "Using Wikipedia Knowledge and Query Types in a New Indexing Approach for Web Search Engines" @default.
- W2210347907 cites W146125910 @default.
- W2210347907 cites W1484898176 @default.
- W2210347907 cites W1486033873 @default.
- W2210347907 cites W1487889746 @default.
- W2210347907 cites W1493893823 @default.
- W2210347907 cites W1522032273 @default.
- W2210347907 cites W1525225449 @default.
- W2210347907 cites W1528140509 @default.
- W2210347907 cites W1532325895 @default.
- W2210347907 cites W1533907908 @default.
- W2210347907 cites W1534714852 @default.
- W2210347907 cites W1540841176 @default.
- W2210347907 cites W155795827 @default.
- W2210347907 cites W1562573314 @default.
- W2210347907 cites W1568535688 @default.
- W2210347907 cites W1577278726 @default.
- W2210347907 cites W1595375842 @default.
- W2210347907 cites W1603653375 @default.
- W2210347907 cites W1660390307 @default.
- W2210347907 cites W1679913846 @default.
- W2210347907 cites W1685426458 @default.
- W2210347907 cites W1735420517 @default.
- W2210347907 cites W1737065859 @default.
- W2210347907 cites W1776265914 @default.
- W2210347907 cites W184023255 @default.
- W2210347907 cites W1845927601 @default.
- W2210347907 cites W1878118535 @default.
- W2210347907 cites W1885937115 @default.
- W2210347907 cites W188956776 @default.
- W2210347907 cites W193708675 @default.
- W2210347907 cites W1943989130 @default.
- W2210347907 cites W1967160701 @default.
- W2210347907 cites W1968927634 @default.
- W2210347907 cites W1978394996 @default.
- W2210347907 cites W1984808605 @default.
- W2210347907 cites W1988686126 @default.
- W2210347907 cites W1990517717 @default.
- W2210347907 cites W1994129769 @default.
- W2210347907 cites W1995262888 @default.
- W2210347907 cites W1996764654 @default.
- W2210347907 cites W1996934732 @default.
- W2210347907 cites W2006504964 @default.
- W2210347907 cites W2006608770 @default.
- W2210347907 cites W2032459394 @default.
- W2210347907 cites W2034927834 @default.
- W2210347907 cites W2036362156 @default.
- W2210347907 cites W204105018 @default.
- W2210347907 cites W2041179002 @default.
- W2210347907 cites W2041563865 @default.
- W2210347907 cites W2047008218 @default.
- W2210347907 cites W2051804774 @default.
- W2210347907 cites W2055247046 @default.
- W2210347907 cites W2055385473 @default.
- W2210347907 cites W2060494110 @default.
- W2210347907 cites W2061410315 @default.
- W2210347907 cites W2066636486 @default.
- W2210347907 cites W2075719792 @default.
- W2210347907 cites W2078396654 @default.
- W2210347907 cites W2085030399 @default.
- W2210347907 cites W2086253379 @default.
- W2210347907 cites W2088314245 @default.
- W2210347907 cites W2089199911 @default.
- W2210347907 cites W20893098 @default.
- W2210347907 cites W2092902419 @default.
- W2210347907 cites W2094790959 @default.
- W2210347907 cites W2096913158 @default.
- W2210347907 cites W2106421124 @default.
- W2210347907 cites W2106750491 @default.
- W2210347907 cites W2108009052 @default.
- W2210347907 cites W2110802877 @default.
- W2210347907 cites W2111713978 @default.
- W2210347907 cites W2113987729 @default.
- W2210347907 cites W2116241375 @default.
- W2210347907 cites W2118809019 @default.
- W2210347907 cites W2119312811 @default.
- W2210347907 cites W2119577990 @default.
- W2210347907 cites W2119875355 @default.
- W2210347907 cites W2121167884 @default.
- W2210347907 cites W2121587139 @default.
- W2210347907 cites W2124939717 @default.
- W2210347907 cites W2126782364 @default.
- W2210347907 cites W2127466325 @default.
- W2210347907 cites W2128319626 @default.
- W2210347907 cites W2135220386 @default.
- W2210347907 cites W2138621811 @default.
- W2210347907 cites W2141175968 @default.
- W2210347907 cites W2142510125 @default.
- W2210347907 cites W2146081744 @default.
- W2210347907 cites W2148212498 @default.
- W2210347907 cites W2148507357 @default.
- W2210347907 cites W2149420958 @default.
- W2210347907 cites W2160214409 @default.
- W2210347907 cites W2162161511 @default.