Matches in SemOpenAlex for { <https://semopenalex.org/work/W578951954> ?p ?o ?g. }
Showing items 1 to 67 of 67 with 100 items per page.
- W578951954 abstract "The goal of this master's thesis has been to evaluate methods for redistributing data on search engine clusters. For all of the methods, redistribution is done when the cluster changes size. Redistribution methods designed specifically for search engines are not common, so the methods compared in this thesis are drawn from other distributed settings, including distributed database systems, distributed files and continuous media systems. The evaluation of the methods consists of two parts: a theoretical analysis, and an implementation and testing of the methods. In the theoretical analysis the methods are compared by deriving expressions for their performance. In the practical approach the algorithms are implemented on a simplified search engine cluster of 6 computers. The methods have been evaluated using three criteria. The first criterion is how well the methods distribute documents across the cluster. In the theoretical analysis this also includes worst-case scenarios. The practical evaluation compares the distribution at the end of the tests. The second criterion is the efficiency of document access. The theoretical approach focuses on the number of operations required, while the practical approach calculates indexing throughput. The third criterion is the document volume transported during redistribution. For the final part of the comparison, some relevant scenarios are introduced. These scenarios focus on dynamic data sets with a high frequency of updates, frequent new documents and heavy searching. Using the scenarios and the results from the method testing, we found that some methods performed better than others. It is worth noting that the conclusions hold for the type of workload given by the scenarios and for the test setting. In other situations, other methods might be more suitable.
When concluding our results we found that, for the given scenarios, the best distribution method was the distributed version of linear hashing (LH*). The method using hashing/range-partitioning proved to be the least suitable, as a consequence of its high transport volume." @default.
- W578951954 created "2016-06-24" @default.
- W578951954 creator A5037777386 @default.
- W578951954 date "2009-01-01" @default.
- W578951954 modified "2023-09-24" @default.
- W578951954 title "Redistribution of Documents across Search Engine Clusters" @default.
- W578951954 hasPublicationYear "2009" @default.
- W578951954 type Work @default.
- W578951954 sameAs 578951954 @default.
- W578951954 citedByCount "0" @default.
- W578951954 crossrefType "dissertation" @default.
- W578951954 hasAuthorship W578951954A5037777386 @default.
- W578951954 hasConcept C120665830 @default.
- W578951954 hasConcept C121332964 @default.
- W578951954 hasConcept C124101348 @default.
- W578951954 hasConcept C164866538 @default.
- W578951954 hasConcept C17744445 @default.
- W578951954 hasConcept C192209626 @default.
- W578951954 hasConcept C199539241 @default.
- W578951954 hasConcept C23123220 @default.
- W578951954 hasConcept C31258907 @default.
- W578951954 hasConcept C41008148 @default.
- W578951954 hasConcept C74080474 @default.
- W578951954 hasConcept C75165309 @default.
- W578951954 hasConcept C94625758 @default.
- W578951954 hasConcept C97854310 @default.
- W578951954 hasConceptScore W578951954C120665830 @default.
- W578951954 hasConceptScore W578951954C121332964 @default.
- W578951954 hasConceptScore W578951954C124101348 @default.
- W578951954 hasConceptScore W578951954C164866538 @default.
- W578951954 hasConceptScore W578951954C17744445 @default.
- W578951954 hasConceptScore W578951954C192209626 @default.
- W578951954 hasConceptScore W578951954C199539241 @default.
- W578951954 hasConceptScore W578951954C23123220 @default.
- W578951954 hasConceptScore W578951954C31258907 @default.
- W578951954 hasConceptScore W578951954C41008148 @default.
- W578951954 hasConceptScore W578951954C74080474 @default.
- W578951954 hasConceptScore W578951954C75165309 @default.
- W578951954 hasConceptScore W578951954C94625758 @default.
- W578951954 hasConceptScore W578951954C97854310 @default.
- W578951954 hasLocation W5789519541 @default.
- W578951954 hasOpenAccess W578951954 @default.
- W578951954 hasPrimaryLocation W5789519541 @default.
- W578951954 hasRelatedWork W1572506298 @default.
- W578951954 hasRelatedWork W1968376769 @default.
- W578951954 hasRelatedWork W2014244478 @default.
- W578951954 hasRelatedWork W2033115441 @default.
- W578951954 hasRelatedWork W2036268228 @default.
- W578951954 hasRelatedWork W2044972257 @default.
- W578951954 hasRelatedWork W2057826945 @default.
- W578951954 hasRelatedWork W2120762159 @default.
- W578951954 hasRelatedWork W213626548 @default.
- W578951954 hasRelatedWork W2137525499 @default.
- W578951954 hasRelatedWork W2155828618 @default.
- W578951954 hasRelatedWork W2188741592 @default.
- W578951954 hasRelatedWork W2227939805 @default.
- W578951954 hasRelatedWork W2460178210 @default.
- W578951954 hasRelatedWork W2474186128 @default.
- W578951954 hasRelatedWork W2546753936 @default.
- W578951954 hasRelatedWork W2951286080 @default.
- W578951954 hasRelatedWork W60432105 @default.
- W578951954 hasRelatedWork W63199369 @default.
- W578951954 hasRelatedWork W769524000 @default.
- W578951954 isParatext "false" @default.
- W578951954 isRetracted "false" @default.
- W578951954 magId "578951954" @default.
- W578951954 workType "dissertation" @default.