Matches in SemOpenAlex for { <https://semopenalex.org/work/W62198243> ?p ?o ?g. }
Showing items 1 to 86 of 86 with 100 items per page.
- W62198243 endingPage "357" @default.
- W62198243 startingPage "343" @default.
- W62198243 abstract "Uniform Resource Locator (URL) ordering algorithms are used by Web crawlers to determine the order in which to download pages from the Web. The current approaches for URL ordering based on link structure are expensive and/or miss many good pages, particularly in social network environments. In this paper, we present a novel URL ordering system that relies on a cooperative approach between crawlers and web servers based on file system and Web log information. In particular, we develop algorithms based on file timestamps and Web log internal and external counts. By using this change and popularity information for URL ordering, we are able to retrieve high quality pages earlier in the crawl while avoiding requests for pages that are unchanged or no longer available. We perform our experiments on two data sets using the Web logs from university and CiteSeer websites. On these data sets, we achieve a statistically significant improvement in the ordering of the high quality pages (as indicated by Google’s PageRank) of 57.2% and 65.7% over that of a breadth-first search crawl while increasing the number of unique pages gathered by skipping unchanged or deleted pages. Keywords: File System; Uniform Resource Locator; Content Management System; Cooperative Approach; Bandwidth Saving." @default.
- W62198243 created "2016-06-24" @default.
- W62198243 creator A5022867909 @default.
- W62198243 creator A5025853069 @default.
- W62198243 creator A5032138644 @default.
- W62198243 date "2012-01-01" @default.
- W62198243 modified "2023-09-24" @default.
- W62198243 title "A Cooperative Approach to Web Crawler URL Ordering" @default.
- W62198243 cites W2007687650 @default.
- W62198243 cites W2029341294 @default.
- W62198243 cites W2110073539 @default.
- W62198243 cites W2128384372 @default.
- W62198243 cites W2134308336 @default.
- W62198243 cites W2294632035 @default.
- W62198243 doi "https://doi.org/10.1007/978-3-642-23187-2_22" @default.
- W62198243 hasPublicationYear "2012" @default.
- W62198243 type Work @default.
- W62198243 sameAs 62198243 @default.
- W62198243 citedByCount "5" @default.
- W62198243 countsByYear W621982432012 @default.
- W62198243 countsByYear W621982432013 @default.
- W62198243 countsByYear W621982432014 @default.
- W62198243 countsByYear W621982432015 @default.
- W62198243 crossrefType "book-chapter" @default.
- W62198243 hasAuthorship W62198243A5022867909 @default.
- W62198243 hasAuthorship W62198243A5025853069 @default.
- W62198243 hasAuthorship W62198243A5032138644 @default.
- W62198243 hasConcept C104352257 @default.
- W62198243 hasConcept C110875604 @default.
- W62198243 hasConcept C11392498 @default.
- W62198243 hasConcept C113954288 @default.
- W62198243 hasConcept C136764020 @default.
- W62198243 hasConcept C13743948 @default.
- W62198243 hasConcept C15744967 @default.
- W62198243 hasConcept C173576120 @default.
- W62198243 hasConcept C195409031 @default.
- W62198243 hasConcept C21959979 @default.
- W62198243 hasConcept C23123220 @default.
- W62198243 hasConcept C2779172887 @default.
- W62198243 hasConcept C2780586970 @default.
- W62198243 hasConcept C38652104 @default.
- W62198243 hasConcept C41008148 @default.
- W62198243 hasConcept C61096286 @default.
- W62198243 hasConcept C71325787 @default.
- W62198243 hasConcept C73340581 @default.
- W62198243 hasConcept C77088390 @default.
- W62198243 hasConcept C77805123 @default.
- W62198243 hasConceptScore W62198243C104352257 @default.
- W62198243 hasConceptScore W62198243C110875604 @default.
- W62198243 hasConceptScore W62198243C11392498 @default.
- W62198243 hasConceptScore W62198243C113954288 @default.
- W62198243 hasConceptScore W62198243C136764020 @default.
- W62198243 hasConceptScore W62198243C13743948 @default.
- W62198243 hasConceptScore W62198243C15744967 @default.
- W62198243 hasConceptScore W62198243C173576120 @default.
- W62198243 hasConceptScore W62198243C195409031 @default.
- W62198243 hasConceptScore W62198243C21959979 @default.
- W62198243 hasConceptScore W62198243C23123220 @default.
- W62198243 hasConceptScore W62198243C2779172887 @default.
- W62198243 hasConceptScore W62198243C2780586970 @default.
- W62198243 hasConceptScore W62198243C38652104 @default.
- W62198243 hasConceptScore W62198243C41008148 @default.
- W62198243 hasConceptScore W62198243C61096286 @default.
- W62198243 hasConceptScore W62198243C71325787 @default.
- W62198243 hasConceptScore W62198243C73340581 @default.
- W62198243 hasConceptScore W62198243C77088390 @default.
- W62198243 hasConceptScore W62198243C77805123 @default.
- W62198243 hasLocation W621982431 @default.
- W62198243 hasOpenAccess W62198243 @default.
- W62198243 hasPrimaryLocation W621982431 @default.
- W62198243 hasRelatedWork W14315322 @default.
- W62198243 hasRelatedWork W1673346501 @default.
- W62198243 hasRelatedWork W1963973829 @default.
- W62198243 hasRelatedWork W1979144454 @default.
- W62198243 hasRelatedWork W2051135816 @default.
- W62198243 hasRelatedWork W2056064491 @default.
- W62198243 hasRelatedWork W2092706362 @default.
- W62198243 hasRelatedWork W2145288810 @default.
- W62198243 hasRelatedWork W2533432419 @default.
- W62198243 hasRelatedWork W62198243 @default.
- W62198243 isParatext "false" @default.
- W62198243 isRetracted "false" @default.
- W62198243 magId "62198243" @default.
- W62198243 workType "book-chapter" @default.