Matches in SemOpenAlex for { <https://semopenalex.org/work/W2023165943> ?p ?o ?g. }
- W2023165943 endingPage "1051" @default.
- W2023165943 startingPage "1051" @default.
- W2023165943 abstract "With the massive and ever increasing pages in the Web, incremental crawling has become a promising method to achieve on-line information. Its main advantage is the resource economization, which comes from the avoidance of downloading unchanged pages. For the precision of change prediction, the evolution of Web is generally studied to find out how pages change. In sum, incremental crawlers often integrate change frequency, change extent, and document quality for each page to determine its relative order as well as its download frequency. In this paper, the researches on Web evolution and incremental crawling in recent years are summarized: First, the change of page is modeled as a Poisson process, and the solutions are given to estimate its parameters, especially the change frequency, and then experimental results are shown. Second, based on the change of pages, three public large-scale incremental crawling systems are introduced, with emphasis on their scheduling policies and strategies to enhance page qualities. Third, theoretical analysis and exploration are performed to find the optimal scheduling policy, three approaches from different points of views are utilized to achieve this object, and a heuristic approximate solution is supplied for the feasibility in practice. Finally, research trends in this area are predicted, and three main issues are listed." @default.
- W2023165943 created "2016-06-24" @default.
- W2023165943 creator A5084342142 @default.
- W2023165943 date "2006-01-01" @default.
- W2023165943 modified "2023-09-26" @default.
- W2023165943 title "Web Evolution and Incremental Crawling " @default.
- W2023165943 cites W1489021933 @default.
- W2023165943 cites W1506845741 @default.
- W2023165943 cites W1546842576 @default.
- W2023165943 cites W1566984846 @default.
- W2023165943 cites W1574322650 @default.
- W2023165943 cites W1591738266 @default.
- W2023165943 cites W1604630858 @default.
- W2023165943 cites W1613836731 @default.
- W2023165943 cites W1845137714 @default.
- W2023165943 cites W1854214752 @default.
- W2023165943 cites W1965061793 @default.
- W2023165943 cites W1967825155 @default.
- W2023165943 cites W1968155810 @default.
- W2023165943 cites W1976624301 @default.
- W2023165943 cites W1978056887 @default.
- W2023165943 cites W1987272746 @default.
- W2023165943 cites W1987365175 @default.
- W2023165943 cites W1988020171 @default.
- W2023165943 cites W1994687138 @default.
- W2023165943 cites W1994727615 @default.
- W2023165943 cites W1995821742 @default.
- W2023165943 cites W1997438973 @default.
- W2023165943 cites W1999548128 @default.
- W2023165943 cites W2000273502 @default.
- W2023165943 cites W2002287579 @default.
- W2023165943 cites W2004943036 @default.
- W2023165943 cites W2007687650 @default.
- W2023165943 cites W2014134732 @default.
- W2023165943 cites W2016122268 @default.
- W2023165943 cites W2018928332 @default.
- W2023165943 cites W2023162794 @default.
- W2023165943 cites W2028014545 @default.
- W2023165943 cites W2029341294 @default.
- W2023165943 cites W2029500199 @default.
- W2023165943 cites W2030453570 @default.
- W2023165943 cites W2031160476 @default.
- W2023165943 cites W2038378248 @default.
- W2023165943 cites W2043435490 @default.
- W2023165943 cites W2053049304 @default.
- W2023165943 cites W2056598903 @default.
- W2023165943 cites W2059713800 @default.
- W2023165943 cites W2059723463 @default.
- W2023165943 cites W2065356639 @default.
- W2023165943 cites W2066636486 @default.
- W2023165943 cites W2081470778 @default.
- W2023165943 cites W2087303323 @default.
- W2023165943 cites W2096565380 @default.
- W2023165943 cites W2098660810 @default.
- W2023165943 cites W2099126271 @default.
- W2023165943 cites W2106180529 @default.
- W2023165943 cites W2113184419 @default.
- W2023165943 cites W2117044215 @default.
- W2023165943 cites W2120136770 @default.
- W2023165943 cites W2128272588 @default.
- W2023165943 cites W2129620481 @default.
- W2023165943 cites W2130225127 @default.
- W2023165943 cites W2134308336 @default.
- W2023165943 cites W2134549628 @default.
- W2023165943 cites W2138621811 @default.
- W2023165943 cites W2139148100 @default.
- W2023165943 cites W2139532006 @default.
- W2023165943 cites W2140279085 @default.
- W2023165943 cites W2142747917 @default.
- W2023165943 cites W2148361904 @default.
- W2023165943 cites W2148839232 @default.
- W2023165943 cites W2151306141 @default.
- W2023165943 cites W2151932833 @default.
- W2023165943 cites W2152565070 @default.
- W2023165943 cites W2154116572 @default.
- W2023165943 cites W2155542681 @default.
- W2023165943 cites W2156632103 @default.
- W2023165943 cites W2157748587 @default.
- W2023165943 cites W2158058396 @default.
- W2023165943 cites W2158601853 @default.
- W2023165943 cites W2164542999 @default.
- W2023165943 cites W2165862792 @default.
- W2023165943 cites W2295141584 @default.
- W2023165943 cites W2353539429 @default.
- W2023165943 cites W2356600196 @default.
- W2023165943 cites W2137526809 @default.
- W2023165943 doi "https://doi.org/10.1360/jos171051" @default.
- W2023165943 hasPublicationYear "2006" @default.
- W2023165943 type Work @default.
- W2023165943 sameAs 2023165943 @default.
- W2023165943 citedByCount "6" @default.
- W2023165943 countsByYear W20231659432022 @default.
- W2023165943 crossrefType "journal-article" @default.
- W2023165943 hasAuthorship W2023165943A5084342142 @default.
- W2023165943 hasBestOaLocation W20231659432 @default.
- W2023165943 hasConcept C100368936 @default.
- W2023165943 hasConcept C105702510 @default.
- W2023165943 hasConcept C118643609 @default.