Matches in SemOpenAlex for { <https://semopenalex.org/work/W3012679427> ?p ?o ?g. }
Showing items 1 to 88 of 88 with 100 items per page.
- W3012679427 abstract "Data generated by web crawlers has formed the basis for much of our current understanding of the Internet. However, not all crawlers are created equal and crawlers generally find themselves trading off between computational overhead, developer effort, data accuracy, and completeness. Therefore, the choice of crawler has a critical impact on the data generated and knowledge inferred from it. In this paper, we conduct a systematic study of the trade-offs presented by different crawlers and the impact that these can have on various types of measurement studies. We make the following contributions: First, we conduct a survey of all research published since 2015 in the premier security and Internet measurement venues to identify and verify the repeatability of crawling methodologies deployed for different problem domains and publication venues. Next, we conduct a qualitative evaluation of a subset of all crawling tools identified in our survey. This evaluation allows us to draw conclusions about the suitability of each tool for specific types of data gathering. Finally, we present a methodology and a measurement framework to empirically highlight the differences between crawlers and how the choice of crawler can impact our understanding of the web." @default.
- W3012679427 created "2020-03-27" @default.
- W3012679427 creator A5016569835 @default.
- W3012679427 creator A5045896307 @default.
- W3012679427 creator A5046944830 @default.
- W3012679427 creator A5051828613 @default.
- W3012679427 creator A5069036689 @default.
- W3012679427 date "2020-04-20" @default.
- W3012679427 modified "2023-10-17" @default.
- W3012679427 title "Apophanies or Epiphanies? How Crawlers Impact Our Understanding of the Web" @default.
- W3012679427 cites W1965773314 @default.
- W3012679427 cites W2012210084 @default.
- W3012679427 cites W2015452169 @default.
- W3012679427 cites W2016697314 @default.
- W3012679427 cites W2046015955 @default.
- W3012679427 cites W2072841881 @default.
- W3012679427 cites W2108217512 @default.
- W3012679427 cites W2121973001 @default.
- W3012679427 cites W2135088779 @default.
- W3012679427 cites W2141554582 @default.
- W3012679427 cites W2159569559 @default.
- W3012679427 cites W2283438976 @default.
- W3012679427 cites W2535603283 @default.
- W3012679427 cites W2554791272 @default.
- W3012679427 cites W2561535538 @default.
- W3012679427 cites W2743928556 @default.
- W3012679427 cites W2782867239 @default.
- W3012679427 cites W2793294490 @default.
- W3012679427 cites W2801956435 @default.
- W3012679427 cites W2806944993 @default.
- W3012679427 cites W2891928976 @default.
- W3012679427 cites W2902341081 @default.
- W3012679427 cites W2947585023 @default.
- W3012679427 cites W4300175397 @default.
- W3012679427 doi "https://doi.org/10.1145/3366423.3380113" @default.
- W3012679427 hasPublicationYear "2020" @default.
- W3012679427 type Work @default.
- W3012679427 sameAs 3012679427 @default.
- W3012679427 citedByCount "6" @default.
- W3012679427 countsByYear W30126794272020 @default.
- W3012679427 countsByYear W30126794272021 @default.
- W3012679427 countsByYear W30126794272022 @default.
- W3012679427 countsByYear W30126794272023 @default.
- W3012679427 crossrefType "proceedings-article" @default.
- W3012679427 hasAuthorship W3012679427A5016569835 @default.
- W3012679427 hasAuthorship W3012679427A5045896307 @default.
- W3012679427 hasAuthorship W3012679427A5046944830 @default.
- W3012679427 hasAuthorship W3012679427A5051828613 @default.
- W3012679427 hasAuthorship W3012679427A5069036689 @default.
- W3012679427 hasConcept C100368936 @default.
- W3012679427 hasConcept C105702510 @default.
- W3012679427 hasConcept C110875604 @default.
- W3012679427 hasConcept C11392498 @default.
- W3012679427 hasConcept C136764020 @default.
- W3012679427 hasConcept C13743948 @default.
- W3012679427 hasConcept C173576120 @default.
- W3012679427 hasConcept C2522767166 @default.
- W3012679427 hasConcept C41008148 @default.
- W3012679427 hasConcept C71924100 @default.
- W3012679427 hasConcept C73340581 @default.
- W3012679427 hasConceptScore W3012679427C100368936 @default.
- W3012679427 hasConceptScore W3012679427C105702510 @default.
- W3012679427 hasConceptScore W3012679427C110875604 @default.
- W3012679427 hasConceptScore W3012679427C11392498 @default.
- W3012679427 hasConceptScore W3012679427C136764020 @default.
- W3012679427 hasConceptScore W3012679427C13743948 @default.
- W3012679427 hasConceptScore W3012679427C173576120 @default.
- W3012679427 hasConceptScore W3012679427C2522767166 @default.
- W3012679427 hasConceptScore W3012679427C41008148 @default.
- W3012679427 hasConceptScore W3012679427C71924100 @default.
- W3012679427 hasConceptScore W3012679427C73340581 @default.
- W3012679427 hasLocation W30126794271 @default.
- W3012679427 hasOpenAccess W3012679427 @default.
- W3012679427 hasPrimaryLocation W30126794271 @default.
- W3012679427 hasRelatedWork W1506122440 @default.
- W3012679427 hasRelatedWork W2019080882 @default.
- W3012679427 hasRelatedWork W2026132847 @default.
- W3012679427 hasRelatedWork W2042034567 @default.
- W3012679427 hasRelatedWork W2137810919 @default.
- W3012679427 hasRelatedWork W2274831913 @default.
- W3012679427 hasRelatedWork W2352686120 @default.
- W3012679427 hasRelatedWork W2358310581 @default.
- W3012679427 hasRelatedWork W2375180657 @default.
- W3012679427 hasRelatedWork W4385695127 @default.
- W3012679427 isParatext "false" @default.
- W3012679427 isRetracted "false" @default.
- W3012679427 magId "3012679427" @default.
- W3012679427 workType "article" @default.