Matches in SemOpenAlex for { <https://semopenalex.org/work/W2792222265> ?p ?o ?g. }
- W2792222265 endingPage "92" @default.
- W2792222265 startingPage "67" @default.
- W2792222265 abstract "Despite recent effort to estimate topology characteristics of large graphs (e.g., online social networks and peer-to-peer networks), little attention has been given to develop a formal crawling methodology to characterize the vast amount of content distributed over these networks. Due to the large-scale nature of these networks and a limited query rate imposed by network service providers, exhaustively crawling and enumerating content maintained by each vertex is computationally prohibitive. In this paper, we show how one can obtain content properties by crawling only a small fraction of vertices and collecting their content. We first show that when sampling is naively applied, this can produce a huge bias in content statistics (i.e., average number of content replicas). To remove this bias, one may use maximum likelihood estimation to estimate content characteristics. However, our experimental results show that this straightforward method requires to sample most vertices to obtain accurate estimates. To address this challenge, we propose two efficient estimators: special copy estimator (SCE) and weighted copy estimator (WCE) to estimate content characteristics using available information in sampled content. SCE uses the special content copy indicator to compute the estimate, while WCE derives the estimate based on meta-information in sampled vertices. We conduct experiments on a variety of real-word and synthetic datasets, and the results show that WCE and SCE are cost effective and also “asymptotically unbiased”. Our methodology provides a new tool for researchers to efficiently query content distributed in large-scale networks." @default.
- W2792222265 created "2018-03-29" @default.
- W2792222265 creator A5007402211 @default.
- W2792222265 creator A5022240408 @default.
- W2792222265 creator A5036683370 @default.
- W2792222265 creator A5068489266 @default.
- W2792222265 creator A5075845093 @default.
- W2792222265 date "2018-03-15" @default.
- W2792222265 modified "2023-10-16" @default.
- W2792222265 title "Fast crawling methods of exploring content distributed over large graphs" @default.
- W2792222265 cites W1495750374 @default.
- W2792222265 cites W1524314678 @default.
- W2792222265 cites W1551652843 @default.
- W2792222265 cites W1993518004 @default.
- W2792222265 cites W2000280231 @default.
- W2792222265 cites W2004671457 @default.
- W2792222265 cites W2007168590 @default.
- W2792222265 cites W2022362911 @default.
- W2792222265 cites W2026318959 @default.
- W2792222265 cites W2031082424 @default.
- W2792222265 cites W2037774459 @default.
- W2792222265 cites W2074584493 @default.
- W2792222265 cites W2081082600 @default.
- W2792222265 cites W2089418726 @default.
- W2792222265 cites W2094308804 @default.
- W2792222265 cites W2094631006 @default.
- W2792222265 cites W2102260665 @default.
- W2792222265 cites W2103799649 @default.
- W2792222265 cites W2115022330 @default.
- W2792222265 cites W2117740169 @default.
- W2792222265 cites W2120511087 @default.
- W2792222265 cites W2124450885 @default.
- W2792222265 cites W2124533460 @default.
- W2792222265 cites W2127455097 @default.
- W2792222265 cites W2127503167 @default.
- W2792222265 cites W2134711723 @default.
- W2792222265 cites W2137135938 @default.
- W2792222265 cites W2138309709 @default.
- W2792222265 cites W2140591663 @default.
- W2792222265 cites W2140600512 @default.
- W2792222265 cites W2142645441 @default.
- W2792222265 cites W2152710063 @default.
- W2792222265 cites W2154191591 @default.
- W2792222265 cites W2158432527 @default.
- W2792222265 cites W2168380307 @default.
- W2792222265 cites W2242576003 @default.
- W2792222265 cites W2278390984 @default.
- W2792222265 cites W2293974546 @default.
- W2792222265 cites W2443728350 @default.
- W2792222265 cites W4233471163 @default.
- W2792222265 doi "https://doi.org/10.1007/s10115-018-1178-x" @default.
- W2792222265 hasPublicationYear "2018" @default.
- W2792222265 type Work @default.
- W2792222265 sameAs 2792222265 @default.
- W2792222265 citedByCount "2" @default.
- W2792222265 countsByYear W27922222652019 @default.
- W2792222265 countsByYear W27922222652022 @default.
- W2792222265 crossrefType "journal-article" @default.
- W2792222265 hasAuthorship W2792222265A5007402211 @default.
- W2792222265 hasAuthorship W2792222265A5022240408 @default.
- W2792222265 hasAuthorship W2792222265A5036683370 @default.
- W2792222265 hasAuthorship W2792222265A5068489266 @default.
- W2792222265 hasAuthorship W2792222265A5075845093 @default.
- W2792222265 hasBestOaLocation W27922222652 @default.
- W2792222265 hasConcept C100368936 @default.
- W2792222265 hasConcept C105702510 @default.
- W2792222265 hasConcept C105795698 @default.
- W2792222265 hasConcept C106131492 @default.
- W2792222265 hasConcept C11413529 @default.
- W2792222265 hasConcept C124101348 @default.
- W2792222265 hasConcept C132525143 @default.
- W2792222265 hasConcept C134306372 @default.
- W2792222265 hasConcept C136197465 @default.
- W2792222265 hasConcept C140779682 @default.
- W2792222265 hasConcept C149629883 @default.
- W2792222265 hasConcept C154945302 @default.
- W2792222265 hasConcept C178790620 @default.
- W2792222265 hasConcept C185429906 @default.
- W2792222265 hasConcept C185592680 @default.
- W2792222265 hasConcept C2778152352 @default.
- W2792222265 hasConcept C31972630 @default.
- W2792222265 hasConcept C33923547 @default.
- W2792222265 hasConcept C41008148 @default.
- W2792222265 hasConcept C71924100 @default.
- W2792222265 hasConcept C80444323 @default.
- W2792222265 hasConcept C80899671 @default.
- W2792222265 hasConceptScore W2792222265C100368936 @default.
- W2792222265 hasConceptScore W2792222265C105702510 @default.
- W2792222265 hasConceptScore W2792222265C105795698 @default.
- W2792222265 hasConceptScore W2792222265C106131492 @default.
- W2792222265 hasConceptScore W2792222265C11413529 @default.
- W2792222265 hasConceptScore W2792222265C124101348 @default.
- W2792222265 hasConceptScore W2792222265C132525143 @default.
- W2792222265 hasConceptScore W2792222265C134306372 @default.
- W2792222265 hasConceptScore W2792222265C136197465 @default.
- W2792222265 hasConceptScore W2792222265C140779682 @default.
- W2792222265 hasConceptScore W2792222265C149629883 @default.
- W2792222265 hasConceptScore W2792222265C154945302 @default.