Matches in SemOpenAlex for { <https://semopenalex.org/work/W1681759250> ?p ?o ?g. }
Showing items 1 to 59 of 59 with 100 items per page.
- W1681759250 endingPage "490" @default.
- W1681759250 startingPage "479" @default.
- W1681759250 abstract "We develop an abstract model of information acquisition from redundant data. We assume a random sampling process from data which contain information with bias and are interested in the fraction of information we expect to learn as function of (i) the sampled fraction (recall) and (ii) varying bias of information (redundancy distributions). We develop two rules of thumb with varying robustness. We first show that, when information bias follows a Zipf distribution, the 80-20 rule or Pareto principle does surprisingly not hold, and we rather expect to learn less than 40% of the information when randomly sampling 20% of the overall data. We then analytically prove that for large data sets, randomized sampling from power-law distributions leads to truncated distributions with the same power-law exponent. This second rule is very robust and also holds for distributions that deviate substantially from a strict power law. We further give one particular family of powerlaw functions that remain completely invariant under sampling. Finally, we validate our model with two large Web data sets: link distributions to web domains and tag distributions on delicious.com." @default.
- W1681759250 created "2016-06-24" @default.
- W1681759250 creator A5086781628 @default.
- W1681759250 date "2011-01-01" @default.
- W1681759250 modified "2023-09-25" @default.
- W1681759250 title "Rules of Thumb for Information Acquisition from Large and Redundant Data" @default.
- W1681759250 cites W157725869 @default.
- W1681759250 cites W1681759250 @default.
- W1681759250 cites W186403072 @default.
- W1681759250 cites W1993803315 @default.
- W1681759250 cites W2020340745 @default.
- W1681759250 cites W2031149459 @default.
- W1681759250 cites W2037022968 @default.
- W1681759250 cites W2048596679 @default.
- W1681759250 cites W2076914324 @default.
- W1681759250 cites W2103224511 @default.
- W1681759250 cites W2120511087 @default.
- W1681759250 cites W2133953554 @default.
- W1681759250 cites W2143439708 @default.
- W1681759250 cites W2950627632 @default.
- W1681759250 cites W3103362336 @default.
- W1681759250 doi "https://doi.org/10.1007/978-3-642-20161-5_47" @default.
- W1681759250 hasPublicationYear "2011" @default.
- W1681759250 type Work @default.
- W1681759250 sameAs 1681759250 @default.
- W1681759250 citedByCount "3" @default.
- W1681759250 countsByYear W16817592502013 @default.
- W1681759250 countsByYear W16817592502014 @default.
- W1681759250 crossrefType "book-chapter" @default.
- W1681759250 hasAuthorship W1681759250A5086781628 @default.
- W1681759250 hasBestOaLocation W16817592502 @default.
- W1681759250 hasConcept C11413529 @default.
- W1681759250 hasConcept C23123220 @default.
- W1681759250 hasConcept C41008148 @default.
- W1681759250 hasConcept C89246107 @default.
- W1681759250 hasConceptScore W1681759250C11413529 @default.
- W1681759250 hasConceptScore W1681759250C23123220 @default.
- W1681759250 hasConceptScore W1681759250C41008148 @default.
- W1681759250 hasConceptScore W1681759250C89246107 @default.
- W1681759250 hasLocation W16817592501 @default.
- W1681759250 hasLocation W16817592502 @default.
- W1681759250 hasOpenAccess W1681759250 @default.
- W1681759250 hasPrimaryLocation W16817592501 @default.
- W1681759250 hasRelatedWork W1561729373 @default.
- W1681759250 hasRelatedWork W2103338134 @default.
- W1681759250 hasRelatedWork W2115485936 @default.
- W1681759250 hasRelatedWork W2144190808 @default.
- W1681759250 hasRelatedWork W2153015554 @default.
- W1681759250 hasRelatedWork W2357241418 @default.
- W1681759250 hasRelatedWork W2366644548 @default.
- W1681759250 hasRelatedWork W2376314740 @default.
- W1681759250 hasRelatedWork W2384888906 @default.
- W1681759250 hasRelatedWork W3022131925 @default.
- W1681759250 isParatext "false" @default.
- W1681759250 isRetracted "false" @default.
- W1681759250 magId "1681759250" @default.
- W1681759250 workType "book-chapter" @default.