Matches in SemOpenAlex for { <https://semopenalex.org/work/W1500871507> ?p ?o ?g. }
- W1500871507 abstract "Natural languages are full of rules and exceptions. One of the most famous quantitative rules is Zipf's law, which states that the frequency of occurrence of a word is approximately inversely proportional to its rank. Though this law of ranks has been found to hold across disparate texts and forms of data, analyses of increasingly large corpora since the late 1990s have revealed the existence of two scaling regimes. These regimes have thus far been explained by a hypothesis suggesting a separability of languages into core and noncore lexica. Here we present and defend an alternative hypothesis that the two scaling regimes result from the act of aggregating texts. We observe that text mixing leads to an effective decay of word introduction, which we show provides accurate predictions of the location and severity of breaks in scaling. Upon examining large corpora from 10 languages in the Project Gutenberg eBooks collection, we find emphatic empirical support for the universality of our claim." @default.
- W1500871507 created "2016-06-24" @default.
- W1500871507 creator A5002034958 @default.
- W1500871507 creator A5027686067 @default.
- W1500871507 creator A5040821463 @default.
- W1500871507 creator A5062403203 @default.
- W1500871507 date "2015-05-20" @default.
- W1500871507 modified "2023-10-01" @default.
- W1500871507 title "Text mixing shapes the anatomy of rank-frequency distributions" @default.
- W1500871507 cites W1992021819 @default.
- W1500871507 cites W2008203686 @default.
- W1500871507 cites W2008620264 @default.
- W1500871507 cites W2036671379 @default.
- W1500871507 cites W2090618725 @default.
- W1500871507 cites W2135869163 @default.
- W1500871507 cites W2160943512 @default.
- W1500871507 cites W2963726000 @default.
- W1500871507 cites W3099460613 @default.
- W1500871507 doi "https://doi.org/10.1103/physreve.91.052811" @default.
- W1500871507 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/26066216" @default.
- W1500871507 hasPublicationYear "2015" @default.
- W1500871507 type Work @default.
- W1500871507 sameAs 1500871507 @default.
- W1500871507 citedByCount "25" @default.
- W1500871507 countsByYear W15008715072015 @default.
- W1500871507 countsByYear W15008715072016 @default.
- W1500871507 countsByYear W15008715072017 @default.
- W1500871507 countsByYear W15008715072018 @default.
- W1500871507 countsByYear W15008715072019 @default.
- W1500871507 countsByYear W15008715072020 @default.
- W1500871507 countsByYear W15008715072021 @default.
- W1500871507 countsByYear W15008715072022 @default.
- W1500871507 countsByYear W15008715072023 @default.
- W1500871507 crossrefType "journal-article" @default.
- W1500871507 hasAuthorship W1500871507A5002034958 @default.
- W1500871507 hasAuthorship W1500871507A5027686067 @default.
- W1500871507 hasAuthorship W1500871507A5040821463 @default.
- W1500871507 hasAuthorship W1500871507A5062403203 @default.
- W1500871507 hasBestOaLocation W15008715072 @default.
- W1500871507 hasConcept C105795698 @default.
- W1500871507 hasConcept C114614502 @default.
- W1500871507 hasConcept C121332964 @default.
- W1500871507 hasConcept C121864883 @default.
- W1500871507 hasConcept C125932096 @default.
- W1500871507 hasConcept C138777275 @default.
- W1500871507 hasConcept C138885662 @default.
- W1500871507 hasConcept C149782125 @default.
- W1500871507 hasConcept C164226766 @default.
- W1500871507 hasConcept C175293574 @default.
- W1500871507 hasConcept C183992945 @default.
- W1500871507 hasConcept C204321447 @default.
- W1500871507 hasConcept C2524010 @default.
- W1500871507 hasConcept C2777530160 @default.
- W1500871507 hasConcept C2988430800 @default.
- W1500871507 hasConcept C33923547 @default.
- W1500871507 hasConcept C41008148 @default.
- W1500871507 hasConcept C41895202 @default.
- W1500871507 hasConcept C51921466 @default.
- W1500871507 hasConcept C62520636 @default.
- W1500871507 hasConcept C90805587 @default.
- W1500871507 hasConcept C99844830 @default.
- W1500871507 hasConceptScore W1500871507C105795698 @default.
- W1500871507 hasConceptScore W1500871507C114614502 @default.
- W1500871507 hasConceptScore W1500871507C121332964 @default.
- W1500871507 hasConceptScore W1500871507C121864883 @default.
- W1500871507 hasConceptScore W1500871507C125932096 @default.
- W1500871507 hasConceptScore W1500871507C138777275 @default.
- W1500871507 hasConceptScore W1500871507C138885662 @default.
- W1500871507 hasConceptScore W1500871507C149782125 @default.
- W1500871507 hasConceptScore W1500871507C164226766 @default.
- W1500871507 hasConceptScore W1500871507C175293574 @default.
- W1500871507 hasConceptScore W1500871507C183992945 @default.
- W1500871507 hasConceptScore W1500871507C204321447 @default.
- W1500871507 hasConceptScore W1500871507C2524010 @default.
- W1500871507 hasConceptScore W1500871507C2777530160 @default.
- W1500871507 hasConceptScore W1500871507C2988430800 @default.
- W1500871507 hasConceptScore W1500871507C33923547 @default.
- W1500871507 hasConceptScore W1500871507C41008148 @default.
- W1500871507 hasConceptScore W1500871507C41895202 @default.
- W1500871507 hasConceptScore W1500871507C51921466 @default.
- W1500871507 hasConceptScore W1500871507C62520636 @default.
- W1500871507 hasConceptScore W1500871507C90805587 @default.
- W1500871507 hasConceptScore W1500871507C99844830 @default.
- W1500871507 hasIssue "5" @default.
- W1500871507 hasLocation W15008715071 @default.
- W1500871507 hasLocation W15008715072 @default.
- W1500871507 hasLocation W15008715073 @default.
- W1500871507 hasLocation W15008715074 @default.
- W1500871507 hasLocation W15008715075 @default.
- W1500871507 hasOpenAccess W1500871507 @default.
- W1500871507 hasPrimaryLocation W15008715071 @default.
- W1500871507 hasRelatedWork W1500871507 @default.
- W1500871507 hasRelatedWork W1990001655 @default.
- W1500871507 hasRelatedWork W2008447690 @default.
- W1500871507 hasRelatedWork W2069979223 @default.
- W1500871507 hasRelatedWork W2089769688 @default.
- W1500871507 hasRelatedWork W2090220080 @default.
- W1500871507 hasRelatedWork W2121424285 @default.
- W1500871507 hasRelatedWork W2310664541 @default.
- W1500871507 hasRelatedWork W3102653896 @default.
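
The abstract recorded above states Zipf's law: a word's frequency is approximately inversely proportional to its rank. The following is a minimal sketch of how a rank-frequency list can be built from tokens and compared with that idealized prediction. It is not the authors' method; the function names `rank_frequency` and `zipf_prediction`, the `exponent` parameter, and the toy sentence are assumptions made purely for illustration.

```python
from collections import Counter


def rank_frequency(tokens):
    """Return (rank, frequency) pairs, most frequent word first (rank 1)."""
    counts = Counter(tokens)
    freqs = sorted(counts.values(), reverse=True)
    return list(enumerate(freqs, start=1))


def zipf_prediction(top_frequency, rank, exponent=1.0):
    """Idealized Zipf frequency at a given rank: f(r) = f(1) / r**exponent.

    The exponent of 1.0 is the classic textbook value, assumed here.
    """
    return top_frequency / rank ** exponent


# Toy input (hypothetical); a real corpus would be far larger.
tokens = "the cat and the dog and the bird".split()
pairs = rank_frequency(tokens)
top = pairs[0][1]

for rank, freq in pairs:
    # Observed frequency next to the idealized Zipf value for that rank.
    print(rank, freq, round(zipf_prediction(top, rank), 2))
```

On a corpus of realistic size, plotting the observed pairs on log-log axes is the usual way to look for the kind of break in scaling that the abstract describes.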