Matches in SemOpenAlex for { <https://semopenalex.org/work/W2092329487> ?p ?o ?g. }
- W2092329487 endingPage "430" @default.
- W2092329487 startingPage "422" @default.
- W2092329487 abstract "In a critical review of the heuristics used to deal with zero word frequencies, we show that four are suboptimal, one is good, and one may be acceptable. The four suboptimal strategies are discarding words with zero frequencies, giving words with zero frequencies a very low frequency, adding 1 to the frequency per million, and making use of the Good–Turing algorithm. The good algorithm is the Laplace transformation, which consists of adding 1 to each frequency count and increasing the total corpus size by the number of word types observed. A strategy that may be acceptable is to guess the frequency of absent words on the basis of other corpora and then increasing the total corpus size by the estimated summed frequency of the missing words. A comparison with the lexical decision times of the English Lexicon Project and the British Lexicon Project suggests that the Laplace transformation gives the most useful estimates (in addition to being easy to calculate). Therefore, we recommend it to researchers." @default.
- W2092329487 created "2016-06-24" @default.
- W2092329487 creator A5011816607 @default.
- W2092329487 creator A5083209779 @default.
- W2092329487 date "2012-10-06" @default.
- W2092329487 modified "2023-10-17" @default.
- W2092329487 title "Dealing with zero word frequencies: A review of the existing rules of thumb and a suggestion for an evidence-based choice" @default.
- W2092329487 cites W1997161938 @default.
- W2092329487 cites W2019096529 @default.
- W2092329487 cites W2063918473 @default.
- W2092329487 cites W2113676135 @default.
- W2092329487 cites W2115054880 @default.
- W2092329487 cites W2136525955 @default.
- W2092329487 cites W2168979204 @default.
- W2092329487 cites W2169716859 @default.
- W2092329487 doi "https://doi.org/10.3758/s13428-012-0270-5" @default.
- W2092329487 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/23055175" @default.
- W2092329487 hasPublicationYear "2012" @default.
- W2092329487 type Work @default.
- W2092329487 sameAs 2092329487 @default.
- W2092329487 citedByCount "28" @default.
- W2092329487 countsByYear W20923294872013 @default.
- W2092329487 countsByYear W20923294872014 @default.
- W2092329487 countsByYear W20923294872015 @default.
- W2092329487 countsByYear W20923294872016 @default.
- W2092329487 countsByYear W20923294872017 @default.
- W2092329487 countsByYear W20923294872018 @default.
- W2092329487 countsByYear W20923294872019 @default.
- W2092329487 countsByYear W20923294872020 @default.
- W2092329487 countsByYear W20923294872021 @default.
- W2092329487 countsByYear W20923294872022 @default.
- W2092329487 countsByYear W20923294872023 @default.
- W2092329487 crossrefType "journal-article" @default.
- W2092329487 hasAuthorship W2092329487A5011816607 @default.
- W2092329487 hasAuthorship W2092329487A5083209779 @default.
- W2092329487 hasBestOaLocation W20923294871 @default.
- W2092329487 hasConcept C104317684 @default.
- W2092329487 hasConcept C111919701 @default.
- W2092329487 hasConcept C11413529 @default.
- W2092329487 hasConcept C12426560 @default.
- W2092329487 hasConcept C127705205 @default.
- W2092329487 hasConcept C138885662 @default.
- W2092329487 hasConcept C154945302 @default.
- W2092329487 hasConcept C175293574 @default.
- W2092329487 hasConcept C185592680 @default.
- W2092329487 hasConcept C204241405 @default.
- W2092329487 hasConcept C204321447 @default.
- W2092329487 hasConcept C2524010 @default.
- W2092329487 hasConcept C2777530160 @default.
- W2092329487 hasConcept C2778121359 @default.
- W2092329487 hasConcept C2780813799 @default.
- W2092329487 hasConcept C28490314 @default.
- W2092329487 hasConcept C33923547 @default.
- W2092329487 hasConcept C41008148 @default.
- W2092329487 hasConcept C41895202 @default.
- W2092329487 hasConcept C55493867 @default.
- W2092329487 hasConcept C89246107 @default.
- W2092329487 hasConcept C90805587 @default.
- W2092329487 hasConceptScore W2092329487C104317684 @default.
- W2092329487 hasConceptScore W2092329487C111919701 @default.
- W2092329487 hasConceptScore W2092329487C11413529 @default.
- W2092329487 hasConceptScore W2092329487C12426560 @default.
- W2092329487 hasConceptScore W2092329487C127705205 @default.
- W2092329487 hasConceptScore W2092329487C138885662 @default.
- W2092329487 hasConceptScore W2092329487C154945302 @default.
- W2092329487 hasConceptScore W2092329487C175293574 @default.
- W2092329487 hasConceptScore W2092329487C185592680 @default.
- W2092329487 hasConceptScore W2092329487C204241405 @default.
- W2092329487 hasConceptScore W2092329487C204321447 @default.
- W2092329487 hasConceptScore W2092329487C2524010 @default.
- W2092329487 hasConceptScore W2092329487C2777530160 @default.
- W2092329487 hasConceptScore W2092329487C2778121359 @default.
- W2092329487 hasConceptScore W2092329487C2780813799 @default.
- W2092329487 hasConceptScore W2092329487C28490314 @default.
- W2092329487 hasConceptScore W2092329487C33923547 @default.
- W2092329487 hasConceptScore W2092329487C41008148 @default.
- W2092329487 hasConceptScore W2092329487C41895202 @default.
- W2092329487 hasConceptScore W2092329487C55493867 @default.
- W2092329487 hasConceptScore W2092329487C89246107 @default.
- W2092329487 hasConceptScore W2092329487C90805587 @default.
- W2092329487 hasIssue "2" @default.
- W2092329487 hasLocation W20923294871 @default.
- W2092329487 hasLocation W20923294872 @default.
- W2092329487 hasOpenAccess W2092329487 @default.
- W2092329487 hasPrimaryLocation W20923294871 @default.
- W2092329487 hasRelatedWork W1583489134 @default.
- W2092329487 hasRelatedWork W2068257033 @default.
- W2092329487 hasRelatedWork W2092329487 @default.
- W2092329487 hasRelatedWork W2122691642 @default.
- W2092329487 hasRelatedWork W2131738124 @default.
- W2092329487 hasRelatedWork W2296152660 @default.
- W2092329487 hasRelatedWork W2394602299 @default.
- W2092329487 hasRelatedWork W2740768945 @default.
- W2092329487 hasRelatedWork W2788211988 @default.
- W2092329487 hasRelatedWork W4372262688 @default.
- W2092329487 hasVolume "45" @default.
- W2092329487 isParatext "false" @default.
- W2092329487 isRetracted "false" @default.