Matches in SemOpenAlex for { <https://semopenalex.org/work/W2158195707> ?p ?o ?g. }
Showing items 1 to 90 of
90
with 100 items per page.
- W2158195707 endingPage "393" @default.
- W2158195707 startingPage "359" @default.
- W2158195707 abstract "We survey the most widely-used algorithms for smoothing models for language n -gram modeling. We then present an extensive empirical comparison of several of these smoothing techniques, including those described by Jelinek and Mercer (1980); Katz (1987); Bell, Cleary and Witten (1990); Ney, Essen and Kneser (1994), and Kneser and Ney (1995). We investigate how factors such as training data size, training corpus (e.g. Brown vs. Wall Street Journal), count cutoffs, and n -gram order (bigram vs. trigram) affect the relative performance of these methods, which is measured through the cross-entropy of test data. We find that these factors can significantly affect the relative performance of models, with the most significant factor being training data size. Since no previous comparisons have examined these factors systematically, this is the first thorough characterization of the relative performance of various algorithms. In addition, we introduce methodologies for analyzing smoothing algorithm efficacy in detail, and using these techniques we motivate a novel variation of Kneser–Ney smoothing that consistently outperforms all other algorithms evaluated. Finally, results showing that improved language model smoothing leads to improved speech recognition performance are presented." @default.
- W2158195707 created "2016-06-24" @default.
- W2158195707 creator A5064207231 @default.
- W2158195707 creator A5090361189 @default.
- W2158195707 date "1999-10-01" @default.
- W2158195707 modified "2023-10-17" @default.
- W2158195707 title "An empirical study of smoothing techniques for language modeling" @default.
- W2158195707 cites W1536631629 @default.
- W2158195707 cites W1966812932 @default.
- W2158195707 cites W2059800182 @default.
- W2158195707 cites W2075201173 @default.
- W2158195707 cites W2082092506 @default.
- W2158195707 cites W2099345940 @default.
- W2158195707 cites W2113641473 @default.
- W2158195707 cites W2132957691 @default.
- W2158195707 cites W2134237567 @default.
- W2158195707 cites W2159782014 @default.
- W2158195707 cites W2168938909 @default.
- W2158195707 doi "https://doi.org/10.1006/csla.1999.0128" @default.
- W2158195707 hasPublicationYear "1999" @default.
- W2158195707 type Work @default.
- W2158195707 sameAs 2158195707 @default.
- W2158195707 citedByCount "1629" @default.
- W2158195707 countsByYear W21581957072012 @default.
- W2158195707 countsByYear W21581957072013 @default.
- W2158195707 countsByYear W21581957072014 @default.
- W2158195707 countsByYear W21581957072015 @default.
- W2158195707 countsByYear W21581957072016 @default.
- W2158195707 countsByYear W21581957072017 @default.
- W2158195707 countsByYear W21581957072018 @default.
- W2158195707 countsByYear W21581957072019 @default.
- W2158195707 countsByYear W21581957072020 @default.
- W2158195707 countsByYear W21581957072021 @default.
- W2158195707 countsByYear W21581957072022 @default.
- W2158195707 countsByYear W21581957072023 @default.
- W2158195707 crossrefType "journal-article" @default.
- W2158195707 hasAuthorship W2158195707A5064207231 @default.
- W2158195707 hasAuthorship W2158195707A5090361189 @default.
- W2158195707 hasBestOaLocation W21581957072 @default.
- W2158195707 hasConcept C106301342 @default.
- W2158195707 hasConcept C108757681 @default.
- W2158195707 hasConcept C11413529 @default.
- W2158195707 hasConcept C119857082 @default.
- W2158195707 hasConcept C121332964 @default.
- W2158195707 hasConcept C137293760 @default.
- W2158195707 hasConcept C137546455 @default.
- W2158195707 hasConcept C154945302 @default.
- W2158195707 hasConcept C204321447 @default.
- W2158195707 hasConcept C28490314 @default.
- W2158195707 hasConcept C31972630 @default.
- W2158195707 hasConcept C3770464 @default.
- W2158195707 hasConcept C41008148 @default.
- W2158195707 hasConcept C62520636 @default.
- W2158195707 hasConceptScore W2158195707C106301342 @default.
- W2158195707 hasConceptScore W2158195707C108757681 @default.
- W2158195707 hasConceptScore W2158195707C11413529 @default.
- W2158195707 hasConceptScore W2158195707C119857082 @default.
- W2158195707 hasConceptScore W2158195707C121332964 @default.
- W2158195707 hasConceptScore W2158195707C137293760 @default.
- W2158195707 hasConceptScore W2158195707C137546455 @default.
- W2158195707 hasConceptScore W2158195707C154945302 @default.
- W2158195707 hasConceptScore W2158195707C204321447 @default.
- W2158195707 hasConceptScore W2158195707C28490314 @default.
- W2158195707 hasConceptScore W2158195707C31972630 @default.
- W2158195707 hasConceptScore W2158195707C3770464 @default.
- W2158195707 hasConceptScore W2158195707C41008148 @default.
- W2158195707 hasConceptScore W2158195707C62520636 @default.
- W2158195707 hasIssue "4" @default.
- W2158195707 hasLocation W21581957071 @default.
- W2158195707 hasLocation W21581957072 @default.
- W2158195707 hasLocation W21581957073 @default.
- W2158195707 hasOpenAccess W2158195707 @default.
- W2158195707 hasPrimaryLocation W21581957071 @default.
- W2158195707 hasRelatedWork W1516007350 @default.
- W2158195707 hasRelatedWork W1602608327 @default.
- W2158195707 hasRelatedWork W1666710534 @default.
- W2158195707 hasRelatedWork W2002296182 @default.
- W2158195707 hasRelatedWork W2083462476 @default.
- W2158195707 hasRelatedWork W2115497217 @default.
- W2158195707 hasRelatedWork W2466762647 @default.
- W2158195707 hasRelatedWork W2756857732 @default.
- W2158195707 hasRelatedWork W2950186769 @default.
- W2158195707 hasRelatedWork W3107474891 @default.
- W2158195707 hasVolume "13" @default.
- W2158195707 isParatext "false" @default.
- W2158195707 isRetracted "false" @default.
- W2158195707 magId "2158195707" @default.
- W2158195707 workType "article" @default.