Matches in SemOpenAlex for { <https://semopenalex.org/work/W2029553648> ?p ?o ?g. }
- W2029553648 abstract "Inverse Document Frequency (IDF) is an important quantity in many applications, including Information Retrieval. IDF is defined in terms of document frequency, df (w), the number of documents that mention w at least once. This quantity is relatively easy to compute over textual documents, but spoken documents are more challenging. This paper considers two baselines: (1) an estimate based on the 1-best ASR output and (2) an estimate based on expected term frequencies computed from the lattice. We improve over these baselines by taking advantage of repetition. Whatever the document is about is likely to be repeated, unlike ASR errors, which tend to be more random (Poisson). In addition, we find it helpful to consider an ensemble of language models. There is an opportunity for the ensemble to reduce noise, assuming that the errors across language models are relatively uncorrelated. The opportunity for improvement is larger when WER is high. This paper considers a pairing task application that could benefit from improved estimates of df. The pairing task inputs conversational sides from the English Fisher corpus and outputs estimates of which sides were from the same conversation. Better estimates of df lead to better performance on this task." @default.
- W2029553648 created "2016-06-24" @default.
- W2029553648 creator A5006451413 @default.
- W2029553648 creator A5010122901 @default.
- W2029553648 creator A5024437840 @default.
- W2029553648 creator A5041367049 @default.
- W2029553648 creator A5054865609 @default.
- W2029553648 date "2011-12-01" @default.
- W2029553648 modified "2023-09-25" @default.
- W2029553648 title "Estimating document frequencies in a speech corpus" @default.
- W2029553648 cites W1508165687 @default.
- W2029553648 cites W1532325895 @default.
- W2029553648 cites W160722039 @default.
- W2029553648 cites W1976625337 @default.
- W2029553648 cites W1996903695 @default.
- W2029553648 cites W2046134527 @default.
- W2029553648 cites W2086904543 @default.
- W2029553648 cites W2130180273 @default.
- W2029553648 cites W2130629108 @default.
- W2029553648 cites W2138309071 @default.
- W2029553648 cites W7082836 @default.
- W2029553648 doi "https://doi.org/10.1109/asru.2011.6163966" @default.
- W2029553648 hasPublicationYear "2011" @default.
- W2029553648 type Work @default.
- W2029553648 sameAs 2029553648 @default.
- W2029553648 citedByCount "11" @default.
- W2029553648 countsByYear W20295536482012 @default.
- W2029553648 countsByYear W20295536482013 @default.
- W2029553648 countsByYear W20295536482014 @default.
- W2029553648 countsByYear W20295536482015 @default.
- W2029553648 crossrefType "proceedings-article" @default.
- W2029553648 hasAuthorship W2029553648A5006451413 @default.
- W2029553648 hasAuthorship W2029553648A5010122901 @default.
- W2029553648 hasAuthorship W2029553648A5024437840 @default.
- W2029553648 hasAuthorship W2029553648A5041367049 @default.
- W2029553648 hasAuthorship W2029553648A5054865609 @default.
- W2029553648 hasConcept C100906024 @default.
- W2029553648 hasConcept C105795698 @default.
- W2029553648 hasConcept C115961682 @default.
- W2029553648 hasConcept C121332964 @default.
- W2029553648 hasConcept C137293760 @default.
- W2029553648 hasConcept C138885662 @default.
- W2029553648 hasConcept C14103023 @default.
- W2029553648 hasConcept C154945302 @default.
- W2029553648 hasConcept C162324750 @default.
- W2029553648 hasConcept C169345407 @default.
- W2029553648 hasConcept C187736073 @default.
- W2029553648 hasConcept C204321447 @default.
- W2029553648 hasConcept C2776141515 @default.
- W2029553648 hasConcept C2777200299 @default.
- W2029553648 hasConcept C2780451532 @default.
- W2029553648 hasConcept C28490314 @default.
- W2029553648 hasConcept C33923547 @default.
- W2029553648 hasConcept C41008148 @default.
- W2029553648 hasConcept C41895202 @default.
- W2029553648 hasConcept C54101563 @default.
- W2029553648 hasConcept C61797465 @default.
- W2029553648 hasConcept C62520636 @default.
- W2029553648 hasConcept C81758059 @default.
- W2029553648 hasConcept C99498987 @default.
- W2029553648 hasConceptScore W2029553648C100906024 @default.
- W2029553648 hasConceptScore W2029553648C105795698 @default.
- W2029553648 hasConceptScore W2029553648C115961682 @default.
- W2029553648 hasConceptScore W2029553648C121332964 @default.
- W2029553648 hasConceptScore W2029553648C137293760 @default.
- W2029553648 hasConceptScore W2029553648C138885662 @default.
- W2029553648 hasConceptScore W2029553648C14103023 @default.
- W2029553648 hasConceptScore W2029553648C154945302 @default.
- W2029553648 hasConceptScore W2029553648C162324750 @default.
- W2029553648 hasConceptScore W2029553648C169345407 @default.
- W2029553648 hasConceptScore W2029553648C187736073 @default.
- W2029553648 hasConceptScore W2029553648C204321447 @default.
- W2029553648 hasConceptScore W2029553648C2776141515 @default.
- W2029553648 hasConceptScore W2029553648C2777200299 @default.
- W2029553648 hasConceptScore W2029553648C2780451532 @default.
- W2029553648 hasConceptScore W2029553648C28490314 @default.
- W2029553648 hasConceptScore W2029553648C33923547 @default.
- W2029553648 hasConceptScore W2029553648C41008148 @default.
- W2029553648 hasConceptScore W2029553648C41895202 @default.
- W2029553648 hasConceptScore W2029553648C54101563 @default.
- W2029553648 hasConceptScore W2029553648C61797465 @default.
- W2029553648 hasConceptScore W2029553648C62520636 @default.
- W2029553648 hasConceptScore W2029553648C81758059 @default.
- W2029553648 hasConceptScore W2029553648C99498987 @default.
- W2029553648 hasLocation W20295536481 @default.
- W2029553648 hasOpenAccess W2029553648 @default.
- W2029553648 hasPrimaryLocation W20295536481 @default.
- W2029553648 hasRelatedWork W142374489 @default.
- W2029553648 hasRelatedWork W1503858070 @default.
- W2029553648 hasRelatedWork W1542956019 @default.
- W2029553648 hasRelatedWork W1563618553 @default.
- W2029553648 hasRelatedWork W1803932089 @default.
- W2029553648 hasRelatedWork W2029553648 @default.
- W2029553648 hasRelatedWork W2081647779 @default.
- W2029553648 hasRelatedWork W2359001871 @default.
- W2029553648 hasRelatedWork W3107474891 @default.
- W2029553648 hasRelatedWork W4205820553 @default.
- W2029553648 isParatext "false" @default.
- W2029553648 isRetracted "false" @default.
- W2029553648 magId "2029553648" @default.