Matches in SemOpenAlex for { <https://semopenalex.org/work/W2000665223> ?p ?o ?g. }
Showing items 1 to 95 of
95
with 100 items per page.
- W2000665223 abstract "Statistical part-of-speech (POS) taggers achieve high accuracy and robustness when based on large scale manually tagged corpora. However, enhancements of the learning models are necessary to achieve better performance. We are developing a learning tool for a Japanese morphological analyzer called ChaSen. Currently we use a fine-grained POS tag set with about 500 tags. To apply a normal tri gram model on the tag set, we need unrealistic size of corpora. Even, for a bi-gram model, we cannot prepare a moderate size of an annotated corpus, when we take all the tags as distinct. A usual technique to cope with such fine-grained tags is to reduce the size of the tag set by grouping the set of tags into equivalence classes. We introduce the concept of position-wise grouping where the tag set is partitioned into different equivalence classes at each position in the conditional probabilities in the Markov Model. Moreover, to cope with the data sparseness problem caused by exceptional phenomena, we introduce several other techniques such as word-level statistics, smoothing of word-level and POS-level statistics and a selective tri-gram model. To help users determine probabilistic parameters, we introduce an error-driven method for the parameter selection. We then give results of experiments to see the effect of the tools applied to an existing Japanese morphological analyzer." @default.
- W2000665223 created "2016-06-24" @default.
- W2000665223 creator A5002196974 @default.
- W2000665223 creator A5072032804 @default.
- W2000665223 date "2000-01-01" @default.
- W2000665223 modified "2023-10-16" @default.
- W2000665223 title "Extended models and tools for high-performance part-of-speech tagger" @default.
- W2000665223 cites W1508165687 @default.
- W2000665223 cites W1522263329 @default.
- W2000665223 cites W1966320875 @default.
- W2000665223 cites W2046224275 @default.
- W2000665223 cites W2066539191 @default.
- W2000665223 cites W2117400858 @default.
- W2000665223 doi "https://doi.org/10.3115/990820.990824" @default.
- W2000665223 hasPublicationYear "2000" @default.
- W2000665223 type Work @default.
- W2000665223 sameAs 2000665223 @default.
- W2000665223 citedByCount "95" @default.
- W2000665223 countsByYear W20006652232012 @default.
- W2000665223 countsByYear W20006652232013 @default.
- W2000665223 countsByYear W20006652232014 @default.
- W2000665223 countsByYear W20006652232015 @default.
- W2000665223 countsByYear W20006652232016 @default.
- W2000665223 countsByYear W20006652232017 @default.
- W2000665223 countsByYear W20006652232018 @default.
- W2000665223 countsByYear W20006652232019 @default.
- W2000665223 countsByYear W20006652232023 @default.
- W2000665223 crossrefType "proceedings-article" @default.
- W2000665223 hasAuthorship W2000665223A5002196974 @default.
- W2000665223 hasAuthorship W2000665223A5072032804 @default.
- W2000665223 hasBestOaLocation W20006652231 @default.
- W2000665223 hasConcept C104317684 @default.
- W2000665223 hasConcept C114289077 @default.
- W2000665223 hasConcept C117884012 @default.
- W2000665223 hasConcept C119857082 @default.
- W2000665223 hasConcept C123406163 @default.
- W2000665223 hasConcept C137293760 @default.
- W2000665223 hasConcept C138885662 @default.
- W2000665223 hasConcept C154945302 @default.
- W2000665223 hasConcept C177264268 @default.
- W2000665223 hasConcept C185592680 @default.
- W2000665223 hasConcept C199360897 @default.
- W2000665223 hasConcept C204321447 @default.
- W2000665223 hasConcept C23224414 @default.
- W2000665223 hasConcept C2780069185 @default.
- W2000665223 hasConcept C28490314 @default.
- W2000665223 hasConcept C31972630 @default.
- W2000665223 hasConcept C3770464 @default.
- W2000665223 hasConcept C41008148 @default.
- W2000665223 hasConcept C41895202 @default.
- W2000665223 hasConcept C49937458 @default.
- W2000665223 hasConcept C55493867 @default.
- W2000665223 hasConcept C63479239 @default.
- W2000665223 hasConcept C90805587 @default.
- W2000665223 hasConceptScore W2000665223C104317684 @default.
- W2000665223 hasConceptScore W2000665223C114289077 @default.
- W2000665223 hasConceptScore W2000665223C117884012 @default.
- W2000665223 hasConceptScore W2000665223C119857082 @default.
- W2000665223 hasConceptScore W2000665223C123406163 @default.
- W2000665223 hasConceptScore W2000665223C137293760 @default.
- W2000665223 hasConceptScore W2000665223C138885662 @default.
- W2000665223 hasConceptScore W2000665223C154945302 @default.
- W2000665223 hasConceptScore W2000665223C177264268 @default.
- W2000665223 hasConceptScore W2000665223C185592680 @default.
- W2000665223 hasConceptScore W2000665223C199360897 @default.
- W2000665223 hasConceptScore W2000665223C204321447 @default.
- W2000665223 hasConceptScore W2000665223C23224414 @default.
- W2000665223 hasConceptScore W2000665223C2780069185 @default.
- W2000665223 hasConceptScore W2000665223C28490314 @default.
- W2000665223 hasConceptScore W2000665223C31972630 @default.
- W2000665223 hasConceptScore W2000665223C3770464 @default.
- W2000665223 hasConceptScore W2000665223C41008148 @default.
- W2000665223 hasConceptScore W2000665223C41895202 @default.
- W2000665223 hasConceptScore W2000665223C49937458 @default.
- W2000665223 hasConceptScore W2000665223C55493867 @default.
- W2000665223 hasConceptScore W2000665223C63479239 @default.
- W2000665223 hasConceptScore W2000665223C90805587 @default.
- W2000665223 hasLocation W20006652231 @default.
- W2000665223 hasLocation W20006652232 @default.
- W2000665223 hasOpenAccess W2000665223 @default.
- W2000665223 hasPrimaryLocation W20006652231 @default.
- W2000665223 hasRelatedWork W1736441281 @default.
- W2000665223 hasRelatedWork W2008468404 @default.
- W2000665223 hasRelatedWork W2118172714 @default.
- W2000665223 hasRelatedWork W2120010607 @default.
- W2000665223 hasRelatedWork W2121038688 @default.
- W2000665223 hasRelatedWork W2132957691 @default.
- W2000665223 hasRelatedWork W2374918184 @default.
- W2000665223 hasRelatedWork W3196833733 @default.
- W2000665223 hasRelatedWork W3210039896 @default.
- W2000665223 hasRelatedWork W2336634055 @default.
- W2000665223 isParatext "false" @default.
- W2000665223 isRetracted "false" @default.
- W2000665223 magId "2000665223" @default.
- W2000665223 workType "article" @default.