Matches in SemOpenAlex for { <https://semopenalex.org/work/W2414925900> ?p ?o ?g. }
Showing items 1 to 65 of 65 with 100 items per page.
- W2414925900 abstract "Grammatical category ambiguity (distinct from semantic and structural ambiguity) is extremely frequent in natural language. In the Brown Corpus (a million-word grammatically tagged sample of English prose) 11% of all word forms, or 48% of all word instances, occur as members of more than one grammatical category. These figures greatly under-represent actual categorial ambiguity, for instance because uncommon words may seem unambiguous when they are actually not. Such frequent ambiguity poses extreme problems of non-determinism for parsers. Therefore means of resolving such ambiguities are important to the progress of natural language processing systems. This thesis examines probabilistic strategies for resolving categorial ambiguity. I consider the contextual probability of a given category, given a categorial context, and the relative probability that a given word form represents a particular category. Up to 96% of all words can be assigned the correct category without morphological analysis, special handling of idioms, or other non-probabilistic features. Dynamic programming yields disambiguation time directly proportional to text length. Probabilistic methods are thus both faster and more accurate than previous methods, and overcome the non-determinism which renders many other methods unworkable. I apply these methods to the Brown Corpus (English) and to the Greek New Testament (140,000 words of Koine Greek). I discuss the effects of various parameters on the accuracy of category assignment, and analyze the types and frequencies of residual errors. I report control studies which help to predict the algorithm's effectiveness for unrestricted text, and investigate the amount of normalization text required to obtain reliable probability estimates. Analyses of related information-theoretic properties of natural language corpora are also included, for example, investigations of the effect of sample size on measurement of entropy." @default.
- W2414925900 created "2016-06-24" @default.
- W2414925900 creator A5050073596 @default.
- W2414925900 date "1990-01-03" @default.
- W2414925900 modified "2023-09-27" @default.
- W2414925900 title "Stochastic methods for resolution of grammatical category ambiguity in inflected and uninflected languages" @default.
- W2414925900 hasPublicationYear "1990" @default.
- W2414925900 type Work @default.
- W2414925900 sameAs 2414925900 @default.
- W2414925900 citedByCount "3" @default.
- W2414925900 crossrefType "journal-article" @default.
- W2414925900 hasAuthorship W2414925900A5050073596 @default.
- W2414925900 hasConcept C138885662 @default.
- W2414925900 hasConcept C151730666 @default.
- W2414925900 hasConcept C154945302 @default.
- W2414925900 hasConcept C195324797 @default.
- W2414925900 hasConcept C199360897 @default.
- W2414925900 hasConcept C204321447 @default.
- W2414925900 hasConcept C2779343474 @default.
- W2414925900 hasConcept C2780522230 @default.
- W2414925900 hasConcept C41008148 @default.
- W2414925900 hasConcept C41895202 @default.
- W2414925900 hasConcept C49937458 @default.
- W2414925900 hasConcept C86803240 @default.
- W2414925900 hasConcept C90805587 @default.
- W2414925900 hasConceptScore W2414925900C138885662 @default.
- W2414925900 hasConceptScore W2414925900C151730666 @default.
- W2414925900 hasConceptScore W2414925900C154945302 @default.
- W2414925900 hasConceptScore W2414925900C195324797 @default.
- W2414925900 hasConceptScore W2414925900C199360897 @default.
- W2414925900 hasConceptScore W2414925900C204321447 @default.
- W2414925900 hasConceptScore W2414925900C2779343474 @default.
- W2414925900 hasConceptScore W2414925900C2780522230 @default.
- W2414925900 hasConceptScore W2414925900C41008148 @default.
- W2414925900 hasConceptScore W2414925900C41895202 @default.
- W2414925900 hasConceptScore W2414925900C49937458 @default.
- W2414925900 hasConceptScore W2414925900C86803240 @default.
- W2414925900 hasConceptScore W2414925900C90805587 @default.
- W2414925900 hasLocation W24149259001 @default.
- W2414925900 hasOpenAccess W2414925900 @default.
- W2414925900 hasPrimaryLocation W24149259001 @default.
- W2414925900 hasRelatedWork W125550881 @default.
- W2414925900 hasRelatedWork W148922567 @default.
- W2414925900 hasRelatedWork W1990438144 @default.
- W2414925900 hasRelatedWork W2107914787 @default.
- W2414925900 hasRelatedWork W2141503127 @default.
- W2414925900 hasRelatedWork W2154590657 @default.
- W2414925900 hasRelatedWork W2159289708 @default.
- W2414925900 hasRelatedWork W2172384523 @default.
- W2414925900 hasRelatedWork W2199603478 @default.
- W2414925900 hasRelatedWork W2347822881 @default.
- W2414925900 hasRelatedWork W2604367549 @default.
- W2414925900 hasRelatedWork W2616604614 @default.
- W2414925900 hasRelatedWork W2741706299 @default.
- W2414925900 hasRelatedWork W2760636332 @default.
- W2414925900 hasRelatedWork W2791104893 @default.
- W2414925900 hasRelatedWork W3122044586 @default.
- W2414925900 hasRelatedWork W3167589100 @default.
- W2414925900 hasRelatedWork W70866515 @default.
- W2414925900 hasRelatedWork W1487404635 @default.
- W2414925900 hasRelatedWork W2182297864 @default.
- W2414925900 isParatext "false" @default.
- W2414925900 isRetracted "false" @default.
- W2414925900 magId "2414925900" @default.
- W2414925900 workType "article" @default.