Matches in SemOpenAlex for { <https://semopenalex.org/work/W97193787> ?p ?o ?g. }
Showing items 1 to 78 of
78
with 100 items per page.
- W97193787 abstract "Abstract : This project evaluates two families of algorithms that can be used to automatically classify general texts within a set of conceptual categories. The first family uses indirect evidence in the form of term-category co-occurrence data. The second uses direct evidence based on the senses of the terms, where a term's senses are designated by the categories that it is a member of in a thesaurus. The direct evidence algorithms incorporate varying degrees of indirect evidence as well. For these experiments a set of 3,864 conceptual categories were derived from the noun hierarchy of WordNet, an on-line thesaurus. The co-occurrence data for the associational and disambiguation algorithms was collected from a corpus of 3,711 AP newswire articles, comprising approximately 1.7 million words of text. Each of the algorithms was applied to all of the articles in the AP corpus, with their performance evaluated both qualitatively and quantitatively. The results of these experiments show that both classes of algorithms have potential as fully automatic text classifiers. The direct methods produce qualitatively better classifications than the indirect ones when applied to AP newswire texts. The direct methods also achieve both a higher precision, 86.75% correctly classified (best case) versus 72.34%, and a higher approximate recall. The experiments identify limiting factors on the performance of the algorithms. The primary limitations stem from the quality of the thesaural categories, which were derived automatically, and from the performance of the term sense disambiguation algorithm. The former can be addressed with human intervention, the latter with a larger training set for the statistical database" @default.
- W97193787 created "2016-06-24" @default.
- W97193787 creator A5078874539 @default.
- W97193787 date "1994-05-01" @default.
- W97193787 modified "2023-09-24" @default.
- W97193787 title "Topic Characterization of Full Length Texts Using Direct and Indirect Term Evidence" @default.
- W97193787 doi "https://doi.org/10.21236/ada632255" @default.
- W97193787 hasPublicationYear "1994" @default.
- W97193787 type Work @default.
- W97193787 sameAs 97193787 @default.
- W97193787 citedByCount "3" @default.
- W97193787 crossrefType "report" @default.
- W97193787 hasAuthorship W97193787A5078874539 @default.
- W97193787 hasConcept C121332964 @default.
- W97193787 hasConcept C121934690 @default.
- W97193787 hasConcept C124101348 @default.
- W97193787 hasConcept C153962237 @default.
- W97193787 hasConcept C154945302 @default.
- W97193787 hasConcept C157659113 @default.
- W97193787 hasConcept C162324750 @default.
- W97193787 hasConcept C177264268 @default.
- W97193787 hasConcept C199360897 @default.
- W97193787 hasConcept C204321447 @default.
- W97193787 hasConcept C23123220 @default.
- W97193787 hasConcept C25343380 @default.
- W97193787 hasConcept C2778698081 @default.
- W97193787 hasConcept C31170391 @default.
- W97193787 hasConcept C34447519 @default.
- W97193787 hasConcept C41008148 @default.
- W97193787 hasConcept C61797465 @default.
- W97193787 hasConcept C62520636 @default.
- W97193787 hasConcept C81669768 @default.
- W97193787 hasConceptScore W97193787C121332964 @default.
- W97193787 hasConceptScore W97193787C121934690 @default.
- W97193787 hasConceptScore W97193787C124101348 @default.
- W97193787 hasConceptScore W97193787C153962237 @default.
- W97193787 hasConceptScore W97193787C154945302 @default.
- W97193787 hasConceptScore W97193787C157659113 @default.
- W97193787 hasConceptScore W97193787C162324750 @default.
- W97193787 hasConceptScore W97193787C177264268 @default.
- W97193787 hasConceptScore W97193787C199360897 @default.
- W97193787 hasConceptScore W97193787C204321447 @default.
- W97193787 hasConceptScore W97193787C23123220 @default.
- W97193787 hasConceptScore W97193787C25343380 @default.
- W97193787 hasConceptScore W97193787C2778698081 @default.
- W97193787 hasConceptScore W97193787C31170391 @default.
- W97193787 hasConceptScore W97193787C34447519 @default.
- W97193787 hasConceptScore W97193787C41008148 @default.
- W97193787 hasConceptScore W97193787C61797465 @default.
- W97193787 hasConceptScore W97193787C62520636 @default.
- W97193787 hasConceptScore W97193787C81669768 @default.
- W97193787 hasLocation W971937871 @default.
- W97193787 hasOpenAccess W97193787 @default.
- W97193787 hasPrimaryLocation W971937871 @default.
- W97193787 hasRelatedWork W1542064421 @default.
- W97193787 hasRelatedWork W1970863518 @default.
- W97193787 hasRelatedWork W2017167913 @default.
- W97193787 hasRelatedWork W2082171025 @default.
- W97193787 hasRelatedWork W2101995630 @default.
- W97193787 hasRelatedWork W2112408041 @default.
- W97193787 hasRelatedWork W2162987981 @default.
- W97193787 hasRelatedWork W2183001427 @default.
- W97193787 hasRelatedWork W2322557215 @default.
- W97193787 hasRelatedWork W2768945436 @default.
- W97193787 hasRelatedWork W2777934728 @default.
- W97193787 hasRelatedWork W2886642366 @default.
- W97193787 hasRelatedWork W2900652827 @default.
- W97193787 hasRelatedWork W2905549383 @default.
- W97193787 hasRelatedWork W2909504796 @default.
- W97193787 hasRelatedWork W3002396220 @default.
- W97193787 hasRelatedWork W3135281810 @default.
- W97193787 hasRelatedWork W3212926946 @default.
- W97193787 hasRelatedWork W168167162 @default.
- W97193787 hasRelatedWork W88663061 @default.
- W97193787 isParatext "false" @default.
- W97193787 isRetracted "false" @default.
- W97193787 magId "97193787" @default.
- W97193787 workType "report" @default.