Matches in SemOpenAlex for { <https://semopenalex.org/work/W2171886309> ?p ?o ?g. }
Showing items 1 to 79 of
79
with 100 items per page.
- W2171886309 abstract "Summary form only given. Test categorization is the assignment of natural language texts to predefined categories based on their concept. The use of predefined categories implies a supervised learning approach to categorization, where already-classified articles which effectively define the categories are used as training data to build a model that can be used for classifying new articles that comprise the data. Typical approaches extract features from articles and use the feature vectors as input to a machine learning scheme that learns how to classify articles. The features are generally words. It has often been observed that compression seems to provide a very promising alternative approach to categorization. The overall compression of an article with respect to different models can be compared to see which one it fits most closely. Such a scheme has several potential advantages: it yields an overall judgement on the document as a whole, rather than discarding information by pre-selecting features it avoids the messy and rather artificial problem of defining word boundaries; it deals uniformly with morphological variants of words; depending on the model (and its order), it can take account of phrasal effects that span word boundaries; it offers a uniform way of dealing with different types of documents for example, arbitrary files in a computer system; it generally minimizes arbitrary decisions that inevitably need to be taken to render any learning scheme practical." @default.
- W2171886309 created "2016-06-24" @default.
- W2171886309 creator A5045934263 @default.
- W2171886309 creator A5051886053 @default.
- W2171886309 creator A5059992863 @default.
- W2171886309 date "2002-11-07" @default.
- W2171886309 modified "2023-09-23" @default.
- W2171886309 title "Text categorization using compression models" @default.
- W2171886309 cites W2171886309 @default.
- W2171886309 doi "https://doi.org/10.1109/dcc.2000.838202" @default.
- W2171886309 hasPublicationYear "2002" @default.
- W2171886309 type Work @default.
- W2171886309 sameAs 2171886309 @default.
- W2171886309 citedByCount "56" @default.
- W2171886309 countsByYear W21718863092012 @default.
- W2171886309 countsByYear W21718863092013 @default.
- W2171886309 countsByYear W21718863092014 @default.
- W2171886309 countsByYear W21718863092015 @default.
- W2171886309 countsByYear W21718863092016 @default.
- W2171886309 countsByYear W21718863092017 @default.
- W2171886309 countsByYear W21718863092018 @default.
- W2171886309 countsByYear W21718863092019 @default.
- W2171886309 countsByYear W21718863092021 @default.
- W2171886309 crossrefType "proceedings-article" @default.
- W2171886309 hasAuthorship W2171886309A5045934263 @default.
- W2171886309 hasAuthorship W2171886309A5051886053 @default.
- W2171886309 hasAuthorship W2171886309A5059992863 @default.
- W2171886309 hasBestOaLocation W21718863092 @default.
- W2171886309 hasConcept C119857082 @default.
- W2171886309 hasConcept C134306372 @default.
- W2171886309 hasConcept C138885662 @default.
- W2171886309 hasConcept C154945302 @default.
- W2171886309 hasConcept C17744445 @default.
- W2171886309 hasConcept C195324797 @default.
- W2171886309 hasConcept C199539241 @default.
- W2171886309 hasConcept C204321447 @default.
- W2171886309 hasConcept C2776401178 @default.
- W2171886309 hasConcept C2776548248 @default.
- W2171886309 hasConcept C33923547 @default.
- W2171886309 hasConcept C41008148 @default.
- W2171886309 hasConcept C41895202 @default.
- W2171886309 hasConcept C77618280 @default.
- W2171886309 hasConcept C90805587 @default.
- W2171886309 hasConcept C94124525 @default.
- W2171886309 hasConceptScore W2171886309C119857082 @default.
- W2171886309 hasConceptScore W2171886309C134306372 @default.
- W2171886309 hasConceptScore W2171886309C138885662 @default.
- W2171886309 hasConceptScore W2171886309C154945302 @default.
- W2171886309 hasConceptScore W2171886309C17744445 @default.
- W2171886309 hasConceptScore W2171886309C195324797 @default.
- W2171886309 hasConceptScore W2171886309C199539241 @default.
- W2171886309 hasConceptScore W2171886309C204321447 @default.
- W2171886309 hasConceptScore W2171886309C2776401178 @default.
- W2171886309 hasConceptScore W2171886309C2776548248 @default.
- W2171886309 hasConceptScore W2171886309C33923547 @default.
- W2171886309 hasConceptScore W2171886309C41008148 @default.
- W2171886309 hasConceptScore W2171886309C41895202 @default.
- W2171886309 hasConceptScore W2171886309C77618280 @default.
- W2171886309 hasConceptScore W2171886309C90805587 @default.
- W2171886309 hasConceptScore W2171886309C94124525 @default.
- W2171886309 hasLocation W21718863091 @default.
- W2171886309 hasLocation W21718863092 @default.
- W2171886309 hasLocation W21718863093 @default.
- W2171886309 hasOpenAccess W2171886309 @default.
- W2171886309 hasPrimaryLocation W21718863091 @default.
- W2171886309 hasRelatedWork W159132833 @default.
- W2171886309 hasRelatedWork W1806995473 @default.
- W2171886309 hasRelatedWork W1925652344 @default.
- W2171886309 hasRelatedWork W1967203824 @default.
- W2171886309 hasRelatedWork W2293457016 @default.
- W2171886309 hasRelatedWork W2365213443 @default.
- W2171886309 hasRelatedWork W2384103485 @default.
- W2171886309 hasRelatedWork W3107474891 @default.
- W2171886309 hasRelatedWork W1872130062 @default.
- W2171886309 hasRelatedWork W2188432624 @default.
- W2171886309 isParatext "false" @default.
- W2171886309 isRetracted "false" @default.
- W2171886309 magId "2171886309" @default.
- W2171886309 workType "article" @default.