Matches in SemOpenAlex for { <https://semopenalex.org/work/W2001738676> ?p ?o ?g. }
- W2001738676 endingPage "159" @default.
- W2001738676 startingPage "137" @default.
- W2001738676 abstract "Text mining is about inferring structure from sequences representing natural language text, and may be defined as the process of analyzing text to extract information that is useful for particular purposes. Although hand-crafted heuristics are a common practical approach for extracting information from text, a general, and generalizable, approach requires adaptive techniques. This paper studies the way in which the adaptive techniques used in text compression can be applied to text mining. It develops several examples: extraction of hierarchical phrase structures from text, identification of keyphrases in documents, locating proper names and quantities of interest in a piece of text, text categorization, word segmentation, acronym extraction, and structure recognition. We conclude that compression forms a sound unifying principle that allows many text mining problems to be tacked adaptively." @default.
- W2001738676 created "2016-06-24" @default.
- W2001738676 creator A5034144529 @default.
- W2001738676 date "2004-06-01" @default.
- W2001738676 modified "2023-09-29" @default.
- W2001738676 title "Adaptive text mining: inferring structure from sequences" @default.
- W2001738676 cites W1991133427 @default.
- W2001738676 cites W2015806650 @default.
- W2001738676 cites W2065343708 @default.
- W2001738676 cites W2105686649 @default.
- W2001738676 cites W2107745473 @default.
- W2001738676 cites W2109893852 @default.
- W2001738676 cites W2122962290 @default.
- W2001738676 cites W2130350995 @default.
- W2001738676 cites W2145766604 @default.
- W2001738676 cites W2158291389 @default.
- W2001738676 cites W2159824188 @default.
- W2001738676 cites W2161628678 @default.
- W2001738676 cites W2171257168 @default.
- W2001738676 cites W2420187884 @default.
- W2001738676 cites W4212883601 @default.
- W2001738676 doi "https://doi.org/10.1016/s1570-8667(03)00084-4" @default.
- W2001738676 hasPublicationYear "2004" @default.
- W2001738676 type Work @default.
- W2001738676 sameAs 2001738676 @default.
- W2001738676 citedByCount "29" @default.
- W2001738676 countsByYear W20017386762012 @default.
- W2001738676 countsByYear W20017386762013 @default.
- W2001738676 countsByYear W20017386762015 @default.
- W2001738676 countsByYear W20017386762016 @default.
- W2001738676 countsByYear W20017386762018 @default.
- W2001738676 countsByYear W20017386762019 @default.
- W2001738676 countsByYear W20017386762020 @default.
- W2001738676 countsByYear W20017386762021 @default.
- W2001738676 crossrefType "journal-article" @default.
- W2001738676 hasAuthorship W2001738676A5034144529 @default.
- W2001738676 hasBestOaLocation W20017386761 @default.
- W2001738676 hasConcept C111919701 @default.
- W2001738676 hasConcept C116834253 @default.
- W2001738676 hasConcept C127705205 @default.
- W2001738676 hasConcept C138885662 @default.
- W2001738676 hasConcept C151375590 @default.
- W2001738676 hasConcept C154945302 @default.
- W2001738676 hasConcept C195807954 @default.
- W2001738676 hasConcept C204321447 @default.
- W2001738676 hasConcept C23123220 @default.
- W2001738676 hasConcept C2776224158 @default.
- W2001738676 hasConcept C2986744138 @default.
- W2001738676 hasConcept C41008148 @default.
- W2001738676 hasConcept C41895202 @default.
- W2001738676 hasConcept C482391 @default.
- W2001738676 hasConcept C59822182 @default.
- W2001738676 hasConcept C66945725 @default.
- W2001738676 hasConcept C71472368 @default.
- W2001738676 hasConcept C86803240 @default.
- W2001738676 hasConcept C89600930 @default.
- W2001738676 hasConcept C94124525 @default.
- W2001738676 hasConcept C98045186 @default.
- W2001738676 hasConcept C98501671 @default.
- W2001738676 hasConceptScore W2001738676C111919701 @default.
- W2001738676 hasConceptScore W2001738676C116834253 @default.
- W2001738676 hasConceptScore W2001738676C127705205 @default.
- W2001738676 hasConceptScore W2001738676C138885662 @default.
- W2001738676 hasConceptScore W2001738676C151375590 @default.
- W2001738676 hasConceptScore W2001738676C154945302 @default.
- W2001738676 hasConceptScore W2001738676C195807954 @default.
- W2001738676 hasConceptScore W2001738676C204321447 @default.
- W2001738676 hasConceptScore W2001738676C23123220 @default.
- W2001738676 hasConceptScore W2001738676C2776224158 @default.
- W2001738676 hasConceptScore W2001738676C2986744138 @default.
- W2001738676 hasConceptScore W2001738676C41008148 @default.
- W2001738676 hasConceptScore W2001738676C41895202 @default.
- W2001738676 hasConceptScore W2001738676C482391 @default.
- W2001738676 hasConceptScore W2001738676C59822182 @default.
- W2001738676 hasConceptScore W2001738676C66945725 @default.
- W2001738676 hasConceptScore W2001738676C71472368 @default.
- W2001738676 hasConceptScore W2001738676C86803240 @default.
- W2001738676 hasConceptScore W2001738676C89600930 @default.
- W2001738676 hasConceptScore W2001738676C94124525 @default.
- W2001738676 hasConceptScore W2001738676C98045186 @default.
- W2001738676 hasConceptScore W2001738676C98501671 @default.
- W2001738676 hasIssue "2" @default.
- W2001738676 hasLocation W20017386761 @default.
- W2001738676 hasLocation W20017386762 @default.
- W2001738676 hasLocation W20017386763 @default.
- W2001738676 hasOpenAccess W2001738676 @default.
- W2001738676 hasPrimaryLocation W20017386761 @default.
- W2001738676 hasRelatedWork W103659291 @default.
- W2001738676 hasRelatedWork W1173712697 @default.
- W2001738676 hasRelatedWork W2003206737 @default.
- W2001738676 hasRelatedWork W2012263312 @default.
- W2001738676 hasRelatedWork W2163264304 @default.
- W2001738676 hasRelatedWork W2174664889 @default.
- W2001738676 hasRelatedWork W2365240215 @default.
- W2001738676 hasRelatedWork W2366341164 @default.
- W2001738676 hasRelatedWork W3015808114 @default.
- W2001738676 hasRelatedWork W3161242220 @default.
- W2001738676 hasVolume "2" @default.