Matches in SemOpenAlex for { <https://semopenalex.org/work/W2079355893> ?p ?o ?g. }
Showing items 1 to 85 of 85 with 100 items per page.
- W2079355893 abstract "Faced with the need for human comprehension of any large collection of objects, a time honored approach has been to cluster the objects into groups of closely related objects. Individual groups are then summarized in some convenient manner to provide a more manageable view of the data. Such methods have been applied to document collections with mixed results. If a hard clustering of the data into mutually exclusive clusters is performed then documents are frequently forced into one cluster when they may contain important information that would also appropriately make them candidates for other clusters. If a soft clustering is used there still remains the problem of how to provide a useful summary of the data in a cluster. Here we introduce a new algorithm to produce a soft clustering of document collections that is based on the concept of a theme. A theme is conceptually a subject area that is discussed by multiple documents in the database. A theme has two potential representations that may be viewed as dual to each other. First it is represented by the set of documents that discuss the subject or theme and second it is also represented by the set of key terms that are typically used to discuss the theme. Our algorithm is an EM algorithm in which the term representation and the document representation are explicit components and each is used to refine the other in an alternating fashion. Upon convergence the term representation provides a natural summary of the document representation (the cluster). We describe how to optimize the themes produced by this process and give the results of applying the method to a database of over fifty thousand PubMed documents dealing with the subject of AIDS. How themes may improve access to a document collection is also discussed." @default.
- W2079355893 created "2016-06-24" @default.
- W2079355893 creator A5002959499 @default.
- W2079355893 date "2001-12-01" @default.
- W2079355893 modified "2023-09-26" @default.
- W2079355893 title "A THEMATIC ANALYSIS OF THE AIDS LITERATURE" @default.
- W2079355893 cites W125401671 @default.
- W2079355893 cites W1555244713 @default.
- W2079355893 cites W1920003194 @default.
- W2079355893 cites W2044758663 @default.
- W2079355893 cites W2064580901 @default.
- W2079355893 cites W2068632118 @default.
- W2079355893 cites W2107743791 @default.
- W2079355893 cites W2111705563 @default.
- W2079355893 cites W2113041963 @default.
- W2079355893 cites W2114804204 @default.
- W2079355893 cites W2115159360 @default.
- W2079355893 cites W2196501509 @default.
- W2079355893 doi "https://doi.org/10.1142/9789812799623_0036" @default.
- W2079355893 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/11928492" @default.
- W2079355893 hasPublicationYear "2001" @default.
- W2079355893 type Work @default.
- W2079355893 sameAs 2079355893 @default.
- W2079355893 citedByCount "21" @default.
- W2079355893 countsByYear W20793558932014 @default.
- W2079355893 countsByYear W20793558932015 @default.
- W2079355893 countsByYear W20793558932016 @default.
- W2079355893 countsByYear W20793558932017 @default.
- W2079355893 countsByYear W20793558932018 @default.
- W2079355893 crossrefType "proceedings-article" @default.
- W2079355893 hasAuthorship W2079355893A5002959499 @default.
- W2079355893 hasConcept C124101348 @default.
- W2079355893 hasConcept C136764020 @default.
- W2079355893 hasConcept C154945302 @default.
- W2079355893 hasConcept C164866538 @default.
- W2079355893 hasConcept C177264268 @default.
- W2079355893 hasConcept C17744445 @default.
- W2079355893 hasConcept C177937566 @default.
- W2079355893 hasConcept C199360897 @default.
- W2079355893 hasConcept C199539241 @default.
- W2079355893 hasConcept C204321447 @default.
- W2079355893 hasConcept C23123220 @default.
- W2079355893 hasConcept C2776359362 @default.
- W2079355893 hasConcept C2777855551 @default.
- W2079355893 hasConcept C33566652 @default.
- W2079355893 hasConcept C41008148 @default.
- W2079355893 hasConcept C73555534 @default.
- W2079355893 hasConcept C80444323 @default.
- W2079355893 hasConcept C94625758 @default.
- W2079355893 hasConceptScore W2079355893C124101348 @default.
- W2079355893 hasConceptScore W2079355893C136764020 @default.
- W2079355893 hasConceptScore W2079355893C154945302 @default.
- W2079355893 hasConceptScore W2079355893C164866538 @default.
- W2079355893 hasConceptScore W2079355893C177264268 @default.
- W2079355893 hasConceptScore W2079355893C17744445 @default.
- W2079355893 hasConceptScore W2079355893C177937566 @default.
- W2079355893 hasConceptScore W2079355893C199360897 @default.
- W2079355893 hasConceptScore W2079355893C199539241 @default.
- W2079355893 hasConceptScore W2079355893C204321447 @default.
- W2079355893 hasConceptScore W2079355893C23123220 @default.
- W2079355893 hasConceptScore W2079355893C2776359362 @default.
- W2079355893 hasConceptScore W2079355893C2777855551 @default.
- W2079355893 hasConceptScore W2079355893C33566652 @default.
- W2079355893 hasConceptScore W2079355893C41008148 @default.
- W2079355893 hasConceptScore W2079355893C73555534 @default.
- W2079355893 hasConceptScore W2079355893C80444323 @default.
- W2079355893 hasConceptScore W2079355893C94625758 @default.
- W2079355893 hasLocation W20793558931 @default.
- W2079355893 hasLocation W20793558932 @default.
- W2079355893 hasOpenAccess W2079355893 @default.
- W2079355893 hasPrimaryLocation W20793558931 @default.
- W2079355893 hasRelatedWork W1504491975 @default.
- W2079355893 hasRelatedWork W2068266569 @default.
- W2079355893 hasRelatedWork W2086064646 @default.
- W2079355893 hasRelatedWork W2184440854 @default.
- W2079355893 hasRelatedWork W2184609164 @default.
- W2079355893 hasRelatedWork W2250624607 @default.
- W2079355893 hasRelatedWork W2282393731 @default.
- W2079355893 hasRelatedWork W2380798983 @default.
- W2079355893 hasRelatedWork W2385729623 @default.
- W2079355893 hasRelatedWork W2888523397 @default.
- W2079355893 isParatext "false" @default.
- W2079355893 isRetracted "false" @default.
- W2079355893 magId "2079355893" @default.
- W2079355893 workType "article" @default.