Matches in SemOpenAlex for { <https://semopenalex.org/work/W3097687554> ?p ?o ?g. }
- W3097687554 endingPage "23" @default.
- W3097687554 startingPage "1" @default.
- W3097687554 abstract "Automatically summarizing a group of short texts that mainly share one topic is a fundamental task in many applications, e.g., summarizing the main symptoms for a disease based on a group of medical texts that are usually short, i.e., tens of words. Conventional unsupervised short text summarization techniques tend to find the most representative short text document. However, they may cause privacy issues, e.g., personal information in the medical texts may be exposed. Moreover, compared with the complete short text where some unimportant words may exist, a summary consisting of only a few keywords is more preferable by the user due to its clear and concise form. Due to the above reasons, in this article, we aim to solve the problem of unsupervised derivation of keyword summary for short texts. Existing keyword extraction methods such as Latent Dirichlet Allocation cannot be applied to solve this problem, since (1) the ordering relations among the extracted keywords are ignored, which causes troubles for people to capture the main idea of the event, and (2) short texts contain limited context, which makes it hard to find the optimal words for semantic coverage. Hence, we propose a simple but yet effective method named Frequent Closed Wordsets Ranking (FCWRank) to derive the keyword summary from a short text cluster. FCWRank is an unsupervised method that builds on the idea of frequent closed itemset mining in transaction database. FCWRank first mines all frequent closed wordsets from a cluster of short texts and then selects the most important wordset based on an importance model where the similarity between closed wordsets and the relation between the closed wordset and the short text document are considered simultaneously. To make the keywords within the wordset more understandable, FCWRank further unfolds the semantics behind them by sorting them. Experiments on real-world short text collections show that FCWRank outperforms the state-of-the-art baselines in terms of Recall-Oriented Understudy for Gisting Evaluation-Longest common subsequence F1, precision and recall scores." @default.
- W3097687554 created "2020-11-09" @default.
- W3097687554 creator A5009633508 @default.
- W3097687554 creator A5030881288 @default.
- W3097687554 creator A5052160218 @default.
- W3097687554 creator A5056652572 @default.
- W3097687554 creator A5057202739 @default.
- W3097687554 creator A5069353502 @default.
- W3097687554 creator A5074639024 @default.
- W3097687554 creator A5087787072 @default.
- W3097687554 date "2021-06-02" @default.
- W3097687554 modified "2023-09-26" @default.
- W3097687554 title "Unsupervised Derivation of Keyword Summary for Short Texts" @default.
- W3097687554 cites W1880262756 @default.
- W3097687554 cites W1965667542 @default.
- W3097687554 cites W1997036897 @default.
- W3097687554 cites W2002117016 @default.
- W3097687554 cites W2048195127 @default.
- W3097687554 cites W2063904635 @default.
- W3097687554 cites W2069667724 @default.
- W3097687554 cites W2080593377 @default.
- W3097687554 cites W2101717554 @default.
- W3097687554 cites W2103339462 @default.
- W3097687554 cites W2104210067 @default.
- W3097687554 cites W2123442489 @default.
- W3097687554 cites W2139317750 @default.
- W3097687554 cites W2148507357 @default.
- W3097687554 cites W2151703435 @default.
- W3097687554 cites W2158780731 @default.
- W3097687554 cites W2159583324 @default.
- W3097687554 cites W2171836785 @default.
- W3097687554 cites W2340381866 @default.
- W3097687554 cites W2519081496 @default.
- W3097687554 cites W2606926586 @default.
- W3097687554 cites W2729840144 @default.
- W3097687554 cites W2753434909 @default.
- W3097687554 cites W2803263920 @default.
- W3097687554 cites W2807927369 @default.
- W3097687554 cites W2904432433 @default.
- W3097687554 cites W2946833416 @default.
- W3097687554 cites W2948306594 @default.
- W3097687554 cites W2964315079 @default.
- W3097687554 cites W4235295823 @default.
- W3097687554 cites W4252403066 @default.
- W3097687554 cites W95321676 @default.
- W3097687554 doi "https://doi.org/10.1145/3397162" @default.
- W3097687554 hasPublicationYear "2021" @default.
- W3097687554 type Work @default.
- W3097687554 sameAs 3097687554 @default.
- W3097687554 citedByCount "4" @default.
- W3097687554 countsByYear W30976875542022 @default.
- W3097687554 countsByYear W30976875542023 @default.
- W3097687554 crossrefType "journal-article" @default.
- W3097687554 hasAuthorship W3097687554A5009633508 @default.
- W3097687554 hasAuthorship W3097687554A5030881288 @default.
- W3097687554 hasAuthorship W3097687554A5052160218 @default.
- W3097687554 hasAuthorship W3097687554A5056652572 @default.
- W3097687554 hasAuthorship W3097687554A5057202739 @default.
- W3097687554 hasAuthorship W3097687554A5069353502 @default.
- W3097687554 hasAuthorship W3097687554A5074639024 @default.
- W3097687554 hasAuthorship W3097687554A5087787072 @default.
- W3097687554 hasConcept C111472728 @default.
- W3097687554 hasConcept C121332964 @default.
- W3097687554 hasConcept C138885662 @default.
- W3097687554 hasConcept C151730666 @default.
- W3097687554 hasConcept C154945302 @default.
- W3097687554 hasConcept C162324750 @default.
- W3097687554 hasConcept C170858558 @default.
- W3097687554 hasConcept C171686336 @default.
- W3097687554 hasConcept C187736073 @default.
- W3097687554 hasConcept C189430467 @default.
- W3097687554 hasConcept C204321447 @default.
- W3097687554 hasConcept C23123220 @default.
- W3097687554 hasConcept C2779343474 @default.
- W3097687554 hasConcept C2779662365 @default.
- W3097687554 hasConcept C2780288562 @default.
- W3097687554 hasConcept C2780451532 @default.
- W3097687554 hasConcept C2780586882 @default.
- W3097687554 hasConcept C41008148 @default.
- W3097687554 hasConcept C500882744 @default.
- W3097687554 hasConcept C62520636 @default.
- W3097687554 hasConcept C86803240 @default.
- W3097687554 hasConceptScore W3097687554C111472728 @default.
- W3097687554 hasConceptScore W3097687554C121332964 @default.
- W3097687554 hasConceptScore W3097687554C138885662 @default.
- W3097687554 hasConceptScore W3097687554C151730666 @default.
- W3097687554 hasConceptScore W3097687554C154945302 @default.
- W3097687554 hasConceptScore W3097687554C162324750 @default.
- W3097687554 hasConceptScore W3097687554C170858558 @default.
- W3097687554 hasConceptScore W3097687554C171686336 @default.
- W3097687554 hasConceptScore W3097687554C187736073 @default.
- W3097687554 hasConceptScore W3097687554C189430467 @default.
- W3097687554 hasConceptScore W3097687554C204321447 @default.
- W3097687554 hasConceptScore W3097687554C23123220 @default.
- W3097687554 hasConceptScore W3097687554C2779343474 @default.
- W3097687554 hasConceptScore W3097687554C2779662365 @default.
- W3097687554 hasConceptScore W3097687554C2780288562 @default.
- W3097687554 hasConceptScore W3097687554C2780451532 @default.