Matches in SemOpenAlex for { <https://semopenalex.org/work/W3112561948> ?p ?o ?g. }
Showing items 1 to 75 of
75
with 100 items per page.
- W3112561948 abstract "Summarization is an integral part of modern Internet. In social networks, which have become primary information sources, users have grown accustomed to condense their writing. Content providers routinely publish short textual excerpts to these platforms as well. However, with larger quantities of small documents becoming constantly available, search engines now have less data to index, classify and retrieve relevant information. In this regard, more research is needed to show how reliable the current Information Retrieval (IR) algorithms are when confronted to collections of exclusively short documents, such as the ones arising from social media.This paper explores the semantic proximity between human summaries and queries through cluster analysis, and how it relates to IR. Roughly, the k-means algorithm was used to cluster two collections of summaries by their semantic similarity: one in English and one in Spanish. This, to measure how summarization may affect information content in cluster-based IR. Furthermore, the same algorithm was used to measure how documents grouped around a set of artificially generated queries.The results show that, regardless of the language, providing the algorithm with previous category knowledge may contribute to increase the accuracy of cluster-based document classification. Furthermore, some evidences points to the effect of summary quality in retrievability: summaries created by specialized summarizers induced more distinguishable clusters than summaries created by university students. Future work in this area may serve to adapt existing algorithms to big collections of short documents, improving IR performance in cases where machine learning techniques are not available." @default.
- W3112561948 created "2020-12-21" @default.
- W3112561948 creator A5018786694 @default.
- W3112561948 creator A5055865789 @default.
- W3112561948 creator A5071736469 @default.
- W3112561948 creator A5081516169 @default.
- W3112561948 date "2020-11-16" @default.
- W3112561948 modified "2023-10-01" @default.
- W3112561948 title "Measuring the Effects of Summarization in Cluster-based Information Retrieval" @default.
- W3112561948 cites W1807473991 @default.
- W3112561948 cites W1979432867 @default.
- W3112561948 cites W1987971958 @default.
- W3112561948 cites W1998720920 @default.
- W3112561948 cites W2011430131 @default.
- W3112561948 cites W2053968437 @default.
- W3112561948 cites W2101105183 @default.
- W3112561948 cites W2123095505 @default.
- W3112561948 cites W2123442489 @default.
- W3112561948 cites W2145907631 @default.
- W3112561948 cites W2149593800 @default.
- W3112561948 cites W2150824314 @default.
- W3112561948 cites W2174420725 @default.
- W3112561948 cites W2251023345 @default.
- W3112561948 cites W2558653419 @default.
- W3112561948 cites W2610561411 @default.
- W3112561948 cites W2769025373 @default.
- W3112561948 cites W2899449303 @default.
- W3112561948 cites W2911737959 @default.
- W3112561948 cites W2922826082 @default.
- W3112561948 cites W2945560458 @default.
- W3112561948 cites W2971054328 @default.
- W3112561948 cites W3081986567 @default.
- W3112561948 cites W3205499257 @default.
- W3112561948 cites W4213009331 @default.
- W3112561948 cites W4235169531 @default.
- W3112561948 doi "https://doi.org/10.1109/sccc51225.2020.9281189" @default.
- W3112561948 hasPublicationYear "2020" @default.
- W3112561948 type Work @default.
- W3112561948 sameAs 3112561948 @default.
- W3112561948 citedByCount "1" @default.
- W3112561948 countsByYear W31125619482022 @default.
- W3112561948 crossrefType "proceedings-article" @default.
- W3112561948 hasAuthorship W3112561948A5018786694 @default.
- W3112561948 hasAuthorship W3112561948A5055865789 @default.
- W3112561948 hasAuthorship W3112561948A5071736469 @default.
- W3112561948 hasAuthorship W3112561948A5081516169 @default.
- W3112561948 hasConcept C134714966 @default.
- W3112561948 hasConcept C164866538 @default.
- W3112561948 hasConcept C170858558 @default.
- W3112561948 hasConcept C23123220 @default.
- W3112561948 hasConcept C31258907 @default.
- W3112561948 hasConcept C41008148 @default.
- W3112561948 hasConceptScore W3112561948C134714966 @default.
- W3112561948 hasConceptScore W3112561948C164866538 @default.
- W3112561948 hasConceptScore W3112561948C170858558 @default.
- W3112561948 hasConceptScore W3112561948C23123220 @default.
- W3112561948 hasConceptScore W3112561948C31258907 @default.
- W3112561948 hasConceptScore W3112561948C41008148 @default.
- W3112561948 hasLocation W31125619481 @default.
- W3112561948 hasOpenAccess W3112561948 @default.
- W3112561948 hasPrimaryLocation W31125619481 @default.
- W3112561948 hasRelatedWork W132250100 @default.
- W3112561948 hasRelatedWork W1539478205 @default.
- W3112561948 hasRelatedWork W2093597205 @default.
- W3112561948 hasRelatedWork W2141817295 @default.
- W3112561948 hasRelatedWork W2334535520 @default.
- W3112561948 hasRelatedWork W2380641910 @default.
- W3112561948 hasRelatedWork W2389846579 @default.
- W3112561948 hasRelatedWork W2392495745 @default.
- W3112561948 hasRelatedWork W2725657302 @default.
- W3112561948 hasRelatedWork W52724171 @default.
- W3112561948 isParatext "false" @default.
- W3112561948 isRetracted "false" @default.
- W3112561948 magId "3112561948" @default.
- W3112561948 workType "article" @default.