Matches in SemOpenAlex for { <https://semopenalex.org/work/W2895990696> ?p ?o ?g. }
Showing items 1 to 76 of
76
with 100 items per page.
- W2895990696 endingPage "946" @default.
- W2895990696 startingPage "921" @default.
- W2895990696 abstract "Abstract Research on topic segmentation has recently focused on segmenting documents by taking advantage of documents covering the same topics. In order to properly evaluate such approaches, a dataset of related documents is needed. However, existing datasets are limited in the number of related documents per domain. In addition, most of the available datasets do not consider documents from different media sources (PowerPoints, videos, etc.), which pose specific challenges to segmentation. We fill this gap with the MU ltimedia SE gmentation D ataset (MUSED), a collection of documents manually segmented, from different media sources, in seven different domains, with an average of twenty related documents per domain. In this paper, we describe the process of building MUSED. A multi-annotator study is carried out to determine if it is possible to observe agreement among human judges and characterize their disagreement patterns. In addition, we use MUSED to compare the state-of-the-art topic segmentation techniques, including the ones that take advantage of related documents. Moreover, we study the impact of having documents from different media sources in the dataset. To the best of our knowledge, MUSED is the first dataset that allows a straightforward evaluation of both single- and multiple-documents topic segmentation techniques, as well as to study how these behave in the presence of documents from different media sources. Results show that some techniques are, indeed, sensitive to different media sources, and also that current multi-document segmentation models do not outperform previous models, pointing to a research line that needs to be boosted." @default.
- W2895990696 created "2018-10-26" @default.
- W2895990696 creator A5066439336 @default.
- W2895990696 creator A5067178556 @default.
- W2895990696 creator A5077285164 @default.
- W2895990696 date "2018-10-22" @default.
- W2895990696 modified "2023-09-24" @default.
- W2895990696 title "MUSED: A multimedia multi-document dataset for topic segmentation" @default.
- W2895990696 cites W1985741469 @default.
- W2895990696 cites W2053154970 @default.
- W2895990696 cites W2069207503 @default.
- W2895990696 cites W2092062917 @default.
- W2895990696 cites W2133943399 @default.
- W2895990696 cites W2141403362 @default.
- W2895990696 cites W2159083595 @default.
- W2895990696 cites W2162021827 @default.
- W2895990696 cites W2165232124 @default.
- W2895990696 cites W2169142063 @default.
- W2895990696 cites W2750725664 @default.
- W2895990696 doi "https://doi.org/10.1017/s1351324918000359" @default.
- W2895990696 hasPublicationYear "2018" @default.
- W2895990696 type Work @default.
- W2895990696 sameAs 2895990696 @default.
- W2895990696 citedByCount "2" @default.
- W2895990696 countsByYear W28959906962022 @default.
- W2895990696 crossrefType "journal-article" @default.
- W2895990696 hasAuthorship W2895990696A5066439336 @default.
- W2895990696 hasAuthorship W2895990696A5067178556 @default.
- W2895990696 hasAuthorship W2895990696A5077285164 @default.
- W2895990696 hasConcept C111919701 @default.
- W2895990696 hasConcept C125308379 @default.
- W2895990696 hasConcept C134306372 @default.
- W2895990696 hasConcept C144133560 @default.
- W2895990696 hasConcept C154945302 @default.
- W2895990696 hasConcept C162853370 @default.
- W2895990696 hasConcept C23123220 @default.
- W2895990696 hasConcept C2522767166 @default.
- W2895990696 hasConcept C33923547 @default.
- W2895990696 hasConcept C36503486 @default.
- W2895990696 hasConcept C41008148 @default.
- W2895990696 hasConcept C89600930 @default.
- W2895990696 hasConcept C98045186 @default.
- W2895990696 hasConceptScore W2895990696C111919701 @default.
- W2895990696 hasConceptScore W2895990696C125308379 @default.
- W2895990696 hasConceptScore W2895990696C134306372 @default.
- W2895990696 hasConceptScore W2895990696C144133560 @default.
- W2895990696 hasConceptScore W2895990696C154945302 @default.
- W2895990696 hasConceptScore W2895990696C162853370 @default.
- W2895990696 hasConceptScore W2895990696C23123220 @default.
- W2895990696 hasConceptScore W2895990696C2522767166 @default.
- W2895990696 hasConceptScore W2895990696C33923547 @default.
- W2895990696 hasConceptScore W2895990696C36503486 @default.
- W2895990696 hasConceptScore W2895990696C41008148 @default.
- W2895990696 hasConceptScore W2895990696C89600930 @default.
- W2895990696 hasConceptScore W2895990696C98045186 @default.
- W2895990696 hasIssue "6" @default.
- W2895990696 hasLocation W28959906961 @default.
- W2895990696 hasOpenAccess W2895990696 @default.
- W2895990696 hasPrimaryLocation W28959906961 @default.
- W2895990696 hasRelatedWork W1553514811 @default.
- W2895990696 hasRelatedWork W2045342254 @default.
- W2895990696 hasRelatedWork W2101955803 @default.
- W2895990696 hasRelatedWork W2144190808 @default.
- W2895990696 hasRelatedWork W2376314740 @default.
- W2895990696 hasRelatedWork W2384888906 @default.
- W2895990696 hasRelatedWork W2616049357 @default.
- W2895990696 hasRelatedWork W2971867419 @default.
- W2895990696 hasRelatedWork W4285814174 @default.
- W2895990696 hasRelatedWork W4293872997 @default.
- W2895990696 hasVolume "24" @default.
- W2895990696 isParatext "false" @default.
- W2895990696 isRetracted "false" @default.
- W2895990696 magId "2895990696" @default.
- W2895990696 workType "article" @default.