Matches in SemOpenAlex for { <https://semopenalex.org/work/W4288280017> ?p ?o ?g. }
Showing items 1 to 72 of
72
with 100 items per page.
- W4288280017 abstract "Audio captioning is a novel field of multi-modal translation and it is the task of creating a textual description of the content of an audio signal (e.g. people talking in a big room). The creation of a dataset for this task requires a considerable amount of work, rendering the crowdsourcing a very attractive option. In this paper we present a three steps based framework for crowdsourcing an audio captioning dataset, based on concepts and practises followed for the creation of widely used image captioning and machine translations datasets. During the first step initial captions are gathered. A grammatically corrected and/or rephrased version of each initial caption is obtained in second step. Finally, the initial and edited captions are rated, keeping the top ones for the produced dataset. We objectively evaluate the impact of our framework during the process of creating an audio captioning dataset, in terms of diversity and amount of typographical errors in the obtained captions. The obtained results show that the resulting dataset has less typographical errors than the initial captions, and on average each sound in the produced dataset has captions with a Jaccard similarity of 0.24, roughly equivalent to two ten-word captions having in common four words with the same root, indicating that the captions are dissimilar while they still contain some of the same information." @default.
- W4288280017 created "2022-07-28" @default.
- W4288280017 creator A5007748468 @default.
- W4288280017 creator A5049691461 @default.
- W4288280017 creator A5087751248 @default.
- W4288280017 date "2019-07-22" @default.
- W4288280017 modified "2023-10-18" @default.
- W4288280017 title "Crowdsourcing a Dataset of Audio Captions" @default.
- W4288280017 doi "https://doi.org/10.48550/arxiv.1907.09238" @default.
- W4288280017 hasPublicationYear "2019" @default.
- W4288280017 type Work @default.
- W4288280017 citedByCount "0" @default.
- W4288280017 crossrefType "posted-content" @default.
- W4288280017 hasAuthorship W4288280017A5007748468 @default.
- W4288280017 hasAuthorship W4288280017A5049691461 @default.
- W4288280017 hasAuthorship W4288280017A5087751248 @default.
- W4288280017 hasBestOaLocation W42882800171 @default.
- W4288280017 hasConcept C103278499 @default.
- W4288280017 hasConcept C115961682 @default.
- W4288280017 hasConcept C136764020 @default.
- W4288280017 hasConcept C153180895 @default.
- W4288280017 hasConcept C154945302 @default.
- W4288280017 hasConcept C157657479 @default.
- W4288280017 hasConcept C162324750 @default.
- W4288280017 hasConcept C187736073 @default.
- W4288280017 hasConcept C202444582 @default.
- W4288280017 hasConcept C203519979 @default.
- W4288280017 hasConcept C204321447 @default.
- W4288280017 hasConcept C205711294 @default.
- W4288280017 hasConcept C23123220 @default.
- W4288280017 hasConcept C2780451532 @default.
- W4288280017 hasConcept C28490314 @default.
- W4288280017 hasConcept C33923547 @default.
- W4288280017 hasConcept C41008148 @default.
- W4288280017 hasConcept C62230096 @default.
- W4288280017 hasConcept C9652623 @default.
- W4288280017 hasConceptScore W4288280017C103278499 @default.
- W4288280017 hasConceptScore W4288280017C115961682 @default.
- W4288280017 hasConceptScore W4288280017C136764020 @default.
- W4288280017 hasConceptScore W4288280017C153180895 @default.
- W4288280017 hasConceptScore W4288280017C154945302 @default.
- W4288280017 hasConceptScore W4288280017C157657479 @default.
- W4288280017 hasConceptScore W4288280017C162324750 @default.
- W4288280017 hasConceptScore W4288280017C187736073 @default.
- W4288280017 hasConceptScore W4288280017C202444582 @default.
- W4288280017 hasConceptScore W4288280017C203519979 @default.
- W4288280017 hasConceptScore W4288280017C204321447 @default.
- W4288280017 hasConceptScore W4288280017C205711294 @default.
- W4288280017 hasConceptScore W4288280017C23123220 @default.
- W4288280017 hasConceptScore W4288280017C2780451532 @default.
- W4288280017 hasConceptScore W4288280017C28490314 @default.
- W4288280017 hasConceptScore W4288280017C33923547 @default.
- W4288280017 hasConceptScore W4288280017C41008148 @default.
- W4288280017 hasConceptScore W4288280017C62230096 @default.
- W4288280017 hasConceptScore W4288280017C9652623 @default.
- W4288280017 hasLocation W42882800171 @default.
- W4288280017 hasLocation W42882800172 @default.
- W4288280017 hasOpenAccess W4288280017 @default.
- W4288280017 hasPrimaryLocation W42882800171 @default.
- W4288280017 hasRelatedWork W1509467138 @default.
- W4288280017 hasRelatedWork W2072271845 @default.
- W4288280017 hasRelatedWork W2081647779 @default.
- W4288280017 hasRelatedWork W2765264756 @default.
- W4288280017 hasRelatedWork W2983160953 @default.
- W4288280017 hasRelatedWork W3113062513 @default.
- W4288280017 hasRelatedWork W3122011053 @default.
- W4288280017 hasRelatedWork W4200486724 @default.
- W4288280017 hasRelatedWork W99542844 @default.
- W4288280017 hasRelatedWork W2183521486 @default.
- W4288280017 isParatext "false" @default.
- W4288280017 isRetracted "false" @default.
- W4288280017 workType "article" @default.