Matches in SemOpenAlex for { <https://semopenalex.org/work/W4313014461> ?p ?o ?g. }
Showing items 1 to 83 of 83 with 100 items per page.
- W4313014461 abstract "Audio question answering (AQA) is a multimodal translation task where a system analyzes an audio signal and a natural language question to generate a desirable natural language answer. In this paper, we introduce Clotho-AQA, a dataset for audio question answering consisting of 1991 audio files, each between 15 and 30 seconds in duration, selected from the Clotho dataset. For each audio file, we collect six different questions and corresponding answers by crowdsourcing using Amazon Mechanical Turk. The questions and answers are produced by different annotators. Out of the six questions for each audio file, two questions each are designed to have ‘yes’ and ‘no’ as answers, while the remaining two questions have other single-word answers. For each question, we collect answers from three different annotators. We also present two baseline experiments to describe the usage of our dataset for the AQA task: a long short-term memory (LSTM) based multimodal binary classifier for ‘yes’ or ‘no’ type answers and an LSTM based multimodal multi-class classifier for 828 single-word answers. The binary classifier achieved an accuracy of 62.7% and the multi-class classifier achieved a top-1 accuracy of 54.2% and a top-5 accuracy of 93.7%. The Clotho-AQA dataset is freely available online at https://zenodo.org/record/6473207." @default.
- W4313014461 created "2023-01-05" @default.
- W4313014461 creator A5007748468 @default.
- W4313014461 creator A5022669956 @default.
- W4313014461 creator A5049691461 @default.
- W4313014461 creator A5087751248 @default.
- W4313014461 date "2022-08-29" @default.
- W4313014461 modified "2023-09-26" @default.
- W4313014461 title "Clotho-AQA: A Crowdsourced Dataset for Audio Question Answering" @default.
- W4313014461 cites W2033875152 @default.
- W4313014461 cites W2529436507 @default.
- W4313014461 cites W2557764419 @default.
- W4313014461 cites W2563399268 @default.
- W4313014461 cites W2593116425 @default.
- W4313014461 cites W2619697695 @default.
- W4313014461 cites W2939574508 @default.
- W4313014461 cites W2945761034 @default.
- W4313014461 cites W2962749469 @default.
- W4313014461 cites W2962910007 @default.
- W4313014461 cites W2963541336 @default.
- W4313014461 cites W2963748441 @default.
- W4313014461 cites W2963890019 @default.
- W4313014461 cites W2964220823 @default.
- W4313014461 cites W3044495139 @default.
- W4313014461 cites W3135751354 @default.
- W4313014461 cites W54887220 @default.
- W4313014461 doi "https://doi.org/10.23919/eusipco55093.2022.9909680" @default.
- W4313014461 hasPublicationYear "2022" @default.
- W4313014461 type Work @default.
- W4313014461 citedByCount "0" @default.
- W4313014461 crossrefType "proceedings-article" @default.
- W4313014461 hasAuthorship W4313014461A5007748468 @default.
- W4313014461 hasAuthorship W4313014461A5022669956 @default.
- W4313014461 hasAuthorship W4313014461A5049691461 @default.
- W4313014461 hasAuthorship W4313014461A5087751248 @default.
- W4313014461 hasBestOaLocation W43130144612 @default.
- W4313014461 hasConcept C12267149 @default.
- W4313014461 hasConcept C136764020 @default.
- W4313014461 hasConcept C154945302 @default.
- W4313014461 hasConcept C204321447 @default.
- W4313014461 hasConcept C23123220 @default.
- W4313014461 hasConcept C28490314 @default.
- W4313014461 hasConcept C3019144022 @default.
- W4313014461 hasConcept C33923547 @default.
- W4313014461 hasConcept C41008148 @default.
- W4313014461 hasConcept C44291984 @default.
- W4313014461 hasConcept C48372109 @default.
- W4313014461 hasConcept C62230096 @default.
- W4313014461 hasConcept C66905080 @default.
- W4313014461 hasConcept C94375191 @default.
- W4313014461 hasConcept C95623464 @default.
- W4313014461 hasConceptScore W4313014461C12267149 @default.
- W4313014461 hasConceptScore W4313014461C136764020 @default.
- W4313014461 hasConceptScore W4313014461C154945302 @default.
- W4313014461 hasConceptScore W4313014461C204321447 @default.
- W4313014461 hasConceptScore W4313014461C23123220 @default.
- W4313014461 hasConceptScore W4313014461C28490314 @default.
- W4313014461 hasConceptScore W4313014461C3019144022 @default.
- W4313014461 hasConceptScore W4313014461C33923547 @default.
- W4313014461 hasConceptScore W4313014461C41008148 @default.
- W4313014461 hasConceptScore W4313014461C44291984 @default.
- W4313014461 hasConceptScore W4313014461C48372109 @default.
- W4313014461 hasConceptScore W4313014461C62230096 @default.
- W4313014461 hasConceptScore W4313014461C66905080 @default.
- W4313014461 hasConceptScore W4313014461C94375191 @default.
- W4313014461 hasConceptScore W4313014461C95623464 @default.
- W4313014461 hasLocation W43130144611 @default.
- W4313014461 hasLocation W43130144612 @default.
- W4313014461 hasOpenAccess W4313014461 @default.
- W4313014461 hasPrimaryLocation W43130144611 @default.
- W4313014461 hasRelatedWork W128392744 @default.
- W4313014461 hasRelatedWork W1846393350 @default.
- W4313014461 hasRelatedWork W207304934 @default.
- W4313014461 hasRelatedWork W2131129554 @default.
- W4313014461 hasRelatedWork W2135033253 @default.
- W4313014461 hasRelatedWork W2233955765 @default.
- W4313014461 hasRelatedWork W2747680751 @default.
- W4313014461 hasRelatedWork W3107474891 @default.
- W4313014461 hasRelatedWork W4224282636 @default.
- W4313014461 hasRelatedWork W4313014461 @default.
- W4313014461 isParatext "false" @default.
- W4313014461 isRetracted "false" @default.
- W4313014461 workType "article" @default.