Matches in SemOpenAlex for { <https://semopenalex.org/work/W4313594098> ?p ?o ?g. }
Showing items 1 to 90 of 90 with 100 items per page.
- W4313594098 endingPage "119491" @default.
- W4313594098 startingPage "119491" @default.
- W4313594098 abstract "Music transcription, which deals with the conversion of music sources into a structured digital format, is a key problem for Music Information Retrieval (MIR). When addressing this challenge in computational terms, the MIR community follows two lines of research: music documents, which is the case of Optical Music Recognition (OMR), or audio recordings, which is the case of Automatic Music Transcription (AMT). The different nature of the aforementioned input data has conditioned these fields to develop modality-specific frameworks. However, their recent definition in terms of sequence labeling tasks leads to a common output representation, which enables research on a combined paradigm. In this respect, multimodal image and audio music transcription comprises the challenge of effectively combining the information conveyed by image and audio modalities. In this work, we explore this question at a late-fusion level: we study four combination approaches in order to merge, for the first time, the hypotheses regarding end-to-end OMR and AMT systems in a lattice-based search space. The results obtained for a series of performance scenarios–in which the corresponding single-modality models yield different error rates–showed interesting benefits of these approaches. In addition, two of the four strategies considered significantly improve the corresponding unimodal standard recognition frameworks." @default.
- W4313594098 created "2023-01-06" @default.
- W4313594098 creator A5005281683 @default.
- W4313594098 creator A5041459269 @default.
- W4313594098 creator A5080444506 @default.
- W4313594098 creator A5085151278 @default.
- W4313594098 date "2023-04-01" @default.
- W4313594098 modified "2023-10-15" @default.
- W4313594098 title "Late multimodal fusion for image and audio music transcription" @default.
- W4313594098 cites W1972594981 @default.
- W4313594098 cites W1993721840 @default.
- W4313594098 cites W2045220951 @default.
- W4313594098 cites W2087064593 @default.
- W4313594098 cites W2105143211 @default.
- W4313594098 cites W2106646384 @default.
- W4313594098 cites W2111894689 @default.
- W4313594098 cites W2148436391 @default.
- W4313594098 cites W2280355185 @default.
- W4313594098 cites W2485144538 @default.
- W4313594098 cites W2796517058 @default.
- W4313594098 cites W2901920821 @default.
- W4313594098 cites W2906214917 @default.
- W4313594098 cites W2963537349 @default.
- W4313594098 cites W2972764791 @default.
- W4313594098 cites W3103821002 @default.
- W4313594098 cites W4240592325 @default.
- W4313594098 doi "https://doi.org/10.1016/j.eswa.2022.119491" @default.
- W4313594098 hasPublicationYear "2023" @default.
- W4313594098 type Work @default.
- W4313594098 citedByCount "2" @default.
- W4313594098 countsByYear W43135940982023 @default.
- W4313594098 crossrefType "journal-article" @default.
- W4313594098 hasAuthorship W4313594098A5005281683 @default.
- W4313594098 hasAuthorship W4313594098A5041459269 @default.
- W4313594098 hasAuthorship W4313594098A5080444506 @default.
- W4313594098 hasAuthorship W4313594098A5085151278 @default.
- W4313594098 hasBestOaLocation W43135940981 @default.
- W4313594098 hasConcept C138885662 @default.
- W4313594098 hasConcept C13895895 @default.
- W4313594098 hasConcept C144024400 @default.
- W4313594098 hasConcept C154945302 @default.
- W4313594098 hasConcept C179926584 @default.
- W4313594098 hasConcept C197129107 @default.
- W4313594098 hasConcept C204321447 @default.
- W4313594098 hasConcept C23123220 @default.
- W4313594098 hasConcept C2779903281 @default.
- W4313594098 hasConcept C28490314 @default.
- W4313594098 hasConcept C36289849 @default.
- W4313594098 hasConcept C40969351 @default.
- W4313594098 hasConcept C41008148 @default.
- W4313594098 hasConcept C41895202 @default.
- W4313594098 hasConcept C64922751 @default.
- W4313594098 hasConcept C87687168 @default.
- W4313594098 hasConceptScore W4313594098C138885662 @default.
- W4313594098 hasConceptScore W4313594098C13895895 @default.
- W4313594098 hasConceptScore W4313594098C144024400 @default.
- W4313594098 hasConceptScore W4313594098C154945302 @default.
- W4313594098 hasConceptScore W4313594098C179926584 @default.
- W4313594098 hasConceptScore W4313594098C197129107 @default.
- W4313594098 hasConceptScore W4313594098C204321447 @default.
- W4313594098 hasConceptScore W4313594098C23123220 @default.
- W4313594098 hasConceptScore W4313594098C2779903281 @default.
- W4313594098 hasConceptScore W4313594098C28490314 @default.
- W4313594098 hasConceptScore W4313594098C36289849 @default.
- W4313594098 hasConceptScore W4313594098C40969351 @default.
- W4313594098 hasConceptScore W4313594098C41008148 @default.
- W4313594098 hasConceptScore W4313594098C41895202 @default.
- W4313594098 hasConceptScore W4313594098C64922751 @default.
- W4313594098 hasConceptScore W4313594098C87687168 @default.
- W4313594098 hasLocation W43135940981 @default.
- W4313594098 hasLocation W43135940982 @default.
- W4313594098 hasLocation W43135940983 @default.
- W4313594098 hasOpenAccess W4313594098 @default.
- W4313594098 hasPrimaryLocation W43135940981 @default.
- W4313594098 hasRelatedWork W1751699554 @default.
- W4313594098 hasRelatedWork W1982907196 @default.
- W4313594098 hasRelatedWork W2008308193 @default.
- W4313594098 hasRelatedWork W2017555767 @default.
- W4313594098 hasRelatedWork W2123700259 @default.
- W4313594098 hasRelatedWork W2147011861 @default.
- W4313594098 hasRelatedWork W2401572723 @default.
- W4313594098 hasRelatedWork W2505877856 @default.
- W4313594098 hasRelatedWork W2707788252 @default.
- W4313594098 hasRelatedWork W2763412546 @default.
- W4313594098 hasVolume "216" @default.
- W4313594098 isParatext "false" @default.
- W4313594098 isRetracted "false" @default.
- W4313594098 workType "article" @default.