Matches in SemOpenAlex for { <https://semopenalex.org/work/W4295215087> ?p ?o ?g. }
- W4295215087 endingPage "6818" @default.
- W4295215087 startingPage "6818" @default.
- W4295215087 abstract "The complexity of polyphonic sounds imposes numerous challenges on their classification. Especially in real life, polyphonic sound events have discontinuity and unstable time-frequency variations. Traditional single acoustic features cannot characterize the key feature information of the polyphonic sound event, and this deficiency results in poor model classification performance. In this paper, we propose a convolutional recurrent neural network model based on the temporal-frequency (TF) attention mechanism and feature space (FS) attention mechanism (TFFS-CRNN). The TFFS-CRNN model aggregates Log-Mel spectrograms and MFCCs feature as inputs, which contains the TF-attention module, the convolutional recurrent neural network (CRNN) module, the FS-attention module and the bidirectional gated recurrent unit (BGRU) module. In polyphonic sound events detection (SED), the TF-attention module can capture the critical temporal–frequency features more capably. The FS-attention module assigns different dynamically learnable weights to different dimensions of features. The TFFS-CRNN model improves the characterization of features for key feature information in polyphonic SED. By using two attention modules, the model can focus on semantically relevant time frames, key frequency bands, and important feature spaces. Finally, the BGRU module learns contextual information. The experiments were conducted on the DCASE 2016 Task3 dataset and the DCASE 2017 Task3 dataset. Experimental results show that the F1-score of the TFFS-CRNN model improved 12.4% and 25.2% compared with winning system models in DCASE challenge; the ER is reduced by 0.41 and 0.37 as well. The proposed TFFS-CRNN model algorithm has better classification performance and lower ER in polyphonic SED." @default.
- W4295215087 created "2022-09-12" @default.
- W4295215087 creator A5013877360 @default.
- W4295215087 creator A5035008919 @default.
- W4295215087 creator A5035322817 @default.
- W4295215087 creator A5061690729 @default.
- W4295215087 creator A5069680900 @default.
- W4295215087 date "2022-09-09" @default.
- W4295215087 modified "2023-09-30" @default.
- W4295215087 title "Polyphonic Sound Event Detection Using Temporal-Frequency Attention and Feature Space Attention" @default.
- W4295215087 cites W2036122775 @default.
- W4295215087 cites W2144414181 @default.
- W4295215087 cites W2164935924 @default.
- W4295215087 cites W2168441989 @default.
- W4295215087 cites W2292996718 @default.
- W4295215087 cites W2341412280 @default.
- W4295215087 cites W2408239454 @default.
- W4295215087 cites W2510931882 @default.
- W4295215087 cites W2566935005 @default.
- W4295215087 cites W2591013610 @default.
- W4295215087 cites W2883135409 @default.
- W4295215087 cites W2955179149 @default.
- W4295215087 cites W2962824709 @default.
- W4295215087 cites W2985454151 @default.
- W4295215087 cites W3006275583 @default.
- W4295215087 cites W3015683396 @default.
- W4295215087 cites W3043183554 @default.
- W4295215087 cites W3083566224 @default.
- W4295215087 cites W3098357269 @default.
- W4295215087 cites W3099026523 @default.
- W4295215087 cites W3169030202 @default.
- W4295215087 cites W3192834253 @default.
- W4295215087 cites W3208915815 @default.
- W4295215087 cites W3209152703 @default.
- W4295215087 cites W4206628220 @default.
- W4295215087 cites W4213209776 @default.
- W4295215087 cites W4200337780 @default.
- W4295215087 doi "https://doi.org/10.3390/s22186818" @default.
- W4295215087 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/36146166" @default.
- W4295215087 hasPublicationYear "2022" @default.
- W4295215087 type Work @default.
- W4295215087 citedByCount "0" @default.
- W4295215087 crossrefType "journal-article" @default.
- W4295215087 hasAuthorship W4295215087A5013877360 @default.
- W4295215087 hasAuthorship W4295215087A5035008919 @default.
- W4295215087 hasAuthorship W4295215087A5035322817 @default.
- W4295215087 hasAuthorship W4295215087A5061690729 @default.
- W4295215087 hasAuthorship W4295215087A5069680900 @default.
- W4295215087 hasBestOaLocation W42952150871 @default.
- W4295215087 hasConcept C121332964 @default.
- W4295215087 hasConcept C128979739 @default.
- W4295215087 hasConcept C138885662 @default.
- W4295215087 hasConcept C147168706 @default.
- W4295215087 hasConcept C153180895 @default.
- W4295215087 hasConcept C154945302 @default.
- W4295215087 hasConcept C24890656 @default.
- W4295215087 hasConcept C26517878 @default.
- W4295215087 hasConcept C2776401178 @default.
- W4295215087 hasConcept C28490314 @default.
- W4295215087 hasConcept C38652104 @default.
- W4295215087 hasConcept C41008148 @default.
- W4295215087 hasConcept C41895202 @default.
- W4295215087 hasConcept C45273575 @default.
- W4295215087 hasConcept C50644808 @default.
- W4295215087 hasConcept C81363708 @default.
- W4295215087 hasConceptScore W4295215087C121332964 @default.
- W4295215087 hasConceptScore W4295215087C128979739 @default.
- W4295215087 hasConceptScore W4295215087C138885662 @default.
- W4295215087 hasConceptScore W4295215087C147168706 @default.
- W4295215087 hasConceptScore W4295215087C153180895 @default.
- W4295215087 hasConceptScore W4295215087C154945302 @default.
- W4295215087 hasConceptScore W4295215087C24890656 @default.
- W4295215087 hasConceptScore W4295215087C26517878 @default.
- W4295215087 hasConceptScore W4295215087C2776401178 @default.
- W4295215087 hasConceptScore W4295215087C28490314 @default.
- W4295215087 hasConceptScore W4295215087C38652104 @default.
- W4295215087 hasConceptScore W4295215087C41008148 @default.
- W4295215087 hasConceptScore W4295215087C41895202 @default.
- W4295215087 hasConceptScore W4295215087C45273575 @default.
- W4295215087 hasConceptScore W4295215087C50644808 @default.
- W4295215087 hasConceptScore W4295215087C81363708 @default.
- W4295215087 hasIssue "18" @default.
- W4295215087 hasLocation W42952150871 @default.
- W4295215087 hasLocation W42952150872 @default.
- W4295215087 hasLocation W42952150873 @default.
- W4295215087 hasOpenAccess W4295215087 @default.
- W4295215087 hasPrimaryLocation W42952150871 @default.
- W4295215087 hasRelatedWork W2295021132 @default.
- W4295215087 hasRelatedWork W2767651786 @default.
- W4295215087 hasRelatedWork W2886673456 @default.
- W4295215087 hasRelatedWork W2907228390 @default.
- W4295215087 hasRelatedWork W2912288872 @default.
- W4295215087 hasRelatedWork W2936488316 @default.
- W4295215087 hasRelatedWork W3016107420 @default.
- W4295215087 hasRelatedWork W3091785813 @default.
- W4295215087 hasRelatedWork W3106036237 @default.
- W4295215087 hasRelatedWork W564581980 @default.
- W4295215087 hasVolume "22" @default.