Matches in SemOpenAlex for { <https://semopenalex.org/work/W4385304448> ?p ?o ?g. }
- W4385304448 endingPage "110678" @default.
- W4385304448 startingPage "110678" @default.
- W4385304448 abstract "Bird song recognition plays an important function in ecosystem balance monitoring, biodiversity detection, and biodiversity conservation. Due to the complexity of the natural environment, based on deep learning, there is a problem of information loss in extracting audio features with a single filter. Identifying bird sounds efficiently and quickly is still a challenge. To address this problem, a lightweight multi-sensory field dual-feature fusion residual network (LDFSRE-NET) is proposed in this paper. Firstly, using the feature extraction filters based on Mel and SincNet to extract birdsong’s low-frequency and timbre information. The proposed dual-feature fusion module (FFMS) is used to fuse the low-frequency and timbre information with the differences between the two feature sets. Secondly, the double-layer residual module (DBNet), connected by basicblock and downblock, is used as the backbone network for bird song recognition to improve the training speed. To improve the different perceptual fields of the backbone network, the 3 × 3 convolutional modules in the basicblocks of the two residual modules are replaced with a Diverse Branch Block. They make the network performs better on recognition tasks under complex situations of multiple branches. Then, the ShuffleAttention attention module is embedded between the two layers of the residual module for transferring its valid information, enhancing the spectrogram ripple feature, and further improving the network’s recognition performance. Finally, extensive experiments are conducted on three datasets: the self-built 30-class bird song dataset (Birdselfdata), the public datasets Birdsdata and Urbansound8K. The model proposed in this paper surpasses the state-of-the-art sound recognition model methods in terms of efficiency and accuracy. The recognition accuracy on these three datasets of this model are 96.75%, 96.46%, and 97.98%, with the F1-score of 96.79%, 96.39%, and 97.88%." @default.
- W4385304448 created "2023-07-28" @default.
- W4385304448 creator A5001108111 @default.
- W4385304448 creator A5023817142 @default.
- W4385304448 creator A5031275848 @default.
- W4385304448 creator A5035582314 @default.
- W4385304448 creator A5060348513 @default.
- W4385304448 creator A5061696740 @default.
- W4385304448 date "2023-10-01" @default.
- W4385304448 modified "2023-10-14" @default.
- W4385304448 title "A lightweight multi-sensory field-based dual-feature fusion residual network for bird song recognition" @default.
- W4385304448 cites W1967057778 @default.
- W4385304448 cites W1974826336 @default.
- W4385304448 cites W2012685917 @default.
- W4385304448 cites W2023258474 @default.
- W4385304448 cites W2071004693 @default.
- W4385304448 cites W2073546306 @default.
- W4385304448 cites W2081793620 @default.
- W4385304448 cites W2108724231 @default.
- W4385304448 cites W2149241068 @default.
- W4385304448 cites W2563031223 @default.
- W4385304448 cites W2773245560 @default.
- W4385304448 cites W2901447936 @default.
- W4385304448 cites W2910136993 @default.
- W4385304448 cites W2962151887 @default.
- W4385304448 cites W2964370293 @default.
- W4385304448 cites W2969991197 @default.
- W4385304448 cites W2974670785 @default.
- W4385304448 cites W2977965380 @default.
- W4385304448 cites W2981036610 @default.
- W4385304448 cites W2981804305 @default.
- W4385304448 cites W2990214083 @default.
- W4385304448 cites W2992334633 @default.
- W4385304448 cites W2994125460 @default.
- W4385304448 cites W3008626935 @default.
- W4385304448 cites W3008768000 @default.
- W4385304448 cites W3011688396 @default.
- W4385304448 cites W3011702533 @default.
- W4385304448 cites W3080776956 @default.
- W4385304448 cites W3086154751 @default.
- W4385304448 cites W3092871726 @default.
- W4385304448 cites W3093766508 @default.
- W4385304448 cites W3164046946 @default.
- W4385304448 cites W3171038842 @default.
- W4385304448 cites W3177052299 @default.
- W4385304448 cites W3179495260 @default.
- W4385304448 cites W3195409179 @default.
- W4385304448 cites W3196747467 @default.
- W4385304448 cites W3196957369 @default.
- W4385304448 cites W3200034843 @default.
- W4385304448 cites W3217364055 @default.
- W4385304448 cites W4214547277 @default.
- W4385304448 cites W4220829848 @default.
- W4385304448 cites W4284899153 @default.
- W4385304448 cites W4286582832 @default.
- W4385304448 cites W4291184185 @default.
- W4385304448 cites W4296188778 @default.
- W4385304448 cites W4296886713 @default.
- W4385304448 cites W4302011122 @default.
- W4385304448 doi "https://doi.org/10.1016/j.asoc.2023.110678" @default.
- W4385304448 hasPublicationYear "2023" @default.
- W4385304448 type Work @default.
- W4385304448 citedByCount "0" @default.
- W4385304448 crossrefType "journal-article" @default.
- W4385304448 hasAuthorship W4385304448A5001108111 @default.
- W4385304448 hasAuthorship W4385304448A5023817142 @default.
- W4385304448 hasAuthorship W4385304448A5031275848 @default.
- W4385304448 hasAuthorship W4385304448A5035582314 @default.
- W4385304448 hasAuthorship W4385304448A5060348513 @default.
- W4385304448 hasAuthorship W4385304448A5061696740 @default.
- W4385304448 hasConcept C11413529 @default.
- W4385304448 hasConcept C138885662 @default.
- W4385304448 hasConcept C153180895 @default.
- W4385304448 hasConcept C154945302 @default.
- W4385304448 hasConcept C155512373 @default.
- W4385304448 hasConcept C202444582 @default.
- W4385304448 hasConcept C2524010 @default.
- W4385304448 hasConcept C2776401178 @default.
- W4385304448 hasConcept C2777210771 @default.
- W4385304448 hasConcept C33923547 @default.
- W4385304448 hasConcept C41008148 @default.
- W4385304448 hasConcept C41895202 @default.
- W4385304448 hasConcept C52622490 @default.
- W4385304448 hasConcept C9652623 @default.
- W4385304448 hasConceptScore W4385304448C11413529 @default.
- W4385304448 hasConceptScore W4385304448C138885662 @default.
- W4385304448 hasConceptScore W4385304448C153180895 @default.
- W4385304448 hasConceptScore W4385304448C154945302 @default.
- W4385304448 hasConceptScore W4385304448C155512373 @default.
- W4385304448 hasConceptScore W4385304448C202444582 @default.
- W4385304448 hasConceptScore W4385304448C2524010 @default.
- W4385304448 hasConceptScore W4385304448C2776401178 @default.
- W4385304448 hasConceptScore W4385304448C2777210771 @default.
- W4385304448 hasConceptScore W4385304448C33923547 @default.
- W4385304448 hasConceptScore W4385304448C41008148 @default.
- W4385304448 hasConceptScore W4385304448C41895202 @default.
- W4385304448 hasConceptScore W4385304448C52622490 @default.
- W4385304448 hasConceptScore W4385304448C9652623 @default.