Matches in SemOpenAlex for { <https://semopenalex.org/work/W4302047806> ?p ?o ?g. }
Showing items 1 to 62 of
62
with 100 items per page.
- W4302047806 abstract "Acoustic scene classification (ASC) aims to identify the type of scene (environment) in which a given audio signal is recorded. The log-mel feature and convolutional neural network (CNN) have recently become the most popular time-frequency (TF) feature representation and classifier in ASC. An audio signal recorded in a scene may include various sounds overlapping in time and frequency. The previous study suggests that separately considering the long-duration sounds and short-duration sounds in CNN may improve ASC accuracy. This study addresses the problem of the generalization ability of acoustic scene classifiers. In practice, acoustic scene signals' characteristics may be affected by various factors, such as the choice of recording devices and the change of recording locations. When an established ASC system predicts scene classes on audios recorded in unseen scenarios, its accuracy may drop significantly. The long-duration sounds not only contain domain-independent acoustic scene information, but also contain channel information determined by the recording conditions, which is prone to over-fitting. For a more robust ASC system, We propose a robust feature learning (RFL) framework to train the CNN. The RFL framework down-weights CNN learning specifically on long-duration sounds. The proposed method is to train an auxiliary classifier with only long-duration sound information as input. The auxiliary classifier is trained with an auxiliary loss function that assigns less learning weight to poorly classified examples than the standard cross-entropy loss. The experimental results show that the proposed RFL framework can obtain a more robust acoustic scene classifier towards unseen devices and cities." @default.
- W4302047806 created "2022-10-06" @default.
- W4302047806 creator A5001795601 @default.
- W4302047806 creator A5087187491 @default.
- W4302047806 date "2021-08-10" @default.
- W4302047806 modified "2023-09-27" @default.
- W4302047806 title "Robust Feature Learning on Long-Duration Sounds for Acoustic Scene Classification" @default.
- W4302047806 doi "https://doi.org/10.48550/arxiv.2108.05008" @default.
- W4302047806 hasPublicationYear "2021" @default.
- W4302047806 type Work @default.
- W4302047806 citedByCount "0" @default.
- W4302047806 crossrefType "posted-content" @default.
- W4302047806 hasAuthorship W4302047806A5001795601 @default.
- W4302047806 hasAuthorship W4302047806A5087187491 @default.
- W4302047806 hasBestOaLocation W43020478061 @default.
- W4302047806 hasConcept C112758219 @default.
- W4302047806 hasConcept C121332964 @default.
- W4302047806 hasConcept C138885662 @default.
- W4302047806 hasConcept C13895895 @default.
- W4302047806 hasConcept C153180895 @default.
- W4302047806 hasConcept C154945302 @default.
- W4302047806 hasConcept C24890656 @default.
- W4302047806 hasConcept C2776401178 @default.
- W4302047806 hasConcept C28490314 @default.
- W4302047806 hasConcept C41008148 @default.
- W4302047806 hasConcept C41895202 @default.
- W4302047806 hasConcept C59404180 @default.
- W4302047806 hasConcept C64922751 @default.
- W4302047806 hasConcept C81363708 @default.
- W4302047806 hasConcept C95623464 @default.
- W4302047806 hasConceptScore W4302047806C112758219 @default.
- W4302047806 hasConceptScore W4302047806C121332964 @default.
- W4302047806 hasConceptScore W4302047806C138885662 @default.
- W4302047806 hasConceptScore W4302047806C13895895 @default.
- W4302047806 hasConceptScore W4302047806C153180895 @default.
- W4302047806 hasConceptScore W4302047806C154945302 @default.
- W4302047806 hasConceptScore W4302047806C24890656 @default.
- W4302047806 hasConceptScore W4302047806C2776401178 @default.
- W4302047806 hasConceptScore W4302047806C28490314 @default.
- W4302047806 hasConceptScore W4302047806C41008148 @default.
- W4302047806 hasConceptScore W4302047806C41895202 @default.
- W4302047806 hasConceptScore W4302047806C59404180 @default.
- W4302047806 hasConceptScore W4302047806C64922751 @default.
- W4302047806 hasConceptScore W4302047806C81363708 @default.
- W4302047806 hasConceptScore W4302047806C95623464 @default.
- W4302047806 hasLocation W43020478061 @default.
- W4302047806 hasLocation W43020478062 @default.
- W4302047806 hasOpenAccess W4302047806 @default.
- W4302047806 hasPrimaryLocation W43020478061 @default.
- W4302047806 hasRelatedWork W2406522397 @default.
- W4302047806 hasRelatedWork W2613736958 @default.
- W4302047806 hasRelatedWork W2760085659 @default.
- W4302047806 hasRelatedWork W2768413403 @default.
- W4302047806 hasRelatedWork W2785535669 @default.
- W4302047806 hasRelatedWork W2905846897 @default.
- W4302047806 hasRelatedWork W2995914718 @default.
- W4302047806 hasRelatedWork W3093612317 @default.
- W4302047806 hasRelatedWork W4225852842 @default.
- W4302047806 hasRelatedWork W564581980 @default.
- W4302047806 isParatext "false" @default.
- W4302047806 isRetracted "false" @default.
- W4302047806 workType "article" @default.