Matches in SemOpenAlex for { <https://semopenalex.org/work/W4385477789> ?p ?o ?g. }
Showing items 1 to 94 of 94 with 100 items per page.
- W4385477789 abstract "In a range of recent works, object-centric architectures have been shown to be suitable for unsupervised scene decomposition in the vision domain. Inspired by these methods we present AudioSlots, a slot-centric generative model for blind source separation in the audio domain. AudioSlots is built using permutation-equivariant encoder and decoder networks. The encoder network based on the Transformer architecture learns to map a mixed audio spectrogram to an unordered set of independent source embeddings. The spatial broadcast decoder network learns to generate the source spectrograms from the source embeddings. We train the model in an end-to-end manner using a permutation invariant loss function. Our results on Libri2Mix speech separation constitute a proof of concept that this approach shows promise. We discuss the results and limitations of our approach in detail, and further outline potential ways to overcome the limitations and directions for future work." @default.
- W4385477789 created "2023-08-03" @default.
- W4385477789 creator A5025691113 @default.
- W4385477789 creator A5053447215 @default.
- W4385477789 creator A5082524152 @default.
- W4385477789 creator A5082808731 @default.
- W4385477789 creator A5086977582 @default.
- W4385477789 date "2023-06-04" @default.
- W4385477789 modified "2023-09-26" @default.
- W4385477789 title "Audioslots: A Slot-Centric Generative Model For Audio Separation" @default.
- W4385477789 cites W1482149378 @default.
- W4385477789 cites W1494198834 @default.
- W4385477789 cites W2046869671 @default.
- W4385477789 cites W2147455188 @default.
- W4385477789 cites W2194775991 @default.
- W4385477789 cites W2221409856 @default.
- W4385477789 cites W2558649592 @default.
- W4385477789 cites W2734774145 @default.
- W4385477789 cites W2952218014 @default.
- W4385477789 cites W2962715207 @default.
- W4385477789 cites W2964058413 @default.
- W4385477789 cites W2998490864 @default.
- W4385477789 cites W3015199127 @default.
- W4385477789 cites W3095095816 @default.
- W4385477789 cites W3109585842 @default.
- W4385477789 cites W3163652268 @default.
- W4385477789 cites W3177067699 @default.
- W4385477789 cites W4312819586 @default.
- W4385477789 doi "https://doi.org/10.1109/icasspw59220.2023.10193208" @default.
- W4385477789 hasPublicationYear "2023" @default.
- W4385477789 type Work @default.
- W4385477789 citedByCount "0" @default.
- W4385477789 crossrefType "proceedings-article" @default.
- W4385477789 hasAuthorship W4385477789A5025691113 @default.
- W4385477789 hasAuthorship W4385477789A5053447215 @default.
- W4385477789 hasAuthorship W4385477789A5082524152 @default.
- W4385477789 hasAuthorship W4385477789A5082808731 @default.
- W4385477789 hasAuthorship W4385477789A5086977582 @default.
- W4385477789 hasBestOaLocation W43854777892 @default.
- W4385477789 hasConcept C101738243 @default.
- W4385477789 hasConcept C111919701 @default.
- W4385477789 hasConcept C118505674 @default.
- W4385477789 hasConcept C121332964 @default.
- W4385477789 hasConcept C154945302 @default.
- W4385477789 hasConcept C165801399 @default.
- W4385477789 hasConcept C167966045 @default.
- W4385477789 hasConcept C190470478 @default.
- W4385477789 hasConcept C21308566 @default.
- W4385477789 hasConcept C24890656 @default.
- W4385477789 hasConcept C2776864781 @default.
- W4385477789 hasConcept C28490314 @default.
- W4385477789 hasConcept C37914503 @default.
- W4385477789 hasConcept C39890363 @default.
- W4385477789 hasConcept C41008148 @default.
- W4385477789 hasConcept C45273575 @default.
- W4385477789 hasConcept C50644808 @default.
- W4385477789 hasConcept C62520636 @default.
- W4385477789 hasConcept C66322947 @default.
- W4385477789 hasConceptScore W4385477789C101738243 @default.
- W4385477789 hasConceptScore W4385477789C111919701 @default.
- W4385477789 hasConceptScore W4385477789C118505674 @default.
- W4385477789 hasConceptScore W4385477789C121332964 @default.
- W4385477789 hasConceptScore W4385477789C154945302 @default.
- W4385477789 hasConceptScore W4385477789C165801399 @default.
- W4385477789 hasConceptScore W4385477789C167966045 @default.
- W4385477789 hasConceptScore W4385477789C190470478 @default.
- W4385477789 hasConceptScore W4385477789C21308566 @default.
- W4385477789 hasConceptScore W4385477789C24890656 @default.
- W4385477789 hasConceptScore W4385477789C2776864781 @default.
- W4385477789 hasConceptScore W4385477789C28490314 @default.
- W4385477789 hasConceptScore W4385477789C37914503 @default.
- W4385477789 hasConceptScore W4385477789C39890363 @default.
- W4385477789 hasConceptScore W4385477789C41008148 @default.
- W4385477789 hasConceptScore W4385477789C45273575 @default.
- W4385477789 hasConceptScore W4385477789C50644808 @default.
- W4385477789 hasConceptScore W4385477789C62520636 @default.
- W4385477789 hasConceptScore W4385477789C66322947 @default.
- W4385477789 hasLocation W43854777891 @default.
- W4385477789 hasLocation W43854777892 @default.
- W4385477789 hasOpenAccess W4385477789 @default.
- W4385477789 hasPrimaryLocation W43854777891 @default.
- W4385477789 hasRelatedWork W2886577208 @default.
- W4385477789 hasRelatedWork W2903766720 @default.
- W4385477789 hasRelatedWork W2963375116 @default.
- W4385477789 hasRelatedWork W3094316140 @default.
- W4385477789 hasRelatedWork W3114440105 @default.
- W4385477789 hasRelatedWork W3201240020 @default.
- W4385477789 hasRelatedWork W4286975102 @default.
- W4385477789 hasRelatedWork W4313303565 @default.
- W4385477789 hasRelatedWork W4319775894 @default.
- W4385477789 hasRelatedWork W4321854007 @default.
- W4385477789 isParatext "false" @default.
- W4385477789 isRetracted "false" @default.
- W4385477789 workType "article" @default.
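
The abstract above mentions training with a permutation invariant loss, since the model outputs an unordered set of sources. The following is a minimal sketch of that general idea, not the authors' implementation: it assumes a mean-squared-error per-source loss and a brute-force search over all orderings, and the names `mse` and `permutation_invariant_loss` are hypothetical. The loss actually used in the paper may differ.

```python
from itertools import permutations


def mse(a, b):
    """Mean squared error between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def permutation_invariant_loss(predicted, targets):
    """Return the lowest mean per-source MSE over all orderings of the
    predictions, together with the ordering that achieves it.

    Assumption: brute-force search, which is only practical for a small
    number of sources because the number of orderings grows factorially.
    """
    best_loss, best_perm = float("inf"), None
    for perm in permutations(range(len(predicted))):
        # perm[i] is the index of the prediction matched to targets[i].
        loss = sum(mse(predicted[p], t) for p, t in zip(perm, targets)) / len(targets)
        if loss < best_loss:
            best_loss, best_perm = loss, perm
    return best_loss, best_perm


# Toy "spectrograms" flattened to short vectors; predictions arrive in swapped order.
targets = [[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]]
predicted = [[0.0, 1.0, 1.0], [1.0, 0.0, 0.0]]
loss, perm = permutation_invariant_loss(predicted, targets)
print(loss, perm)
```

Because the loss takes the minimum over orderings, swapping the order of the predicted sources leaves its value unchanged, which is the property an unordered set of outputs needs.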