Matches in SemOpenAlex for { <https://semopenalex.org/work/W2891752192> ?p ?o ?g. }
- W2891752192 abstract "Image/video captioning based on neural networks can generate accurate descriptions. But how visual information is converted into a natural language representation remains an enigma. Existing caption-guided saliency methods take the entire sentence as input to generate a saliency map, which exposes the region-to-word mapping. However, visual information is not related to every word in a caption. We eliminate meaningless stop words such as ‘the’ and ‘of’ to avoid misleading results. We also utilize MFB (Multi-modal Factorized Bilinear Pooling) to fuse C3D features, which could provide richer spatiotemporal information to expose visual-word guided saliency. Such a system produces better spatiotemporal heatmaps for both predicted captions and arbitrary query sentences without introducing attentional layers. The experimental results on the MSR-VTT and Flickr30K datasets surpass the state of the art by a significant margin." @default.
- W2891752192 created "2018-09-27" @default.
- W2891752192 creator A5007776175 @default.
- W2891752192 creator A5024994504 @default.
- W2891752192 creator A5029104204 @default.
- W2891752192 creator A5041232609 @default.
- W2891752192 creator A5058448163 @default.
- W2891752192 date "2018-01-01" @default.
- W2891752192 modified "2023-09-23" @default.
- W2891752192 title "Text to Region: Visual-Word Guided Saliency Detection" @default.
- W2891752192 cites W1586939924 @default.
- W2891752192 cites W1592631677 @default.
- W2891752192 cites W1773149199 @default.
- W2891752192 cites W1849277567 @default.
- W2891752192 cites W1895577753 @default.
- W2891752192 cites W1895989618 @default.
- W2891752192 cites W1915485278 @default.
- W2891752192 cites W2047670868 @default.
- W2891752192 cites W2108598243 @default.
- W2891752192 cites W2183341477 @default.
- W2891752192 cites W2185175083 @default.
- W2891752192 cites W2221625691 @default.
- W2891752192 cites W2295107390 @default.
- W2891752192 cites W2425121537 @default.
- W2891752192 cites W2503388974 @default.
- W2891752192 cites W2563296158 @default.
- W2891752192 cites W2575842049 @default.
- W2891752192 cites W2963150162 @default.
- W2891752192 cites W2964241990 @default.
- W2891752192 cites W4239147634 @default.
- W2891752192 doi "https://doi.org/10.1007/978-3-030-00764-5_68" @default.
- W2891752192 hasPublicationYear "2018" @default.
- W2891752192 type Work @default.
- W2891752192 sameAs 2891752192 @default.
- W2891752192 citedByCount "0" @default.
- W2891752192 crossrefType "book-chapter" @default.
- W2891752192 hasAuthorship W2891752192A5007776175 @default.
- W2891752192 hasAuthorship W2891752192A5024994504 @default.
- W2891752192 hasAuthorship W2891752192A5029104204 @default.
- W2891752192 hasAuthorship W2891752192A5041232609 @default.
- W2891752192 hasAuthorship W2891752192A5058448163 @default.
- W2891752192 hasConcept C115961682 @default.
- W2891752192 hasConcept C119599485 @default.
- W2891752192 hasConcept C119857082 @default.
- W2891752192 hasConcept C127413603 @default.
- W2891752192 hasConcept C138885662 @default.
- W2891752192 hasConcept C141353440 @default.
- W2891752192 hasConcept C153180895 @default.
- W2891752192 hasConcept C154945302 @default.
- W2891752192 hasConcept C157657479 @default.
- W2891752192 hasConcept C17744445 @default.
- W2891752192 hasConcept C195324797 @default.
- W2891752192 hasConcept C199539241 @default.
- W2891752192 hasConcept C204321447 @default.
- W2891752192 hasConcept C2776359362 @default.
- W2891752192 hasConcept C2777530160 @default.
- W2891752192 hasConcept C28490314 @default.
- W2891752192 hasConcept C36464697 @default.
- W2891752192 hasConcept C41008148 @default.
- W2891752192 hasConcept C41895202 @default.
- W2891752192 hasConcept C70437156 @default.
- W2891752192 hasConcept C774472 @default.
- W2891752192 hasConcept C90805587 @default.
- W2891752192 hasConcept C94625758 @default.
- W2891752192 hasConceptScore W2891752192C115961682 @default.
- W2891752192 hasConceptScore W2891752192C119599485 @default.
- W2891752192 hasConceptScore W2891752192C119857082 @default.
- W2891752192 hasConceptScore W2891752192C127413603 @default.
- W2891752192 hasConceptScore W2891752192C138885662 @default.
- W2891752192 hasConceptScore W2891752192C141353440 @default.
- W2891752192 hasConceptScore W2891752192C153180895 @default.
- W2891752192 hasConceptScore W2891752192C154945302 @default.
- W2891752192 hasConceptScore W2891752192C157657479 @default.
- W2891752192 hasConceptScore W2891752192C17744445 @default.
- W2891752192 hasConceptScore W2891752192C195324797 @default.
- W2891752192 hasConceptScore W2891752192C199539241 @default.
- W2891752192 hasConceptScore W2891752192C204321447 @default.
- W2891752192 hasConceptScore W2891752192C2776359362 @default.
- W2891752192 hasConceptScore W2891752192C2777530160 @default.
- W2891752192 hasConceptScore W2891752192C28490314 @default.
- W2891752192 hasConceptScore W2891752192C36464697 @default.
- W2891752192 hasConceptScore W2891752192C41008148 @default.
- W2891752192 hasConceptScore W2891752192C41895202 @default.
- W2891752192 hasConceptScore W2891752192C70437156 @default.
- W2891752192 hasConceptScore W2891752192C774472 @default.
- W2891752192 hasConceptScore W2891752192C90805587 @default.
- W2891752192 hasConceptScore W2891752192C94625758 @default.
- W2891752192 hasLocation W28917521921 @default.
- W2891752192 hasOpenAccess W2891752192 @default.
- W2891752192 hasPrimaryLocation W28917521921 @default.
- W2891752192 hasRelatedWork W1472787671 @default.
- W2891752192 hasRelatedWork W1520140270 @default.
- W2891752192 hasRelatedWork W2033859430 @default.
- W2891752192 hasRelatedWork W2034197131 @default.
- W2891752192 hasRelatedWork W2053122719 @default.
- W2891752192 hasRelatedWork W2078699225 @default.
- W2891752192 hasRelatedWork W2102005949 @default.
- W2891752192 hasRelatedWork W2754902757 @default.
- W2891752192 hasRelatedWork W2757028014 @default.
- W2891752192 hasRelatedWork W2759512615 @default.