Matches in SemOpenAlex for { <https://semopenalex.org/work/W3035392611> ?p ?o ?g. }
- W3035392611 abstract "Video captioning is a challenging task that requires a deep understanding of visual scenes. State-of-the-art methods generate captions using either scene-level or object-level information but without explicitly modeling object interactions. Thus, they often fail to make visually grounded predictions, and are sensitive to spurious correlations. In this paper, we propose a novel spatio-temporal graph model for video captioning that exploits object interactions in space and time. Our model builds interpretable links and is able to provide explicit visual grounding. To avoid unstable performance caused by the variable number of objects, we further propose an object-aware knowledge distillation mechanism, in which local object information is used to regularize global scene features. We demonstrate the efficacy of our approach through extensive experiments on two benchmarks, showing our approach yields competitive performance with interpretable predictions." @default.
- W3035392611 created "2020-06-19" @default.
- W3035392611 creator A5002304596 @default.
- W3035392611 creator A5018518655 @default.
- W3035392611 creator A5030739140 @default.
- W3035392611 creator A5052829033 @default.
- W3035392611 creator A5075018873 @default.
- W3035392611 creator A5078879036 @default.
- W3035392611 creator A5084008993 @default.
- W3035392611 date "2020-06-01" @default.
- W3035392611 modified "2023-10-10" @default.
- W3035392611 title "Spatio-Temporal Graph for Video Captioning With Knowledge Distillation" @default.
- W3035392611 cites W1522734439 @default.
- W3035392611 cites W1536680647 @default.
- W3035392611 cites W1596841185 @default.
- W3035392611 cites W1956340063 @default.
- W3035392611 cites W2077069816 @default.
- W3035392611 cites W2101105183 @default.
- W3035392611 cites W2108598243 @default.
- W3035392611 cites W2110933980 @default.
- W3035392611 cites W2139501017 @default.
- W3035392611 cites W2142900973 @default.
- W3035392611 cites W2194775991 @default.
- W3035392611 cites W2277195237 @default.
- W3035392611 cites W2342662179 @default.
- W3035392611 cites W2425121537 @default.
- W3035392611 cites W2940963663 @default.
- W3035392611 cites W2948358897 @default.
- W3035392611 cites W2955874753 @default.
- W3035392611 cites W2962681491 @default.
- W3035392611 cites W2962990649 @default.
- W3035392611 cites W2963091558 @default.
- W3035392611 cites W2963150697 @default.
- W3035392611 cites W2963524571 @default.
- W3035392611 cites W2963699792 @default.
- W3035392611 cites W2982515679 @default.
- W3035392611 cites W2984862483 @default.
- W3035392611 cites W2986953233 @default.
- W3035392611 cites W2988753485 @default.
- W3035392611 cites W753847829 @default.
- W3035392611 doi "https://doi.org/10.1109/cvpr42600.2020.01088" @default.
- W3035392611 hasPublicationYear "2020" @default.
- W3035392611 type Work @default.
- W3035392611 sameAs 3035392611 @default.
- W3035392611 citedByCount "136" @default.
- W3035392611 countsByYear W30353926112020 @default.
- W3035392611 countsByYear W30353926112021 @default.
- W3035392611 countsByYear W30353926112022 @default.
- W3035392611 countsByYear W30353926112023 @default.
- W3035392611 crossrefType "proceedings-article" @default.
- W3035392611 hasAuthorship W3035392611A5002304596 @default.
- W3035392611 hasAuthorship W3035392611A5018518655 @default.
- W3035392611 hasAuthorship W3035392611A5030739140 @default.
- W3035392611 hasAuthorship W3035392611A5052829033 @default.
- W3035392611 hasAuthorship W3035392611A5075018873 @default.
- W3035392611 hasAuthorship W3035392611A5078879036 @default.
- W3035392611 hasAuthorship W3035392611A5084008993 @default.
- W3035392611 hasBestOaLocation W30353926112 @default.
- W3035392611 hasConcept C115961682 @default.
- W3035392611 hasConcept C119857082 @default.
- W3035392611 hasConcept C132525143 @default.
- W3035392611 hasConcept C154945302 @default.
- W3035392611 hasConcept C157657479 @default.
- W3035392611 hasConcept C162324750 @default.
- W3035392611 hasConcept C165696696 @default.
- W3035392611 hasConcept C178790620 @default.
- W3035392611 hasConcept C179372163 @default.
- W3035392611 hasConcept C185592680 @default.
- W3035392611 hasConcept C187736073 @default.
- W3035392611 hasConcept C204030448 @default.
- W3035392611 hasConcept C205711294 @default.
- W3035392611 hasConcept C2780451532 @default.
- W3035392611 hasConcept C2781238097 @default.
- W3035392611 hasConcept C31972630 @default.
- W3035392611 hasConcept C38652104 @default.
- W3035392611 hasConcept C41008148 @default.
- W3035392611 hasConcept C80444323 @default.
- W3035392611 hasConcept C97256817 @default.
- W3035392611 hasConceptScore W3035392611C115961682 @default.
- W3035392611 hasConceptScore W3035392611C119857082 @default.
- W3035392611 hasConceptScore W3035392611C132525143 @default.
- W3035392611 hasConceptScore W3035392611C154945302 @default.
- W3035392611 hasConceptScore W3035392611C157657479 @default.
- W3035392611 hasConceptScore W3035392611C162324750 @default.
- W3035392611 hasConceptScore W3035392611C165696696 @default.
- W3035392611 hasConceptScore W3035392611C178790620 @default.
- W3035392611 hasConceptScore W3035392611C179372163 @default.
- W3035392611 hasConceptScore W3035392611C185592680 @default.
- W3035392611 hasConceptScore W3035392611C187736073 @default.
- W3035392611 hasConceptScore W3035392611C204030448 @default.
- W3035392611 hasConceptScore W3035392611C205711294 @default.
- W3035392611 hasConceptScore W3035392611C2780451532 @default.
- W3035392611 hasConceptScore W3035392611C2781238097 @default.
- W3035392611 hasConceptScore W3035392611C31972630 @default.
- W3035392611 hasConceptScore W3035392611C38652104 @default.
- W3035392611 hasConceptScore W3035392611C41008148 @default.
- W3035392611 hasConceptScore W3035392611C80444323 @default.
- W3035392611 hasConceptScore W3035392611C97256817 @default.
- W3035392611 hasLocation W30353926111 @default.
- W3035392611 hasLocation W30353926112 @default.