Matches in SemOpenAlex for { <https://semopenalex.org/work/W3044570745> ?p ?o ?g. }
- W3044570745 abstract "The task of retrieving video content relevant to natural language queries plays a critical role in effectively handling internet-scale datasets. Most of the existing methods for this caption-to-video retrieval problem do not fully exploit cross-modal cues present in video. Furthermore, they aggregate per-frame visual features with limited or no temporal information. In this paper, we present a multi-modal transformer to jointly encode the different modalities in video, which allows each of them to attend to the others. The transformer architecture is also leveraged to encode and model the temporal information. On the natural language side, we investigate the best practices to jointly optimize the language embedding together with the multi-modal transformer. This novel framework allows us to establish state-of-the-art results for video retrieval on three datasets. More details are available at this http URL." @default.
- W3044570745 created "2020-07-29" @default.
- W3044570745 creator A5001789528 @default.
- W3044570745 creator A5045217258 @default.
- W3044570745 creator A5049440980 @default.
- W3044570745 creator A5060145891 @default.
- W3044570745 date "2020-07-21" @default.
- W3044570745 modified "2023-09-27" @default.
- W3044570745 title "Multi-modal Transformer for Video Retrieval" @default.
- W3044570745 cites W1614298861 @default.
- W3044570745 cites W1957706851 @default.
- W3044570745 cites W1980867644 @default.
- W3044570745 cites W2064675550 @default.
- W3044570745 cites W2078238240 @default.
- W3044570745 cites W2112912048 @default.
- W3044570745 cites W2117154949 @default.
- W3044570745 cites W2156303437 @default.
- W3044570745 cites W2425121537 @default.
- W3044570745 cites W2525778437 @default.
- W3044570745 cites W2526050071 @default.
- W3044570745 cites W2565656701 @default.
- W3044570745 cites W2732026016 @default.
- W3044570745 cites W2796207103 @default.
- W3044570745 cites W2808399042 @default.
- W3044570745 cites W2883429621 @default.
- W3044570745 cites W2885775891 @default.
- W3044570745 cites W2897439619 @default.
- W3044570745 cites W2910905530 @default.
- W3044570745 cites W2953276893 @default.
- W3044570745 cites W2962964995 @default.
- W3044570745 cites W2963341956 @default.
- W3044570745 cites W2963403868 @default.
- W3044570745 cites W2963420686 @default.
- W3044570745 cites W2963446712 @default.
- W3044570745 cites W2963524571 @default.
- W3044570745 cites W2963902314 @default.
- W3044570745 cites W2963916161 @default.
- W3044570745 cites W2965458216 @default.
- W3044570745 cites W2968848930 @default.
- W3044570745 cites W2972073579 @default.
- W3044570745 cites W2975357369 @default.
- W3044570745 cites W2981851019 @default.
- W3044570745 cites W2984008963 @default.
- W3044570745 cites W2991208183 @default.
- W3044570745 cites W3035635319 @default.
- W3044570745 hasPublicationYear "2020" @default.
- W3044570745 type Work @default.
- W3044570745 sameAs 3044570745 @default.
- W3044570745 citedByCount "0" @default.
- W3044570745 crossrefType "posted-content" @default.
- W3044570745 hasAuthorship W3044570745A5001789528 @default.
- W3044570745 hasAuthorship W3044570745A5045217258 @default.
- W3044570745 hasAuthorship W3044570745A5049440980 @default.
- W3044570745 hasAuthorship W3044570745A5060145891 @default.
- W3044570745 hasConcept C104317684 @default.
- W3044570745 hasConcept C121332964 @default.
- W3044570745 hasConcept C123657996 @default.
- W3044570745 hasConcept C142362112 @default.
- W3044570745 hasConcept C153349607 @default.
- W3044570745 hasConcept C154945302 @default.
- W3044570745 hasConcept C165696696 @default.
- W3044570745 hasConcept C165801399 @default.
- W3044570745 hasConcept C185592680 @default.
- W3044570745 hasConcept C188027245 @default.
- W3044570745 hasConcept C195324797 @default.
- W3044570745 hasConcept C23123220 @default.
- W3044570745 hasConcept C38652104 @default.
- W3044570745 hasConcept C41008148 @default.
- W3044570745 hasConcept C41608201 @default.
- W3044570745 hasConcept C55493867 @default.
- W3044570745 hasConcept C62520636 @default.
- W3044570745 hasConcept C66322947 @default.
- W3044570745 hasConcept C66746571 @default.
- W3044570745 hasConcept C71139939 @default.
- W3044570745 hasConceptScore W3044570745C104317684 @default.
- W3044570745 hasConceptScore W3044570745C121332964 @default.
- W3044570745 hasConceptScore W3044570745C123657996 @default.
- W3044570745 hasConceptScore W3044570745C142362112 @default.
- W3044570745 hasConceptScore W3044570745C153349607 @default.
- W3044570745 hasConceptScore W3044570745C154945302 @default.
- W3044570745 hasConceptScore W3044570745C165696696 @default.
- W3044570745 hasConceptScore W3044570745C165801399 @default.
- W3044570745 hasConceptScore W3044570745C185592680 @default.
- W3044570745 hasConceptScore W3044570745C188027245 @default.
- W3044570745 hasConceptScore W3044570745C195324797 @default.
- W3044570745 hasConceptScore W3044570745C23123220 @default.
- W3044570745 hasConceptScore W3044570745C38652104 @default.
- W3044570745 hasConceptScore W3044570745C41008148 @default.
- W3044570745 hasConceptScore W3044570745C41608201 @default.
- W3044570745 hasConceptScore W3044570745C55493867 @default.
- W3044570745 hasConceptScore W3044570745C62520636 @default.
- W3044570745 hasConceptScore W3044570745C66322947 @default.
- W3044570745 hasConceptScore W3044570745C66746571 @default.
- W3044570745 hasConceptScore W3044570745C71139939 @default.
- W3044570745 hasLocation W30445707451 @default.
- W3044570745 hasOpenAccess W3044570745 @default.
- W3044570745 hasPrimaryLocation W30445707451 @default.
- W3044570745 hasRelatedWork W1494070235 @default.
- W3044570745 hasRelatedWork W1978183412 @default.
- W3044570745 hasRelatedWork W2054778702 @default.