Matches in SemOpenAlex for { <https://semopenalex.org/work/W2775506363> ?p ?o ?g. }
- W2775506363 abstract "Video captioning is the task of automatically generating a textual description of the actions in a video. Although previous work (e.g. sequence-to-sequence model) has shown promising results in abstracting a coarse description of a short video, it is still very challenging to caption a video containing multiple fine-grained actions with a detailed description. This paper aims to address the challenge by proposing a novel hierarchical reinforcement learning framework for video captioning, where a high-level Manager module learns to design sub-goals and a low-level Worker module recognizes the primitive actions to fulfill the sub-goal. With this compositional framework to reinforce video captioning at different levels, our approach significantly outperforms all the baseline methods on a newly introduced large-scale dataset for fine-grained video captioning. Furthermore, our non-ensemble model has already achieved the state-of-the-art results on the widely-used MSR-VTT dataset." @default.
- W2775506363 created "2017-12-22" @default.
- W2775506363 creator A5004034770 @default.
- W2775506363 creator A5009633508 @default.
- W2775506363 creator A5031017740 @default.
- W2775506363 creator A5050195037 @default.
- W2775506363 creator A5063070498 @default.
- W2775506363 date "2017-11-29" @default.
- W2775506363 modified "2023-10-18" @default.
- W2775506363 title "Video Captioning via Hierarchical Reinforcement Learning" @default.
- W2775506363 cites W1596841185 @default.
- W2775506363 cites W1889081078 @default.
- W2775506363 cites W1957740064 @default.
- W2775506363 cites W2064675550 @default.
- W2775506363 cites W2095705004 @default.
- W2775506363 cites W2119717200 @default.
- W2775506363 cites W2123442489 @default.
- W2775506363 cites W2131774270 @default.
- W2775506363 cites W2136036867 @default.
- W2775506363 cites W2157331557 @default.
- W2775506363 cites W2165150801 @default.
- W2775506363 cites W2173248099 @default.
- W2775506363 cites W2176263492 @default.
- W2775506363 cites W2204302769 @default.
- W2775506363 cites W2273041409 @default.
- W2775506363 cites W2335959470 @default.
- W2775506363 cites W2425121537 @default.
- W2775506363 cites W2487501366 @default.
- W2775506363 cites W2523993696 @default.
- W2775506363 cites W2527349934 @default.
- W2775506363 cites W2560313346 @default.
- W2775506363 cites W2610163825 @default.
- W2775506363 cites W2742943414 @default.
- W2775506363 cites W2949118724 @default.
- W2775506363 cites W2949267040 @default.
- W2775506363 cites W2949376505 @default.
- W2775506363 cites W2949650786 @default.
- W2775506363 cites W2949964922 @default.
- W2775506363 cites W2950019618 @default.
- W2775506363 cites W2950304420 @default.
- W2775506363 cites W2950307714 @default.
- W2775506363 cites W2951225148 @default.
- W2775506363 cites W2951780762 @default.
- W2775506363 cites W2951813108 @default.
- W2775506363 cites W2951837690 @default.
- W2775506363 cites W2952103128 @default.
- W2775506363 cites W2952633803 @default.
- W2775506363 cites W2952686080 @default.
- W2775506363 cites W2964065937 @default.
- W2775506363 cites W2964308564 @default.
- W2775506363 cites W6908809 @default.
- W2775506363 doi "https://doi.org/10.48550/arxiv.1711.11135" @default.
- W2775506363 hasPublicationYear "2017" @default.
- W2775506363 type Work @default.
- W2775506363 sameAs 2775506363 @default.
- W2775506363 citedByCount "6" @default.
- W2775506363 countsByYear W27755063632018 @default.
- W2775506363 countsByYear W27755063632020 @default.
- W2775506363 countsByYear W27755063632021 @default.
- W2775506363 crossrefType "posted-content" @default.
- W2775506363 hasAuthorship W2775506363A5004034770 @default.
- W2775506363 hasAuthorship W2775506363A5009633508 @default.
- W2775506363 hasAuthorship W2775506363A5031017740 @default.
- W2775506363 hasAuthorship W2775506363A5050195037 @default.
- W2775506363 hasAuthorship W2775506363A5063070498 @default.
- W2775506363 hasBestOaLocation W27755063631 @default.
- W2775506363 hasConcept C111368507 @default.
- W2775506363 hasConcept C115961682 @default.
- W2775506363 hasConcept C121332964 @default.
- W2775506363 hasConcept C12725497 @default.
- W2775506363 hasConcept C127313418 @default.
- W2775506363 hasConcept C154945302 @default.
- W2775506363 hasConcept C157657479 @default.
- W2775506363 hasConcept C162324750 @default.
- W2775506363 hasConcept C187736073 @default.
- W2775506363 hasConcept C199360897 @default.
- W2775506363 hasConcept C204321447 @default.
- W2775506363 hasConcept C2778112365 @default.
- W2775506363 hasConcept C2778755073 @default.
- W2775506363 hasConcept C2780451532 @default.
- W2775506363 hasConcept C41008148 @default.
- W2775506363 hasConcept C48103436 @default.
- W2775506363 hasConcept C54355233 @default.
- W2775506363 hasConcept C62520636 @default.
- W2775506363 hasConcept C86803240 @default.
- W2775506363 hasConcept C97541855 @default.
- W2775506363 hasConceptScore W2775506363C111368507 @default.
- W2775506363 hasConceptScore W2775506363C115961682 @default.
- W2775506363 hasConceptScore W2775506363C121332964 @default.
- W2775506363 hasConceptScore W2775506363C12725497 @default.
- W2775506363 hasConceptScore W2775506363C127313418 @default.
- W2775506363 hasConceptScore W2775506363C154945302 @default.
- W2775506363 hasConceptScore W2775506363C157657479 @default.
- W2775506363 hasConceptScore W2775506363C162324750 @default.
- W2775506363 hasConceptScore W2775506363C187736073 @default.
- W2775506363 hasConceptScore W2775506363C199360897 @default.
- W2775506363 hasConceptScore W2775506363C204321447 @default.
- W2775506363 hasConceptScore W2775506363C2778112365 @default.
- W2775506363 hasConceptScore W2775506363C2778755073 @default.
- W2775506363 hasConceptScore W2775506363C2780451532 @default.