Matches in SemOpenAlex for { <https://semopenalex.org/work/W2950371778> ?p ?o ?g. }
- W2950371778 abstract "Dense captioning is a newly emerging computer vision topic for understanding images with dense language descriptions. The goal is to densely detect visual concepts (e.g., objects, object parts, and interactions between them) from images, labeling each with a short descriptive phrase. We identify two key challenges of dense captioning that need to be properly addressed when tackling the problem. First, dense visual concept annotations in each image are associated with highly overlapping target regions, making accurate localization of each visual concept challenging. Second, the large amount of visual concepts makes it hard to recognize each of them by appearance alone. We propose a new model pipeline based on two novel ideas, joint inference and context fusion, to alleviate these two challenges. We design our model architecture in a methodical manner and thoroughly evaluate the variations in architecture. Our final model, compact and efficient, achieves state-of-the-art accuracy on Visual Genome for dense captioning with a relative gain of 73% compared to the previous best algorithm. Qualitative experiments also reveal the semantic capabilities of our model in dense captioning." @default.
- W2950371778 created "2019-06-27" @default.
- W2950371778 creator A5009049500 @default.
- W2950371778 creator A5038815767 @default.
- W2950371778 creator A5042544148 @default.
- W2950371778 creator A5043162191 @default.
- W2950371778 date "2016-11-21" @default.
- W2950371778 modified "2023-09-24" @default.
- W2950371778 title "Dense Captioning with Joint Inference and Visual Context" @default.
- W2950371778 cites W1514535095 @default.
- W2950371778 cites W1686810756 @default.
- W2950371778 cites W1785460851 @default.
- W2950371778 cites W179875071 @default.
- W2950371778 cites W1889081078 @default.
- W2950371778 cites W1905882502 @default.
- W2950371778 cites W196214544 @default.
- W2950371778 cites W1971129545 @default.
- W2950371778 cites W2064675550 @default.
- W2950371778 cites W2101105183 @default.
- W2950371778 cites W2108598243 @default.
- W2950371778 cites W2112796928 @default.
- W2950371778 cites W2112912048 @default.
- W2950371778 cites W2123301721 @default.
- W2950371778 cites W2125215748 @default.
- W2950371778 cites W2132339004 @default.
- W2950371778 cites W2141364309 @default.
- W2950371778 cites W2144960104 @default.
- W2950371778 cites W2159243025 @default.
- W2950371778 cites W2185175083 @default.
- W2950371778 cites W2277195237 @default.
- W2950371778 cites W2438869444 @default.
- W2950371778 cites W2505639562 @default.
- W2950371778 cites W2594494421 @default.
- W2950371778 cites W2949107813 @default.
- W2950371778 cites W2949769367 @default.
- W2950371778 cites W2950094539 @default.
- W2950371778 cites W2950761309 @default.
- W2950371778 cites W2951183276 @default.
- W2950371778 cites W2951548327 @default.
- W2950371778 cites W2951638509 @default.
- W2950371778 cites W2951912364 @default.
- W2950371778 cites W2952246170 @default.
- W2950371778 cites W2952574180 @default.
- W2950371778 cites W2952632681 @default.
- W2950371778 cites W2953106684 @default.
- W2950371778 cites W2953238423 @default.
- W2950371778 cites W2963735856 @default.
- W2950371778 cites W2963758027 @default.
- W2950371778 cites W3106250896 @default.
- W2950371778 cites W603908379 @default.
- W2950371778 doi "https://doi.org/10.48550/arxiv.1611.06949" @default.
- W2950371778 hasPublicationYear "2016" @default.
- W2950371778 type Work @default.
- W2950371778 sameAs 2950371778 @default.
- W2950371778 citedByCount "5" @default.
- W2950371778 countsByYear W29503717782017 @default.
- W2950371778 countsByYear W29503717782018 @default.
- W2950371778 countsByYear W29503717782019 @default.
- W2950371778 countsByYear W29503717782020 @default.
- W2950371778 crossrefType "posted-content" @default.
- W2950371778 hasAuthorship W2950371778A5009049500 @default.
- W2950371778 hasAuthorship W2950371778A5038815767 @default.
- W2950371778 hasAuthorship W2950371778A5042544148 @default.
- W2950371778 hasAuthorship W2950371778A5043162191 @default.
- W2950371778 hasBestOaLocation W29503717781 @default.
- W2950371778 hasConcept C115961682 @default.
- W2950371778 hasConcept C123657996 @default.
- W2950371778 hasConcept C142362112 @default.
- W2950371778 hasConcept C151730666 @default.
- W2950371778 hasConcept C153349607 @default.
- W2950371778 hasConcept C154945302 @default.
- W2950371778 hasConcept C157657479 @default.
- W2950371778 hasConcept C199360897 @default.
- W2950371778 hasConcept C204321447 @default.
- W2950371778 hasConcept C2776214188 @default.
- W2950371778 hasConcept C2776224158 @default.
- W2950371778 hasConcept C2779343474 @default.
- W2950371778 hasConcept C2781238097 @default.
- W2950371778 hasConcept C36464697 @default.
- W2950371778 hasConcept C41008148 @default.
- W2950371778 hasConcept C43521106 @default.
- W2950371778 hasConcept C86803240 @default.
- W2950371778 hasConceptScore W2950371778C115961682 @default.
- W2950371778 hasConceptScore W2950371778C123657996 @default.
- W2950371778 hasConceptScore W2950371778C142362112 @default.
- W2950371778 hasConceptScore W2950371778C151730666 @default.
- W2950371778 hasConceptScore W2950371778C153349607 @default.
- W2950371778 hasConceptScore W2950371778C154945302 @default.
- W2950371778 hasConceptScore W2950371778C157657479 @default.
- W2950371778 hasConceptScore W2950371778C199360897 @default.
- W2950371778 hasConceptScore W2950371778C204321447 @default.
- W2950371778 hasConceptScore W2950371778C2776214188 @default.
- W2950371778 hasConceptScore W2950371778C2776224158 @default.
- W2950371778 hasConceptScore W2950371778C2779343474 @default.
- W2950371778 hasConceptScore W2950371778C2781238097 @default.
- W2950371778 hasConceptScore W2950371778C36464697 @default.
- W2950371778 hasConceptScore W2950371778C41008148 @default.
- W2950371778 hasConceptScore W2950371778C43521106 @default.
- W2950371778 hasConceptScore W2950371778C86803240 @default.
- W2950371778 hasLocation W29503717781 @default.