Matches in SemOpenAlex for { <https://semopenalex.org/work/W4380450071> ?p ?o ?g. }
Showing items 1 to 69 of
69
with 100 items per page.
- W4380450071 abstract "Abstract In this work, we perform zero-shot image captioning and visual question answering on images using a simple model composition framework, composing the dense image captioning capabilities of a visual language model with the powerful reasoning abilities of a large language model, ChatGPT. The proposed method utilizes zero-shot learning to enable cross-modal integration of vision and language in order to create a comprehensive visual language model. We achieve zero-shot state-of-the-art performance on VQAv2, demonstrating its effectiveness and high accuracy. The method's simplicity makes it highly scalable and adaptable to a wide range of applications, including integration from OpenAI’s multimodal model, GPT-4, with audio language models in the future. The results demonstrate the vast potential of this simple zero-shot framework in improving the accuracy and relevance of vision and language applications, constituting an effective approach to image captioning and visual question answering, as well as future multimodal composition." @default.
- W4380450071 created "2023-06-14" @default.
- W4380450071 creator A5011690697 @default.
- W4380450071 date "2023-06-13" @default.
- W4380450071 modified "2023-09-27" @default.
- W4380450071 title "Simplifying Multimodal Composition: A Novel Zero-shot Framework to Visual Question Answering and Image Captioning" @default.
- W4380450071 cites W4226182655 @default.
- W4380450071 cites W4281557260 @default.
- W4380450071 cites W4308244910 @default.
- W4380450071 cites W4311000453 @default.
- W4380450071 doi "https://doi.org/10.21203/rs.3.rs-3027308/v1" @default.
- W4380450071 hasPublicationYear "2023" @default.
- W4380450071 type Work @default.
- W4380450071 citedByCount "0" @default.
- W4380450071 crossrefType "posted-content" @default.
- W4380450071 hasAuthorship W4380450071A5011690697 @default.
- W4380450071 hasBestOaLocation W43804500711 @default.
- W4380450071 hasConcept C111472728 @default.
- W4380450071 hasConcept C115961682 @default.
- W4380450071 hasConcept C138885662 @default.
- W4380450071 hasConcept C154945302 @default.
- W4380450071 hasConcept C157657479 @default.
- W4380450071 hasConcept C158154518 @default.
- W4380450071 hasConcept C17744445 @default.
- W4380450071 hasConcept C178790620 @default.
- W4380450071 hasConcept C185592680 @default.
- W4380450071 hasConcept C199539241 @default.
- W4380450071 hasConcept C204321447 @default.
- W4380450071 hasConcept C2776372474 @default.
- W4380450071 hasConcept C2778344882 @default.
- W4380450071 hasConcept C2780813799 @default.
- W4380450071 hasConcept C2780878386 @default.
- W4380450071 hasConcept C41008148 @default.
- W4380450071 hasConcept C41895202 @default.
- W4380450071 hasConcept C44291984 @default.
- W4380450071 hasConceptScore W4380450071C111472728 @default.
- W4380450071 hasConceptScore W4380450071C115961682 @default.
- W4380450071 hasConceptScore W4380450071C138885662 @default.
- W4380450071 hasConceptScore W4380450071C154945302 @default.
- W4380450071 hasConceptScore W4380450071C157657479 @default.
- W4380450071 hasConceptScore W4380450071C158154518 @default.
- W4380450071 hasConceptScore W4380450071C17744445 @default.
- W4380450071 hasConceptScore W4380450071C178790620 @default.
- W4380450071 hasConceptScore W4380450071C185592680 @default.
- W4380450071 hasConceptScore W4380450071C199539241 @default.
- W4380450071 hasConceptScore W4380450071C204321447 @default.
- W4380450071 hasConceptScore W4380450071C2776372474 @default.
- W4380450071 hasConceptScore W4380450071C2778344882 @default.
- W4380450071 hasConceptScore W4380450071C2780813799 @default.
- W4380450071 hasConceptScore W4380450071C2780878386 @default.
- W4380450071 hasConceptScore W4380450071C41008148 @default.
- W4380450071 hasConceptScore W4380450071C41895202 @default.
- W4380450071 hasConceptScore W4380450071C44291984 @default.
- W4380450071 hasLocation W43804500711 @default.
- W4380450071 hasOpenAccess W4380450071 @default.
- W4380450071 hasPrimaryLocation W43804500711 @default.
- W4380450071 hasRelatedWork W207304934 @default.
- W4380450071 hasRelatedWork W2295744208 @default.
- W4380450071 hasRelatedWork W2737766105 @default.
- W4380450071 hasRelatedWork W2745461083 @default.
- W4380450071 hasRelatedWork W2883011457 @default.
- W4380450071 hasRelatedWork W2951590222 @default.
- W4380450071 hasRelatedWork W2952322859 @default.
- W4380450071 hasRelatedWork W2971866238 @default.
- W4380450071 hasRelatedWork W4297834192 @default.
- W4380450071 hasRelatedWork W4377703168 @default.
- W4380450071 isParatext "false" @default.
- W4380450071 isRetracted "false" @default.
- W4380450071 workType "article" @default.