Matches in SemOpenAlex for { <https://semopenalex.org/work/W3109643012> ?p ?o ?g. }
- W3109643012 endingPage "67" @default.
- W3109643012 startingPage "51" @default.
- W3109643012 abstract "We introduce the task of Image-Set Visual Question Answering (ISVQA), which generalizes the commonly studied single-image VQA problem to multi-image settings. Taking a natural language question and a set of images as input, it aims to answer the question based on the content of the images. The questions can be about objects and relationships in one or more images or about the entire scene depicted by the image set. To enable research in this new topic, we introduce two ISVQA datasets – indoor and outdoor scenes. They simulate the real-world scenarios of indoor image collections and multiple car-mounted cameras, respectively. The indoor-scene dataset contains 91,479 human-annotated questions for 48,138 image sets, and the outdoor-scene dataset has 49,617 questions for 12,746 image sets. We analyze the properties of the two datasets, including question-and-answer distributions, types of questions, biases in dataset, and question-image dependencies. We also build new baseline models to investigate new research challenges in ISVQA." @default.
- W3109643012 created "2020-12-07" @default.
- W3109643012 creator A5009101133 @default.
- W3109643012 creator A5023195442 @default.
- W3109643012 creator A5084058732 @default.
- W3109643012 date "2020-01-01" @default.
- W3109643012 modified "2023-10-10" @default.
- W3109643012 title "Visual Question Answering on Image Sets" @default.
- W3109643012 cites W1924121366 @default.
- W3109643012 cites W1933349210 @default.
- W3109643012 cites W2250539671 @default.
- W3109643012 cites W2277195237 @default.
- W3109643012 cites W2561715562 @default.
- W3109643012 cites W2606982687 @default.
- W3109643012 cites W2745461083 @default.
- W3109643012 cites W2798786641 @default.
- W3109643012 cites W2890399523 @default.
- W3109643012 cites W2947312908 @default.
- W3109643012 cites W2950697717 @default.
- W3109643012 cites W2953127211 @default.
- W3109643012 cites W2954199749 @default.
- W3109643012 cites W2962684798 @default.
- W3109643012 cites W2962749469 @default.
- W3109643012 cites W2962994687 @default.
- W3109643012 cites W2963260436 @default.
- W3109643012 cites W2963518342 @default.
- W3109643012 cites W2963622213 @default.
- W3109643012 cites W2963890755 @default.
- W3109643012 cites W2964067226 @default.
- W3109643012 cites W2964146787 @default.
- W3109643012 cites W2970231061 @default.
- W3109643012 cites W2979382951 @default.
- W3109643012 cites W3009928773 @default.
- W3109643012 cites W3016211260 @default.
- W3109643012 cites W3034636873 @default.
- W3109643012 cites W3035574168 @default.
- W3109643012 doi "https://doi.org/10.1007/978-3-030-58589-1_4" @default.
- W3109643012 hasPublicationYear "2020" @default.
- W3109643012 type Work @default.
- W3109643012 sameAs 3109643012 @default.
- W3109643012 citedByCount "10" @default.
- W3109643012 countsByYear W31096430122021 @default.
- W3109643012 countsByYear W31096430122022 @default.
- W3109643012 countsByYear W31096430122023 @default.
- W3109643012 crossrefType "book-chapter" @default.
- W3109643012 hasAuthorship W3109643012A5009101133 @default.
- W3109643012 hasAuthorship W3109643012A5023195442 @default.
- W3109643012 hasAuthorship W3109643012A5084058732 @default.
- W3109643012 hasBestOaLocation W31096430122 @default.
- W3109643012 hasConcept C111368507 @default.
- W3109643012 hasConcept C115961682 @default.
- W3109643012 hasConcept C12725497 @default.
- W3109643012 hasConcept C127313418 @default.
- W3109643012 hasConcept C153180895 @default.
- W3109643012 hasConcept C154945302 @default.
- W3109643012 hasConcept C162324750 @default.
- W3109643012 hasConcept C166957645 @default.
- W3109643012 hasConcept C177264268 @default.
- W3109643012 hasConcept C187736073 @default.
- W3109643012 hasConcept C199360897 @default.
- W3109643012 hasConcept C205649164 @default.
- W3109643012 hasConcept C23123220 @default.
- W3109643012 hasConcept C2776608160 @default.
- W3109643012 hasConcept C2780451532 @default.
- W3109643012 hasConcept C31972630 @default.
- W3109643012 hasConcept C41008148 @default.
- W3109643012 hasConcept C44291984 @default.
- W3109643012 hasConceptScore W3109643012C111368507 @default.
- W3109643012 hasConceptScore W3109643012C115961682 @default.
- W3109643012 hasConceptScore W3109643012C12725497 @default.
- W3109643012 hasConceptScore W3109643012C127313418 @default.
- W3109643012 hasConceptScore W3109643012C153180895 @default.
- W3109643012 hasConceptScore W3109643012C154945302 @default.
- W3109643012 hasConceptScore W3109643012C162324750 @default.
- W3109643012 hasConceptScore W3109643012C166957645 @default.
- W3109643012 hasConceptScore W3109643012C177264268 @default.
- W3109643012 hasConceptScore W3109643012C187736073 @default.
- W3109643012 hasConceptScore W3109643012C199360897 @default.
- W3109643012 hasConceptScore W3109643012C205649164 @default.
- W3109643012 hasConceptScore W3109643012C23123220 @default.
- W3109643012 hasConceptScore W3109643012C2776608160 @default.
- W3109643012 hasConceptScore W3109643012C2780451532 @default.
- W3109643012 hasConceptScore W3109643012C31972630 @default.
- W3109643012 hasConceptScore W3109643012C41008148 @default.
- W3109643012 hasConceptScore W3109643012C44291984 @default.
- W3109643012 hasLocation W31096430121 @default.
- W3109643012 hasLocation W31096430122 @default.
- W3109643012 hasOpenAccess W3109643012 @default.
- W3109643012 hasPrimaryLocation W31096430121 @default.
- W3109643012 hasRelatedWork W1893928041 @default.
- W3109643012 hasRelatedWork W2005185696 @default.
- W3109643012 hasRelatedWork W2051167396 @default.
- W3109643012 hasRelatedWork W2129745818 @default.
- W3109643012 hasRelatedWork W2135033253 @default.
- W3109643012 hasRelatedWork W2139771701 @default.
- W3109643012 hasRelatedWork W219090214 @default.
- W3109643012 hasRelatedWork W2608030593 @default.
- W3109643012 hasRelatedWork W2611071287 @default.