Matches in SemOpenAlex for { <https://semopenalex.org/work/W4320165839> ?p ?o ?g. }
Showing items 1 to 51 of
51
with 100 items per page.
- W4320165839 abstract "More than one hundred benchmarks have been developed to test the commonsense knowledge and commonsense reasoning abilities of artificial intelligence (AI) systems. However, these benchmarks are often flawed and many aspects of common sense remain untested. Consequently, we do not currently have any reliable way of measuring to what extent existing AI systems have achieved these abilities. This paper surveys the development and uses of AI commonsense benchmarks. We discuss the nature of common sense; the role of common sense in AI; the goals served by constructing commonsense benchmarks; and desirable features of commonsense benchmarks. We analyze the common flaws in benchmarks, and we argue that it is worthwhile to invest the work needed ensure that benchmark examples are consistently high quality. We survey the various methods of constructing commonsense benchmarks. We enumerate 139 commonsense benchmarks that have been developed: 102 text-based, 18 image-based, 12 video based, and 7 simulated physical environments. We discuss the gaps in the existing benchmarks and aspects of commonsense reasoning that are not addressed in any existing benchmark. We conclude with a number of recommendations for future development of commonsense AI benchmarks." @default.
- W4320165839 created "2023-02-13" @default.
- W4320165839 creator A5010452641 @default.
- W4320165839 date "2023-02-09" @default.
- W4320165839 modified "2023-09-29" @default.
- W4320165839 title "Benchmarks for Automated Commonsense Reasoning: A Survey" @default.
- W4320165839 doi "https://doi.org/10.48550/arxiv.2302.04752" @default.
- W4320165839 hasPublicationYear "2023" @default.
- W4320165839 type Work @default.
- W4320165839 citedByCount "0" @default.
- W4320165839 crossrefType "posted-content" @default.
- W4320165839 hasAuthorship W4320165839A5010452641 @default.
- W4320165839 hasBestOaLocation W43201658391 @default.
- W4320165839 hasConcept C111472728 @default.
- W4320165839 hasConcept C13280743 @default.
- W4320165839 hasConcept C138885662 @default.
- W4320165839 hasConcept C154945302 @default.
- W4320165839 hasConcept C161301231 @default.
- W4320165839 hasConcept C185798385 @default.
- W4320165839 hasConcept C193221554 @default.
- W4320165839 hasConcept C205649164 @default.
- W4320165839 hasConcept C2779814899 @default.
- W4320165839 hasConcept C30542707 @default.
- W4320165839 hasConcept C41008148 @default.
- W4320165839 hasConceptScore W4320165839C111472728 @default.
- W4320165839 hasConceptScore W4320165839C13280743 @default.
- W4320165839 hasConceptScore W4320165839C138885662 @default.
- W4320165839 hasConceptScore W4320165839C154945302 @default.
- W4320165839 hasConceptScore W4320165839C161301231 @default.
- W4320165839 hasConceptScore W4320165839C185798385 @default.
- W4320165839 hasConceptScore W4320165839C193221554 @default.
- W4320165839 hasConceptScore W4320165839C205649164 @default.
- W4320165839 hasConceptScore W4320165839C2779814899 @default.
- W4320165839 hasConceptScore W4320165839C30542707 @default.
- W4320165839 hasConceptScore W4320165839C41008148 @default.
- W4320165839 hasLocation W43201658391 @default.
- W4320165839 hasOpenAccess W4320165839 @default.
- W4320165839 hasPrimaryLocation W43201658391 @default.
- W4320165839 hasRelatedWork W2114653216 @default.
- W4320165839 hasRelatedWork W2151799802 @default.
- W4320165839 hasRelatedWork W2734382736 @default.
- W4320165839 hasRelatedWork W2971788065 @default.
- W4320165839 hasRelatedWork W3035925421 @default.
- W4320165839 hasRelatedWork W3103326498 @default.
- W4320165839 hasRelatedWork W3121101797 @default.
- W4320165839 hasRelatedWork W3152855350 @default.
- W4320165839 hasRelatedWork W3168455342 @default.
- W4320165839 hasRelatedWork W4289528260 @default.
- W4320165839 isParatext "false" @default.
- W4320165839 isRetracted "false" @default.
- W4320165839 workType "article" @default.