Matches in SemOpenAlex for { <https://semopenalex.org/work/W4386607580> ?p ?o ?g. }
Showing items 1 to 84 of
84
with 100 items per page.
- W4386607580 abstract "More than one hundred benchmarks have been developed to test the commonsense knowledge and commonsense reasoning abilities of artificial intelligence (AI) systems. However, these benchmarks are often flawed, and many aspects of common sense remain untested. Consequently, there is currently no reliable way of measuring to what extent existing AI systems have achieved these abilities. This paper surveys the development and uses of AI commonsense benchmarks. It enumerates 139 commonsense benchmarks that have been developed: 102 text-based, 18 image-based, 12 video-based, and 7 based in simulated physical environments. It gives more detailed descriptions of twelve of these, three from each category. It surveys the various methods used to construct commonsense benchmarks. It discusses the nature of common sense, the role of common sense in AI, the goals served by constructing commonsense benchmarks, desirable features of commonsense benchmarks, and flaws and gap in existing benchmarks. It concludes with a number of recommendations for future development of commonsense AI benchmarks; most importantly, that the creators of benchmarks invest the work needed to ensure that benchmark examples are consistently high quality." @default.
- W4386607580 created "2023-09-12" @default.
- W4386607580 creator A5010452641 @default.
- W4386607580 date "2023-09-11" @default.
- W4386607580 modified "2023-09-29" @default.
- W4386607580 title "Benchmarks for Automated Commonsense Reasoning: A Survey" @default.
- W4386607580 cites W1927052826 @default.
- W4386607580 cites W2011945332 @default.
- W4386607580 cites W2073302931 @default.
- W4386607580 cites W2088589173 @default.
- W4386607580 cites W2105539942 @default.
- W4386607580 cites W2110764733 @default.
- W4386607580 cites W2122143462 @default.
- W4386607580 cites W2139188400 @default.
- W4386607580 cites W2161484642 @default.
- W4386607580 cites W2163908084 @default.
- W4386607580 cites W2250384498 @default.
- W4386607580 cites W2277195237 @default.
- W4386607580 cites W2337252826 @default.
- W4386607580 cites W2476837489 @default.
- W4386607580 cites W2561529111 @default.
- W4386607580 cites W2804897457 @default.
- W4386607580 cites W2890894339 @default.
- W4386607580 cites W2914699769 @default.
- W4386607580 cites W2946609015 @default.
- W4386607580 cites W2963353834 @default.
- W4386607580 cites W2964303913 @default.
- W4386607580 cites W2971147709 @default.
- W4386607580 cites W3035733645 @default.
- W4386607580 cites W3099919888 @default.
- W4386607580 cites W3101767943 @default.
- W4386607580 cites W3102749280 @default.
- W4386607580 cites W3170403598 @default.
- W4386607580 cites W3175287561 @default.
- W4386607580 cites W3194676777 @default.
- W4386607580 cites W3201531807 @default.
- W4386607580 cites W3212464620 @default.
- W4386607580 cites W4213447308 @default.
- W4386607580 cites W4225661174 @default.
- W4386607580 cites W4306247398 @default.
- W4386607580 cites W4383875393 @default.
- W4386607580 doi "https://doi.org/10.1145/3615355" @default.
- W4386607580 hasPublicationYear "2023" @default.
- W4386607580 type Work @default.
- W4386607580 citedByCount "0" @default.
- W4386607580 crossrefType "journal-article" @default.
- W4386607580 hasAuthorship W4386607580A5010452641 @default.
- W4386607580 hasBestOaLocation W43866075801 @default.
- W4386607580 hasConcept C13280743 @default.
- W4386607580 hasConcept C154945302 @default.
- W4386607580 hasConcept C161301231 @default.
- W4386607580 hasConcept C185798385 @default.
- W4386607580 hasConcept C193221554 @default.
- W4386607580 hasConcept C199360897 @default.
- W4386607580 hasConcept C205649164 @default.
- W4386607580 hasConcept C2780801425 @default.
- W4386607580 hasConcept C30542707 @default.
- W4386607580 hasConcept C41008148 @default.
- W4386607580 hasConceptScore W4386607580C13280743 @default.
- W4386607580 hasConceptScore W4386607580C154945302 @default.
- W4386607580 hasConceptScore W4386607580C161301231 @default.
- W4386607580 hasConceptScore W4386607580C185798385 @default.
- W4386607580 hasConceptScore W4386607580C193221554 @default.
- W4386607580 hasConceptScore W4386607580C199360897 @default.
- W4386607580 hasConceptScore W4386607580C205649164 @default.
- W4386607580 hasConceptScore W4386607580C2780801425 @default.
- W4386607580 hasConceptScore W4386607580C30542707 @default.
- W4386607580 hasConceptScore W4386607580C41008148 @default.
- W4386607580 hasLocation W43866075801 @default.
- W4386607580 hasOpenAccess W4386607580 @default.
- W4386607580 hasPrimaryLocation W43866075801 @default.
- W4386607580 hasRelatedWork W2073302931 @default.
- W4386607580 hasRelatedWork W2151799802 @default.
- W4386607580 hasRelatedWork W2196562041 @default.
- W4386607580 hasRelatedWork W2964532710 @default.
- W4386607580 hasRelatedWork W2989033444 @default.
- W4386607580 hasRelatedWork W3035583586 @default.
- W4386607580 hasRelatedWork W3115951983 @default.
- W4386607580 hasRelatedWork W4221141571 @default.
- W4386607580 hasRelatedWork W4320165839 @default.
- W4386607580 hasRelatedWork W4385488510 @default.
- W4386607580 isParatext "false" @default.
- W4386607580 isRetracted "false" @default.
- W4386607580 workType "article" @default.