Matches in SemOpenAlex for { <https://semopenalex.org/work/W240215681> ?p ?o ?g. }
Showing items 1 to 79 of 79 with 100 items per page.
- W240215681 abstract "In this position paper, I argue that standardized tests for elementary science such as SAT or Regents tests are not very good benchmarks for measuring the progress of artificial intelligence systems in understanding basic science. The primary problem is that these tests are designed to test aspects of knowledge and ability that are challenging for people; the aspects that are challenging for AI systems are very different. In particular, standardized tests do not test knowledge that is obvious for people; none of this knowledge can be assumed in AI systems. Individual standardized tests also have specific features that are not necessarily appropriate for an AI benchmark. I analyze the Physics subject SAT in some detail and the New York State Regents Science test more briefly. I also argue that the apparent advantages offered by using standardized tests are mostly either minor or illusory. The one major real advantage is that the significance is easily explained to the public; but I argue that even this is a somewhat mixed blessing. I conclude by arguing that, first, more appropriate collections of exam-style problems could be assembled, and second, that there are better kinds of benchmarks than exam-style problems. In an appendix I present a collection of sample exam-style problems that test kinds of knowledge missing from the standardized tests." @default.
- W240215681 created "2016-06-24" @default.
- W240215681 creator A5010452641 @default.
- W240215681 date "2014-11-06" @default.
- W240215681 modified "2023-09-27" @default.
- W240215681 title "The Limitations of Standardized Science Tests as Benchmarks for Artificial Intelligence Research: Position Paper." @default.
- W240215681 cites W1599016936 @default.
- W240215681 cites W2011945332 @default.
- W240215681 cites W2116051222 @default.
- W240215681 cites W2121036250 @default.
- W240215681 cites W2124567236 @default.
- W240215681 cites W2187018325 @default.
- W240215681 cites W97534101 @default.
- W240215681 hasPublicationYear "2014" @default.
- W240215681 type Work @default.
- W240215681 sameAs 240215681 @default.
- W240215681 citedByCount "3" @default.
- W240215681 countsByYear W2402156812015 @default.
- W240215681 countsByYear W2402156812018 @default.
- W240215681 countsByYear W2402156812020 @default.
- W240215681 crossrefType "posted-content" @default.
- W240215681 hasAuthorship W240215681A5010452641 @default.
- W240215681 hasConcept C13280743 @default.
- W240215681 hasConcept C145420912 @default.
- W240215681 hasConcept C151730666 @default.
- W240215681 hasConcept C154945302 @default.
- W240215681 hasConcept C15744967 @default.
- W240215681 hasConcept C166957645 @default.
- W240215681 hasConcept C185798385 @default.
- W240215681 hasConcept C203151758 @default.
- W240215681 hasConcept C205649164 @default.
- W240215681 hasConcept C2776195157 @default.
- W240215681 hasConcept C2777267654 @default.
- W240215681 hasConcept C41008148 @default.
- W240215681 hasConcept C81369262 @default.
- W240215681 hasConcept C86803240 @default.
- W240215681 hasConcept C95457728 @default.
- W240215681 hasConceptScore W240215681C13280743 @default.
- W240215681 hasConceptScore W240215681C145420912 @default.
- W240215681 hasConceptScore W240215681C151730666 @default.
- W240215681 hasConceptScore W240215681C154945302 @default.
- W240215681 hasConceptScore W240215681C15744967 @default.
- W240215681 hasConceptScore W240215681C166957645 @default.
- W240215681 hasConceptScore W240215681C185798385 @default.
- W240215681 hasConceptScore W240215681C203151758 @default.
- W240215681 hasConceptScore W240215681C205649164 @default.
- W240215681 hasConceptScore W240215681C2776195157 @default.
- W240215681 hasConceptScore W240215681C2777267654 @default.
- W240215681 hasConceptScore W240215681C41008148 @default.
- W240215681 hasConceptScore W240215681C81369262 @default.
- W240215681 hasConceptScore W240215681C86803240 @default.
- W240215681 hasConceptScore W240215681C95457728 @default.
- W240215681 hasLocation W2402156811 @default.
- W240215681 hasOpenAccess W240215681 @default.
- W240215681 hasPrimaryLocation W2402156811 @default.
- W240215681 hasRelatedWork W106153781 @default.
- W240215681 hasRelatedWork W160150769 @default.
- W240215681 hasRelatedWork W1963800794 @default.
- W240215681 hasRelatedWork W1969175074 @default.
- W240215681 hasRelatedWork W1981319333 @default.
- W240215681 hasRelatedWork W199703059 @default.
- W240215681 hasRelatedWork W2011451095 @default.
- W240215681 hasRelatedWork W2042718003 @default.
- W240215681 hasRelatedWork W2084398551 @default.
- W240215681 hasRelatedWork W2282336683 @default.
- W240215681 hasRelatedWork W2372877066 @default.
- W240215681 hasRelatedWork W2413531407 @default.
- W240215681 hasRelatedWork W2460629147 @default.
- W240215681 hasRelatedWork W251352089 @default.
- W240215681 hasRelatedWork W2553081798 @default.
- W240215681 hasRelatedWork W2561083987 @default.
- W240215681 hasRelatedWork W2975878921 @default.
- W240215681 hasRelatedWork W3004757630 @default.
- W240215681 hasRelatedWork W3124956383 @default.
- W240215681 hasRelatedWork W91073433 @default.
- W240215681 isParatext "false" @default.
- W240215681 isRetracted "false" @default.
- W240215681 magId "240215681" @default.
- W240215681 workType "article" @default.
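
Each row above follows the shape "- subject predicate object @graph.", so the listing can be grouped by predicate with a few lines of Python. The sketch below is only an illustration: the regular expression `ROW` and the helper `parse_rows` are hypothetical names introduced here, not part of SemOpenAlex, and the pattern assumes that literals are wrapped in double quotes and that each row fits on one line.

```python
import re
from collections import defaultdict

# Assumed row shape: "- <subject> <predicate> <object> @<graph>."
# The object is either a double-quoted literal or a bare token.
ROW = re.compile(r'^- (\S+) (\S+) (".*"|\S+) @(\S+)\.$')


def parse_rows(lines):
    """Group object values by predicate for every line that matches ROW."""
    grouped = defaultdict(list)
    for line in lines:
        match = ROW.match(line.strip())
        if match is None:
            continue  # skip header and pagination lines
        _subject, predicate, obj, _graph = match.groups()
        grouped[predicate].append(obj.strip('"'))
    return dict(grouped)


# A few rows copied from the listing above.
sample = [
    'Matches in SemOpenAlex for { <https://semopenalex.org/work/W240215681> ?p ?o ?g. }',
    '- W240215681 date "2014-11-06" @default.',
    '- W240215681 cites W1599016936 @default.',
    '- W240215681 cites W2011945332 @default.',
    '- W240215681 citedByCount "3" @default.',
]
rows = parse_rows(sample)
print(rows["cites"])
```

Grouping by predicate keeps multi-valued properties such as `cites` or `hasRelatedWork` together as lists, while single-valued ones such as `date` become one-element lists.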