Matches in SemOpenAlex for { <https://semopenalex.org/work/W4366729322> ?p ?o ?g. }
- W4366729322 abstract "Desired model behavior often differs across contexts (e.g., different geographies, communities, or institutions), but there is little infrastructure to facilitate context-specific evaluations key to deployment decisions and building trust. Here, we present Kaleidoscope, a system for evaluating models in terms of user-driven, domain-relevant concepts. Kaleidoscope’s iterative workflow enables generalizing from a few examples into a larger, diverse set representing an important concept. These example sets can be used to test model outputs or shifts in model behavior in semantically-meaningful ways. For instance, we might construct a “xenophobic comments” set and test that its examples are more likely to be flagged by a content moderation model than a “civil discussion” set. To evaluate Kaleidoscope, we compare it against template- and DSL-based grouping methods, and conduct a usability study with 13 Reddit users testing a content moderation model. We find that Kaleidoscope facilitates iterative, exploratory hypothesis testing across diverse, conceptually-meaningful example sets." @default.
- W4366729322 created "2023-04-24" @default.
- W4366729322 creator A5005452839 @default.
- W4366729322 creator A5007282049 @default.
- W4366729322 creator A5059936375 @default.
- W4366729322 creator A5060694111 @default.
- W4366729322 creator A5071778232 @default.
- W4366729322 creator A5077783676 @default.
- W4366729322 creator A5083496477 @default.
- W4366729322 date "2023-04-19" @default.
- W4366729322 modified "2023-10-18" @default.
- W4366729322 title "Kaleidoscope: Semantically-grounded, context-specific ML model evaluation" @default.
- W4366729322 cites W2030246490 @default.
- W4366729322 cites W2141880430 @default.
- W4366729322 cites W2810857251 @default.
- W4366729322 cites W2891177506 @default.
- W4366729322 cites W2899027170 @default.
- W4366729322 cites W2920807444 @default.
- W4366729322 cites W2949858875 @default.
- W4366729322 cites W2970837303 @default.
- W4366729322 cites W2979826702 @default.
- W4366729322 cites W3002972902 @default.
- W4366729322 cites W3003809373 @default.
- W4366729322 cites W3005013146 @default.
- W4366729322 cites W3016970897 @default.
- W4366729322 cites W3035507081 @default.
- W4366729322 cites W3100279624 @default.
- W4366729322 cites W3120485916 @default.
- W4366729322 cites W3130185945 @default.
- W4366729322 cites W3134111219 @default.
- W4366729322 cites W3160037564 @default.
- W4366729322 cites W3172794097 @default.
- W4366729322 cites W4229447062 @default.
- W4366729322 cites W4230238155 @default.
- W4366729322 cites W4288083803 @default.
- W4366729322 cites W4290943938 @default.
- W4366729322 cites W4385573966 @default.
- W4366729322 doi "https://doi.org/10.1145/3544548.3581482" @default.
- W4366729322 hasPublicationYear "2023" @default.
- W4366729322 type Work @default.
- W4366729322 citedByCount "0" @default.
- W4366729322 crossrefType "proceedings-article" @default.
- W4366729322 hasAuthorship W4366729322A5005452839 @default.
- W4366729322 hasAuthorship W4366729322A5007282049 @default.
- W4366729322 hasAuthorship W4366729322A5059936375 @default.
- W4366729322 hasAuthorship W4366729322A5060694111 @default.
- W4366729322 hasAuthorship W4366729322A5071778232 @default.
- W4366729322 hasAuthorship W4366729322A5077783676 @default.
- W4366729322 hasAuthorship W4366729322A5083496477 @default.
- W4366729322 hasBestOaLocation W43667293221 @default.
- W4366729322 hasConcept C107457646 @default.
- W4366729322 hasConcept C119857082 @default.
- W4366729322 hasConcept C151730666 @default.
- W4366729322 hasConcept C154945302 @default.
- W4366729322 hasConcept C170130773 @default.
- W4366729322 hasConcept C177212765 @default.
- W4366729322 hasConcept C177264268 @default.
- W4366729322 hasConcept C183322885 @default.
- W4366729322 hasConcept C199360897 @default.
- W4366729322 hasConcept C201374245 @default.
- W4366729322 hasConcept C23123220 @default.
- W4366729322 hasConcept C2522767166 @default.
- W4366729322 hasConcept C2778037017 @default.
- W4366729322 hasConcept C2779343474 @default.
- W4366729322 hasConcept C2780801425 @default.
- W4366729322 hasConcept C2781238097 @default.
- W4366729322 hasConcept C41008148 @default.
- W4366729322 hasConcept C76155785 @default.
- W4366729322 hasConcept C77088390 @default.
- W4366729322 hasConcept C86803240 @default.
- W4366729322 hasConcept C93225998 @default.
- W4366729322 hasConceptScore W4366729322C107457646 @default.
- W4366729322 hasConceptScore W4366729322C119857082 @default.
- W4366729322 hasConceptScore W4366729322C151730666 @default.
- W4366729322 hasConceptScore W4366729322C154945302 @default.
- W4366729322 hasConceptScore W4366729322C170130773 @default.
- W4366729322 hasConceptScore W4366729322C177212765 @default.
- W4366729322 hasConceptScore W4366729322C177264268 @default.
- W4366729322 hasConceptScore W4366729322C183322885 @default.
- W4366729322 hasConceptScore W4366729322C199360897 @default.
- W4366729322 hasConceptScore W4366729322C201374245 @default.
- W4366729322 hasConceptScore W4366729322C23123220 @default.
- W4366729322 hasConceptScore W4366729322C2522767166 @default.
- W4366729322 hasConceptScore W4366729322C2778037017 @default.
- W4366729322 hasConceptScore W4366729322C2779343474 @default.
- W4366729322 hasConceptScore W4366729322C2780801425 @default.
- W4366729322 hasConceptScore W4366729322C2781238097 @default.
- W4366729322 hasConceptScore W4366729322C41008148 @default.
- W4366729322 hasConceptScore W4366729322C76155785 @default.
- W4366729322 hasConceptScore W4366729322C77088390 @default.
- W4366729322 hasConceptScore W4366729322C86803240 @default.
- W4366729322 hasConceptScore W4366729322C93225998 @default.
- W4366729322 hasLocation W43667293221 @default.
- W4366729322 hasOpenAccess W4366729322 @default.
- W4366729322 hasPrimaryLocation W43667293221 @default.
- W4366729322 hasRelatedWork W1485793233 @default.
- W4366729322 hasRelatedWork W1533929296 @default.
- W4366729322 hasRelatedWork W211986840 @default.
- W4366729322 hasRelatedWork W2312242701 @default.
- W4366729322 hasRelatedWork W261809117 @default.
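
Each match above has the shape `- subject predicate object @graph.`, with literal values in double quotes and identifiers left bare. As a minimal sketch of how such a listing could be grouped by predicate, the Python below parses lines of that shape. The regular expression and the helper names `parse_line` and `group_by_predicate` are assumptions made for illustration only; they are not part of SemOpenAlex or any of its tooling.

```python
import re
from collections import defaultdict

# Assumed line shape (inferred from the listing, not an official format):
#   - <subject> <predicate> <object> @<graph>.
LINE_RE = re.compile(r'^- (\S+) (\S+) (.+) @(\S+)\.$')


def parse_line(line):
    """Return (subject, predicate, object, graph), or None if the line does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    subject, predicate, obj, graph = match.groups()
    # Literal values are wrapped in double quotes; identifiers are bare.
    if len(obj) >= 2 and obj[0] == '"' and obj[-1] == '"':
        obj = obj[1:-1]
    return subject, predicate, obj, graph


def group_by_predicate(lines):
    """Collect object values per predicate, skipping lines that do not parse."""
    grouped = defaultdict(list)
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None:
            _, predicate, obj, _ = parsed
            grouped[predicate].append(obj)
    return dict(grouped)


# A few lines copied from the listing above.
sample = [
    '- W4366729322 created "2023-04-24" @default.',
    '- W4366729322 creator A5005452839 @default.',
    '- W4366729322 creator A5007282049 @default.',
    '- W4366729322 type Work @default.',
]
record = group_by_predicate(sample)
print(record["creator"])
```

Grouping by predicate keeps repeated properties such as `creator`, `cites`, and `hasConcept` together as lists, which matches how they recur in the listing.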