Matches in SemOpenAlex for { <https://semopenalex.org/work/W4379911458> ?p ?o ?g. }
Showing items 1 to 86 of 86 with 100 items per page.
- W4379911458 endingPage "7" @default.
- W4379911458 startingPage "1" @default.
- W4379911458 abstract "A standard text-as-data workflow in the social sciences involves identifying a set of documents to be labeled, selecting a random sample of them to label using research assistants, training a supervised learner to label the remaining documents, and validating that model’s performance using standard accuracy metrics. The most resource-intensive component of this is the hand-labeling: carefully reading documents, training research assistants, and paying human coders to label documents in duplicate or more. We show that hand-coding an algorithmically selected rather than a simple-random sample can improve model performance above baseline by as much as 50%, or reduce hand-coding costs by up to two-thirds, in applications predicting (1) U.S. executive-order significance and (2) financial sentiment on social media. We accompany this manuscript with open-source software to implement these tools, which we hope can make supervised learning cheaper and more accessible to researchers." @default.
- W4379911458 created "2023-06-09" @default.
- W4379911458 creator A5055539519 @default.
- W4379911458 date "2023-06-08" @default.
- W4379911458 modified "2023-09-25" @default.
- W4379911458 title "Selecting More Informative Training Sets with Fewer Observations" @default.
- W4379911458 cites W2053978350 @default.
- W4379911458 cites W2095655043 @default.
- W4379911458 cites W2111222556 @default.
- W4379911458 cites W2557738935 @default.
- W4379911458 cites W2755046603 @default.
- W4379911458 cites W2799051560 @default.
- W4379911458 cites W2809234980 @default.
- W4379911458 cites W2888294514 @default.
- W4379911458 cites W2891649320 @default.
- W4379911458 cites W2962907576 @default.
- W4379911458 cites W2994684270 @default.
- W4379911458 cites W2997668629 @default.
- W4379911458 cites W3020745670 @default.
- W4379911458 cites W3124321814 @default.
- W4379911458 cites W3134241185 @default.
- W4379911458 cites W3134427152 @default.
- W4379911458 cites W3159574466 @default.
- W4379911458 cites W4205599725 @default.
- W4379911458 cites W4206708978 @default.
- W4379911458 cites W4245728648 @default.
- W4379911458 cites W632139601 @default.
- W4379911458 doi "https://doi.org/10.1017/pan.2023.19" @default.
- W4379911458 hasPublicationYear "2023" @default.
- W4379911458 type Work @default.
- W4379911458 citedByCount "0" @default.
- W4379911458 crossrefType "journal-article" @default.
- W4379911458 hasAuthorship W4379911458A5055539519 @default.
- W4379911458 hasBestOaLocation W43799114581 @default.
- W4379911458 hasConcept C105795698 @default.
- W4379911458 hasConcept C119857082 @default.
- W4379911458 hasConcept C154945302 @default.
- W4379911458 hasConcept C177212765 @default.
- W4379911458 hasConcept C179518139 @default.
- W4379911458 hasConcept C185592680 @default.
- W4379911458 hasConcept C198531522 @default.
- W4379911458 hasConcept C199360897 @default.
- W4379911458 hasConcept C204321447 @default.
- W4379911458 hasConcept C23123220 @default.
- W4379911458 hasConcept C2522767166 @default.
- W4379911458 hasConcept C2777904410 @default.
- W4379911458 hasConcept C3018397939 @default.
- W4379911458 hasConcept C33923547 @default.
- W4379911458 hasConcept C41008148 @default.
- W4379911458 hasConcept C43617362 @default.
- W4379911458 hasConcept C77088390 @default.
- W4379911458 hasConceptScore W4379911458C105795698 @default.
- W4379911458 hasConceptScore W4379911458C119857082 @default.
- W4379911458 hasConceptScore W4379911458C154945302 @default.
- W4379911458 hasConceptScore W4379911458C177212765 @default.
- W4379911458 hasConceptScore W4379911458C179518139 @default.
- W4379911458 hasConceptScore W4379911458C185592680 @default.
- W4379911458 hasConceptScore W4379911458C198531522 @default.
- W4379911458 hasConceptScore W4379911458C199360897 @default.
- W4379911458 hasConceptScore W4379911458C204321447 @default.
- W4379911458 hasConceptScore W4379911458C23123220 @default.
- W4379911458 hasConceptScore W4379911458C2522767166 @default.
- W4379911458 hasConceptScore W4379911458C2777904410 @default.
- W4379911458 hasConceptScore W4379911458C3018397939 @default.
- W4379911458 hasConceptScore W4379911458C33923547 @default.
- W4379911458 hasConceptScore W4379911458C41008148 @default.
- W4379911458 hasConceptScore W4379911458C43617362 @default.
- W4379911458 hasConceptScore W4379911458C77088390 @default.
- W4379911458 hasLocation W43799114581 @default.
- W4379911458 hasOpenAccess W4379911458 @default.
- W4379911458 hasPrimaryLocation W43799114581 @default.
- W4379911458 hasRelatedWork W1102762066 @default.
- W4379911458 hasRelatedWork W2081035100 @default.
- W4379911458 hasRelatedWork W2158645158 @default.
- W4379911458 hasRelatedWork W2376314740 @default.
- W4379911458 hasRelatedWork W2384888906 @default.
- W4379911458 hasRelatedWork W2961085424 @default.
- W4379911458 hasRelatedWork W3202169202 @default.
- W4379911458 hasRelatedWork W3206324740 @default.
- W4379911458 hasRelatedWork W4306674287 @default.
- W4379911458 hasRelatedWork W4224009465 @default.
- W4379911458 isParatext "false" @default.
- W4379911458 isRetracted "false" @default.
- W4379911458 workType "article" @default.
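
Each row above follows the shape `- subject predicate object @graph.`, matching the `?p ?o ?g` pattern in the query. As a minimal sketch, assuming that row shape holds for every item, the listing could be read into tuples and grouped by predicate. The names `ROW`, `parse_row`, and `group_by_predicate` are hypothetical helpers for illustration and are not part of SemOpenAlex.

```python
import re
from collections import defaultdict

# Hypothetical pattern (an assumption about this listing's layout):
#   - <subject> <predicate> <object> @<graph>.
ROW = re.compile(r'^- (\S+) (\S+) (.+) @(\S+)\.$')


def parse_row(line):
    """Return (subject, predicate, object, graph), or None if the line is not a row."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    subject, predicate, obj, graph = match.groups()
    # Literal values are shown in double quotes; strip them to get plain text.
    if len(obj) >= 2 and obj.startswith('"') and obj.endswith('"'):
        obj = obj[1:-1]
    return subject, predicate, obj, graph


def group_by_predicate(lines):
    """Collect object values under each predicate, skipping non-row lines."""
    grouped = defaultdict(list)
    for line in lines:
        row = parse_row(line)
        if row is not None:
            grouped[row[1]].append(row[2])
    return dict(grouped)


# A few rows copied from the listing above.
sample = [
    '- W4379911458 startingPage "1" @default.',
    '- W4379911458 cites W2053978350 @default.',
    '- W4379911458 cites W2095655043 @default.',
    '- W4379911458 type Work @default.',
]
grouped = group_by_predicate(sample)
print(grouped["cites"])
```

Grouping by predicate makes repeated properties such as `cites` or `hasConcept` available as lists, while single-valued properties such as `startingPage` become one-element lists.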