Matches in SemOpenAlex for { <https://semopenalex.org/work/W3037903309> ?p ?o ?g. }
- W3037903309 endingPage "439" @default.
- W3037903309 startingPage "428" @default.
- W3037903309 abstract "We explore the task of intrinsic source attribution: inferring which portions of a derived document were adapted from an unobserved source document. Specifically, we model the relationship between news articles and their press release sources using a dataset of 64,784 health science news articles and 23,068 press releases. We approach the problem at the sentence level and work with science journalism professors to develop a four point Likert scale describing the extent to which a news article sentence is derived from the content in the corresponding press release. Because manual annotation of news article - press release pairs is time-consuming, we turn to a mix of expert, non-expert, and heuristic-based annotation to label our dataset. After a small pilot study, which found that humans, when only able to view the text of the news article, struggle to identify which content is derived or not, we compare four different sentence regression models on the task. We find that modeling a sentence's context in the entire document is important, with the best performing model, a sequence regression model with BERT token representations, achieving a spearman's ρ of 0.49 and NDCG@1 of 0.60 on the expert-labeled test set. Examining the model's predictions, we find that it successfully identifies copied or closely paraphrased sentences in articles with a mix of derived and original content, but struggles to differentiate between loosely paraphrased and original sentences in articles with mostly original writing." @default.
- W3037903309 created "2020-07-02" @default.
- W3037903309 creator A5005516194 @default.
- W3037903309 creator A5025076169 @default.
- W3037903309 creator A5047474127 @default.
- W3037903309 creator A5060452633 @default.
- W3037903309 date "2020-05-26" @default.
- W3037903309 modified "2023-10-18" @default.
- W3037903309 title "Source Attribution: Recovering the Press Releases Behind Health Science News" @default.
- W3037903309 cites W131533222 @default.
- W3037903309 cites W1509629370 @default.
- W3037903309 cites W1822819589 @default.
- W3037903309 cites W1974132922 @default.
- W3037903309 cites W2069870183 @default.
- W3037903309 cites W2069985874 @default.
- W3037903309 cites W2099984129 @default.
- W3037903309 cites W2101105183 @default.
- W3037903309 cites W2110787135 @default.
- W3037903309 cites W2113329865 @default.
- W3037903309 cites W2120615054 @default.
- W3037903309 cites W2123301721 @default.
- W3037903309 cites W2127492100 @default.
- W3037903309 cites W2141799614 @default.
- W3037903309 cites W2145865597 @default.
- W3037903309 cites W2148578434 @default.
- W3037903309 cites W2148752429 @default.
- W3037903309 cites W2154652894 @default.
- W3037903309 cites W2170196196 @default.
- W3037903309 cites W2250539671 @default.
- W3037903309 cites W2251831045 @default.
- W3037903309 cites W2251935993 @default.
- W3037903309 cites W2378789977 @default.
- W3037903309 cites W2468414286 @default.
- W3037903309 cites W2525778437 @default.
- W3037903309 cites W2572185161 @default.
- W3037903309 cites W2593408211 @default.
- W3037903309 cites W2593644299 @default.
- W3037903309 cites W2759820691 @default.
- W3037903309 cites W2790166049 @default.
- W3037903309 cites W2794557536 @default.
- W3037903309 cites W2798533112 @default.
- W3037903309 cites W2896133714 @default.
- W3037903309 cites W2900471836 @default.
- W3037903309 cites W2916132663 @default.
- W3037903309 cites W2929938180 @default.
- W3037903309 cites W2945867775 @default.
- W3037903309 cites W2963341956 @default.
- W3037903309 cites W2963921497 @default.
- W3037903309 cites W2963940534 @default.
- W3037903309 cites W2970352191 @default.
- W3037903309 cites W2970641574 @default.
- W3037903309 cites W2973050267 @default.
- W3037903309 cites W3104033643 @default.
- W3037903309 cites W658020064 @default.
- W3037903309 doi "https://doi.org/10.1609/icwsm.v14i1.7312" @default.
- W3037903309 hasPublicationYear "2020" @default.
- W3037903309 type Work @default.
- W3037903309 sameAs 3037903309 @default.
- W3037903309 citedByCount "1" @default.
- W3037903309 countsByYear W30379033092021 @default.
- W3037903309 crossrefType "journal-article" @default.
- W3037903309 hasAuthorship W3037903309A5005516194 @default.
- W3037903309 hasAuthorship W3037903309A5025076169 @default.
- W3037903309 hasAuthorship W3037903309A5047474127 @default.
- W3037903309 hasAuthorship W3037903309A5060452633 @default.
- W3037903309 hasBestOaLocation W30379033091 @default.
- W3037903309 hasConcept C105776082 @default.
- W3037903309 hasConcept C105795698 @default.
- W3037903309 hasConcept C114614502 @default.
- W3037903309 hasConcept C143299363 @default.
- W3037903309 hasConcept C151730666 @default.
- W3037903309 hasConcept C154945302 @default.
- W3037903309 hasConcept C15744967 @default.
- W3037903309 hasConcept C162324750 @default.
- W3037903309 hasConcept C164226766 @default.
- W3037903309 hasConcept C169903167 @default.
- W3037903309 hasConcept C187736073 @default.
- W3037903309 hasConcept C204321447 @default.
- W3037903309 hasConcept C23123220 @default.
- W3037903309 hasConcept C2776321320 @default.
- W3037903309 hasConcept C2777267654 @default.
- W3037903309 hasConcept C2777530160 @default.
- W3037903309 hasConcept C2779343474 @default.
- W3037903309 hasConcept C2780451532 @default.
- W3037903309 hasConcept C33923547 @default.
- W3037903309 hasConcept C41008148 @default.
- W3037903309 hasConcept C77805123 @default.
- W3037903309 hasConcept C86803240 @default.
- W3037903309 hasConceptScore W3037903309C105776082 @default.
- W3037903309 hasConceptScore W3037903309C105795698 @default.
- W3037903309 hasConceptScore W3037903309C114614502 @default.
- W3037903309 hasConceptScore W3037903309C143299363 @default.
- W3037903309 hasConceptScore W3037903309C151730666 @default.
- W3037903309 hasConceptScore W3037903309C154945302 @default.
- W3037903309 hasConceptScore W3037903309C15744967 @default.
- W3037903309 hasConceptScore W3037903309C162324750 @default.
- W3037903309 hasConceptScore W3037903309C164226766 @default.
- W3037903309 hasConceptScore W3037903309C169903167 @default.