Matches in SemOpenAlex for { <https://semopenalex.org/work/W3178585493> ?p ?o ?g. }
- W3178585493 endingPage "e0254007" @default.
- W3178585493 startingPage "e0254007" @default.
- W3178585493 abstract "Automated monitoring of websites that trade wildlife is increasingly necessary to inform conservation and biosecurity efforts. However, e-commerce and wildlife trading websites can contain a vast number of advertisements, an unknown proportion of which may be irrelevant to researchers and practitioners. Given that many wildlife-trade advertisements have an unstructured text format, automated identification of relevant listings has not traditionally been possible, nor attempted. Other scientific disciplines have solved similar problems using machine learning and natural language processing models, such as text classifiers. Here, we test the ability of a suite of text classifiers to extract relevant advertisements from wildlife trade occurring on the Internet. We collected data from an Australian classifieds website where people can post advertisements of their pet birds (n = 16.5k advertisements). We found that text classifiers can predict, with a high degree of accuracy, which listings are relevant (ROC AUC ≥ 0.98, F1 score ≥ 0.77). Furthermore, in an attempt to answer the question ‘how much data is required to have an adequately performing model?’, we conducted a sensitivity analysis by simulating decreases in sample sizes to measure the subsequent change in model performance. From our sensitivity analysis, we found that text classifiers required a minimum sample size of 33% (c. 5.5k listings) to accurately identify relevant listings (for our dataset), providing a reference point for future applications of this sort. Our results suggest that text classification is a viable tool that can be applied to the online trade of wildlife to reduce time dedicated to data cleaning. However, the success of text classifiers will vary depending on the advertisements and websites, and will therefore be context dependent. Further work to integrate other machine learning tools, such as image classification, may provide better predictive abilities in the context of streamlining data processing for wildlife trade related online data." @default.
- W3178585493 created "2021-07-19" @default.
- W3178585493 creator A5000213926 @default.
- W3178585493 creator A5004424630 @default.
- W3178585493 creator A5016100998 @default.
- W3178585493 creator A5026776788 @default.
- W3178585493 creator A5055805371 @default.
- W3178585493 creator A5073900218 @default.
- W3178585493 creator A5089061217 @default.
- W3178585493 date "2021-07-09" @default.
- W3178585493 modified "2023-10-17" @default.
- W3178585493 title "Text classification to streamline online wildlife trade analyses" @default.
- W3178585493 cites W1037504319 @default.
- W3178585493 cites W2125943921 @default.
- W3178585493 cites W2169955508 @default.
- W3178585493 cites W2301833477 @default.
- W3178585493 cites W2465750682 @default.
- W3178585493 cites W2589170546 @default.
- W3178585493 cites W2769210209 @default.
- W3178585493 cites W2789890555 @default.
- W3178585493 cites W2888190345 @default.
- W3178585493 cites W2894443252 @default.
- W3178585493 cites W2977474077 @default.
- W3178585493 cites W2980008747 @default.
- W3178585493 cites W3015226025 @default.
- W3178585493 cites W3027478280 @default.
- W3178585493 cites W327991062 @default.
- W3178585493 cites W4230096730 @default.
- W3178585493 doi "https://doi.org/10.1371/journal.pone.0254007" @default.
- W3178585493 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/8270201" @default.
- W3178585493 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/34242279" @default.
- W3178585493 hasPublicationYear "2021" @default.
- W3178585493 type Work @default.
- W3178585493 sameAs 3178585493 @default.
- W3178585493 citedByCount "7" @default.
- W3178585493 countsByYear W31785854932022 @default.
- W3178585493 countsByYear W31785854932023 @default.
- W3178585493 crossrefType "journal-article" @default.
- W3178585493 hasAuthorship W3178585493A5000213926 @default.
- W3178585493 hasAuthorship W3178585493A5004424630 @default.
- W3178585493 hasAuthorship W3178585493A5016100998 @default.
- W3178585493 hasAuthorship W3178585493A5026776788 @default.
- W3178585493 hasAuthorship W3178585493A5055805371 @default.
- W3178585493 hasAuthorship W3178585493A5073900218 @default.
- W3178585493 hasAuthorship W3178585493A5089061217 @default.
- W3178585493 hasBestOaLocation W31785854931 @default.
- W3178585493 hasConcept C110875604 @default.
- W3178585493 hasConcept C116834253 @default.
- W3178585493 hasConcept C119857082 @default.
- W3178585493 hasConcept C136764020 @default.
- W3178585493 hasConcept C154945302 @default.
- W3178585493 hasConcept C185592680 @default.
- W3178585493 hasConcept C18903297 @default.
- W3178585493 hasConcept C198531522 @default.
- W3178585493 hasConcept C204321447 @default.
- W3178585493 hasConcept C23123220 @default.
- W3178585493 hasConcept C2522767166 @default.
- W3178585493 hasConcept C2776605222 @default.
- W3178585493 hasConcept C29376679 @default.
- W3178585493 hasConcept C41008148 @default.
- W3178585493 hasConcept C43617362 @default.
- W3178585493 hasConcept C86803240 @default.
- W3178585493 hasConcept C88548561 @default.
- W3178585493 hasConceptScore W3178585493C110875604 @default.
- W3178585493 hasConceptScore W3178585493C116834253 @default.
- W3178585493 hasConceptScore W3178585493C119857082 @default.
- W3178585493 hasConceptScore W3178585493C136764020 @default.
- W3178585493 hasConceptScore W3178585493C154945302 @default.
- W3178585493 hasConceptScore W3178585493C185592680 @default.
- W3178585493 hasConceptScore W3178585493C18903297 @default.
- W3178585493 hasConceptScore W3178585493C198531522 @default.
- W3178585493 hasConceptScore W3178585493C204321447 @default.
- W3178585493 hasConceptScore W3178585493C23123220 @default.
- W3178585493 hasConceptScore W3178585493C2522767166 @default.
- W3178585493 hasConceptScore W3178585493C2776605222 @default.
- W3178585493 hasConceptScore W3178585493C29376679 @default.
- W3178585493 hasConceptScore W3178585493C41008148 @default.
- W3178585493 hasConceptScore W3178585493C43617362 @default.
- W3178585493 hasConceptScore W3178585493C86803240 @default.
- W3178585493 hasConceptScore W3178585493C88548561 @default.
- W3178585493 hasIssue "7" @default.
- W3178585493 hasLocation W31785854931 @default.
- W3178585493 hasLocation W31785854932 @default.
- W3178585493 hasOpenAccess W3178585493 @default.
- W3178585493 hasPrimaryLocation W31785854931 @default.
- W3178585493 hasRelatedWork W2066983820 @default.
- W3178585493 hasRelatedWork W2182984660 @default.
- W3178585493 hasRelatedWork W2390613933 @default.
- W3178585493 hasRelatedWork W2495143571 @default.
- W3178585493 hasRelatedWork W2756548619 @default.
- W3178585493 hasRelatedWork W2975654092 @default.
- W3178585493 hasRelatedWork W3000378012 @default.
- W3178585493 hasRelatedWork W3169168350 @default.
- W3178585493 hasRelatedWork W4283464443 @default.
- W3178585493 hasRelatedWork W4385958714 @default.
- W3178585493 hasVolume "16" @default.
- W3178585493 isParatext "false" @default.
- W3178585493 isRetracted "false" @default.
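
The matches above follow the shape of the quad pattern in the header (`?p ?o ?g`): each line carries a subject, a predicate, an object and a graph name. As a minimal sketch, assuming every entry sits on a single line of the form `- SUBJECT PREDICATE OBJECT @GRAPH.`, the listing could be read into tuples as shown below. The names `LINE_RE`, `parse_line` and `group_by_predicate` are hypothetical helpers introduced for illustration; they are not part of SemOpenAlex or any library.

```python
import re
from collections import defaultdict

# Assumed line shape (inferred from the listing, not from a specification):
#   - SUBJECT PREDICATE OBJECT @GRAPH.
# The object may contain spaces, so it is matched greedily up to the last " @".
LINE_RE = re.compile(r'^- (\S+) (\S+) (.+) @(\S+)\.$', re.DOTALL)


def parse_line(line):
    """Return (subject, predicate, object, graph), or None if the line does not match."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    subject, predicate, obj, graph = match.groups()
    # Literals are wrapped in double quotes; identifiers such as Work are not.
    if len(obj) >= 2 and obj.startswith('"') and obj.endswith('"'):
        obj = obj[1:-1]
    return subject, predicate, obj, graph


def group_by_predicate(lines):
    """Collect object values per predicate, skipping lines that do not parse."""
    grouped = defaultdict(list)
    for line in lines:
        parsed = parse_line(line)
        if parsed is not None:
            _, predicate, obj, _ = parsed
            grouped[predicate].append(obj)
    return dict(grouped)


if __name__ == "__main__":
    sample = [
        '- W3178585493 hasVolume "16" @default.',
        '- W3178585493 cites W1037504319 @default.',
        '- W3178585493 cites W2125943921 @default.',
    ]
    print(group_by_predicate(sample))
```

Grouping by predicate keeps repeated properties such as `cites` or `hasConcept` together as lists, which may be more convenient than a flat list of tuples when inspecting a single work.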