Matches in SemOpenAlex for { <https://semopenalex.org/work/W4289767364> ?p ?o ?g. }
- W4289767364 abstract "Digital point-occurrence records from the Global Biodiversity Information Facility (GBIF) and other data providers enable a wide range of research in macroecology and biogeography. However, data errors may hamper immediate use. Manual data cleaning is time-consuming and often unfeasible, given that the databases may contain thousands or millions of records. Automated data cleaning pipelines are therefore of high importance. Taking North American Ephedra as a model, we examined how different data cleaning pipelines (using, e.g., the GBIF web application, and four different R packages) affect downstream species distribution models (SDMs). We also assessed how data differed from expert data. From 13,889 North American Ephedra observations in GBIF, the pipelines removed 31.7% to 62.7% false positives, invalid coordinates, and duplicates, leading to datasets between 9484 (GBIF application) and 5196 records (manual-guided filtering). The expert data consisted of 704 records, comparable to data from field studies. Although differences in the absolute numbers of records were relatively large, species richness models based on stacked SDMs (S-SDM) from pipeline and expert data were strongly correlated (mean Pearson's r across the pipelines: .9986, vs. the expert data: .9173). Our results suggest that all R package-based pipelines reliably identified invalid coordinates. In contrast, the GBIF-filtered data still contained both spatial and taxonomic errors. Major drawbacks emerge from the fact that no pipeline fully discovered misidentified specimens without the assistance of taxonomic expert knowledge. We conclude that application-filtered GBIF data will still need additional review to achieve higher spatial data quality. Achieving high-quality taxonomic data will require extra effort, probably by thoroughly analyzing the data for misidentified taxa, supported by experts." @default.
- W4289767364 created "2022-08-04" @default.
- W4289767364 creator A5043612479 @default.
- W4289767364 creator A5052825170 @default.
- W4289767364 creator A5082734087 @default.
- W4289767364 date "2022-08-01" @default.
- W4289767364 modified "2023-09-27" @default.
- W4289767364 title "Influence of different data cleaning solutions of point‐occurrence records on downstream macroecological diversity models" @default.
- W4289767364 cites W1144607668 @default.
- W4289767364 cites W1549716303 @default.
- W4289767364 cites W1848926566 @default.
- W4289767364 cites W1991784289 @default.
- W4289767364 cites W2004610394 @default.
- W4289767364 cites W2010663300 @default.
- W4289767364 cites W2020705682 @default.
- W4289767364 cites W2035791470 @default.
- W4289767364 cites W2051342072 @default.
- W4289767364 cites W2053351790 @default.
- W4289767364 cites W2069833836 @default.
- W4289767364 cites W2081036027 @default.
- W4289767364 cites W2107782151 @default.
- W4289767364 cites W2121744618 @default.
- W4289767364 cites W2135004835 @default.
- W4289767364 cites W2145126338 @default.
- W4289767364 cites W2146081698 @default.
- W4289767364 cites W2159327312 @default.
- W4289767364 cites W2179969133 @default.
- W4289767364 cites W2239138544 @default.
- W4289767364 cites W2239317424 @default.
- W4289767364 cites W2439879804 @default.
- W4289767364 cites W2494322157 @default.
- W4289767364 cites W2552708329 @default.
- W4289767364 cites W2738345813 @default.
- W4289767364 cites W2741261571 @default.
- W4289767364 cites W2891769525 @default.
- W4289767364 cites W2900235934 @default.
- W4289767364 cites W2910713426 @default.
- W4289767364 cites W2916425859 @default.
- W4289767364 cites W2921015294 @default.
- W4289767364 cites W2958856141 @default.
- W4289767364 cites W2970707515 @default.
- W4289767364 cites W2990427812 @default.
- W4289767364 cites W3090077579 @default.
- W4289767364 cites W3104895181 @default.
- W4289767364 cites W3153999239 @default.
- W4289767364 cites W4241341277 @default.
- W4289767364 doi "https://doi.org/10.1002/ece3.9168" @default.
- W4289767364 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/35949539" @default.
- W4289767364 hasPublicationYear "2022" @default.
- W4289767364 type Work @default.
- W4289767364 citedByCount "3" @default.
- W4289767364 countsByYear W42897673642022 @default.
- W4289767364 countsByYear W42897673642023 @default.
- W4289767364 crossrefType "journal-article" @default.
- W4289767364 hasAuthorship W4289767364A5043612479 @default.
- W4289767364 hasAuthorship W4289767364A5052825170 @default.
- W4289767364 hasAuthorship W4289767364A5082734087 @default.
- W4289767364 hasBestOaLocation W42897673643 @default.
- W4289767364 hasConcept C119857082 @default.
- W4289767364 hasConcept C124101348 @default.
- W4289767364 hasConcept C127413603 @default.
- W4289767364 hasConcept C176217482 @default.
- W4289767364 hasConcept C199360897 @default.
- W4289767364 hasConcept C21547014 @default.
- W4289767364 hasConcept C24756922 @default.
- W4289767364 hasConcept C41008148 @default.
- W4289767364 hasConcept C43521106 @default.
- W4289767364 hasConcept C64869954 @default.
- W4289767364 hasConceptScore W4289767364C119857082 @default.
- W4289767364 hasConceptScore W4289767364C124101348 @default.
- W4289767364 hasConceptScore W4289767364C127413603 @default.
- W4289767364 hasConceptScore W4289767364C176217482 @default.
- W4289767364 hasConceptScore W4289767364C199360897 @default.
- W4289767364 hasConceptScore W4289767364C21547014 @default.
- W4289767364 hasConceptScore W4289767364C24756922 @default.
- W4289767364 hasConceptScore W4289767364C41008148 @default.
- W4289767364 hasConceptScore W4289767364C43521106 @default.
- W4289767364 hasConceptScore W4289767364C64869954 @default.
- W4289767364 hasIssue "8" @default.
- W4289767364 hasLocation W42897673641 @default.
- W4289767364 hasLocation W42897673642 @default.
- W4289767364 hasLocation W42897673643 @default.
- W4289767364 hasLocation W42897673644 @default.
- W4289767364 hasLocation W42897673645 @default.
- W4289767364 hasLocation W42897673646 @default.
- W4289767364 hasOpenAccess W4289767364 @default.
- W4289767364 hasPrimaryLocation W42897673641 @default.
- W4289767364 hasRelatedWork W1506576323 @default.
- W4289767364 hasRelatedWork W2039534605 @default.
- W4289767364 hasRelatedWork W2360883279 @default.
- W4289767364 hasRelatedWork W2560284304 @default.
- W4289767364 hasRelatedWork W2981087920 @default.
- W4289767364 hasRelatedWork W2992516105 @default.
- W4289767364 hasRelatedWork W3081509258 @default.
- W4289767364 hasRelatedWork W3120899676 @default.
- W4289767364 hasRelatedWork W3186188717 @default.
- W4289767364 hasRelatedWork W3210635025 @default.
- W4289767364 hasVolume "12" @default.
- W4289767364 isParatext "false" @default.
- W4289767364 isRetracted "false" @default.