Matches in SemOpenAlex for { <https://semopenalex.org/work/W3090893089> ?p ?o ?g. }
Showing items 1 to 74 of
74
with 100 items per page.
- W3090893089 abstract "Most biological data and knowledge are directly or indirectly linked to biological taxa via taxon names. Using taxon names is one of the most fundamental and ubiquitous ways in which a wide range of biological data are integrated, aggregated, and indexed, from genomic and microbial diversity to macro-ecological data. To this day, the names used, as well as most methods and resources developed for this purpose, are drawn from Linnaean nomenclature. This leads to numerous problems when applied to data-intensive science that depends on computation to take full advantage of the vast – and rapidly increasing – amount of available digital biodiversity data. The theoretical and practical complexities of reconciling taxon names and concepts has plagued the systematics community for decades and now more than ever before, Linnaean names based in Linnaean taxonomy, by far the most prevalent means of linking data to taxa, are unfit for the age of computation-driven data science, due to fundamental theoretical and practical shortfalls that cannot be cured. We propose an alternate approach based on the use of phylogenetic clade definitions, which is a well-developed method for unambiguously defining the semantics of a clade concept in terms of shared evolutionary ancestry (de Queiroz and Gauthier 1990, de Queiroz and Gauthier 1994). These semantics allow locating the defined clade on any phylogeny, or showing that a clade is inconsistent with the topology of a given phylogeny and hence cannot be present on it at all. We have built a workflow for defining phylogenetic clade definitions in terms of shared ancestor and excluded lineage properties, and locating these definitions on any input phylogeny. Once these definitions have been located, we can use the list of species found within that clade on that phylogeny in order to aggregate occurrence data from the Global Biodiversity Information Facility (GBIF). Thus, our approach uses clade definitions with machine-understandable semantics to programmatically and reproducibly aggregate biodiversity data by higher-level taxonomic concepts. This approach has several advantages over the use of taxonomic hierarchies: Unlike taxa, the semantics of clade definitions can be expressed in unambiguous, machine-understandable and reproducible terms and language. The resolution of a given clade definition will depend on the phylogeny being used. Thus, if the phylogeny of groups of interest is updated in light of new evolutionary knowledge, the clade definition can be applied to the new phylogeny to obtain an updated list of clade members consistent with the updated evolutionary knowledge. Machine reproducibility of analyses is possible simply by archiving the machine-readable representations of the clade definition and the phylogeny being used. Unlike taxa, the semantics of clade definitions can be expressed in unambiguous, machine-understandable and reproducible terms and language. The resolution of a given clade definition will depend on the phylogeny being used. Thus, if the phylogeny of groups of interest is updated in light of new evolutionary knowledge, the clade definition can be applied to the new phylogeny to obtain an updated list of clade members consistent with the updated evolutionary knowledge. Machine reproducibility of analyses is possible simply by archiving the machine-readable representations of the clade definition and the phylogeny being used. Clade definitions can be created by biologists as needed or can be reused from those published in peer-reviewed journals. In addition, nearly 300 peer-reviewed clade definitions were recently published as part of the Phylonym volume of the PhyloCode (de Queiroz et al. 2020) and are now available on the Regnum website. As part of the Phyloreferencing Project, we digitize this collection as a machine-readable ontology, where each clade is represented as a class defined by logical conjunctions for class membership, corresponding to a set of necessary and sufficient conditions of shared or divergent evolutionary ancestry. We call these classes phyloreferences, and have created a fully automated workflow for digitizing the Regnum database content into an OWL ontology (W3C OWL Working Group 2012) that we call the Clade Ontology. This ontology includes reference phylogenies and additional metadata about the verbatim clade definitions. Once complete, the Clade Ontology will include all clade definitions from RegNum, both those included in Phylonym after passing peer-review, and those contributed by the community, whether or not under the PhyloCode nomenclature. As an openly available community resource, this will allow researchers to use them to aggregate biodiversity data for comparative biology with grouping semantics that are transparent, machine-processable, and reproducible. In our presentation, we will demonstrate the use of phyloreferences to locate clades on the Open Tree of Life synthetic tree (Hinchliff et al. 2015), to retrieve lists of species in each clade, and to use them to find and aggregate occurrence records in GBIF. We will also describe the workflow we are currently using to build and test the Clade Ontology, and describe our plans for publishing this resource. Finally, we will discuss the advantages and disadvantages of this approach as compared to taxonomic checklists." @default.
- W3090893089 created "2020-10-08" @default.
- W3090893089 creator A5005484581 @default.
- W3090893089 creator A5053456134 @default.
- W3090893089 creator A5055867064 @default.
- W3090893089 date "2020-10-02" @default.
- W3090893089 modified "2023-09-23" @default.
- W3090893089 title "Enabling Machines to Integrate Biodiversity Data with Evolutionary Knowledge" @default.
- W3090893089 cites W2004477458 @default.
- W3090893089 cites W2088138371 @default.
- W3090893089 cites W2951002538 @default.
- W3090893089 cites W2991178345 @default.
- W3090893089 doi "https://doi.org/10.3897/biss.4.59088" @default.
- W3090893089 hasPublicationYear "2020" @default.
- W3090893089 type Work @default.
- W3090893089 sameAs 3090893089 @default.
- W3090893089 citedByCount "0" @default.
- W3090893089 crossrefType "journal-article" @default.
- W3090893089 hasAuthorship W3090893089A5005484581 @default.
- W3090893089 hasAuthorship W3090893089A5053456134 @default.
- W3090893089 hasAuthorship W3090893089A5055867064 @default.
- W3090893089 hasBestOaLocation W30908930891 @default.
- W3090893089 hasConcept C104317684 @default.
- W3090893089 hasConcept C130217890 @default.
- W3090893089 hasConcept C177212765 @default.
- W3090893089 hasConcept C18903297 @default.
- W3090893089 hasConcept C193252679 @default.
- W3090893089 hasConcept C198260499 @default.
- W3090893089 hasConcept C2522767166 @default.
- W3090893089 hasConcept C41008148 @default.
- W3090893089 hasConcept C41806617 @default.
- W3090893089 hasConcept C44465124 @default.
- W3090893089 hasConcept C55493867 @default.
- W3090893089 hasConcept C58642233 @default.
- W3090893089 hasConcept C71640776 @default.
- W3090893089 hasConcept C77088390 @default.
- W3090893089 hasConcept C78458016 @default.
- W3090893089 hasConcept C86803240 @default.
- W3090893089 hasConceptScore W3090893089C104317684 @default.
- W3090893089 hasConceptScore W3090893089C130217890 @default.
- W3090893089 hasConceptScore W3090893089C177212765 @default.
- W3090893089 hasConceptScore W3090893089C18903297 @default.
- W3090893089 hasConceptScore W3090893089C193252679 @default.
- W3090893089 hasConceptScore W3090893089C198260499 @default.
- W3090893089 hasConceptScore W3090893089C2522767166 @default.
- W3090893089 hasConceptScore W3090893089C41008148 @default.
- W3090893089 hasConceptScore W3090893089C41806617 @default.
- W3090893089 hasConceptScore W3090893089C44465124 @default.
- W3090893089 hasConceptScore W3090893089C55493867 @default.
- W3090893089 hasConceptScore W3090893089C58642233 @default.
- W3090893089 hasConceptScore W3090893089C71640776 @default.
- W3090893089 hasConceptScore W3090893089C77088390 @default.
- W3090893089 hasConceptScore W3090893089C78458016 @default.
- W3090893089 hasConceptScore W3090893089C86803240 @default.
- W3090893089 hasFunder F4320306076 @default.
- W3090893089 hasLocation W30908930891 @default.
- W3090893089 hasLocation W30908930892 @default.
- W3090893089 hasOpenAccess W3090893089 @default.
- W3090893089 hasPrimaryLocation W30908930891 @default.
- W3090893089 hasRelatedWork W1988278889 @default.
- W3090893089 hasRelatedWork W2006146801 @default.
- W3090893089 hasRelatedWork W2023247886 @default.
- W3090893089 hasRelatedWork W2076732450 @default.
- W3090893089 hasRelatedWork W2124086049 @default.
- W3090893089 hasRelatedWork W2754231444 @default.
- W3090893089 hasRelatedWork W4226237665 @default.
- W3090893089 hasRelatedWork W4237753771 @default.
- W3090893089 hasRelatedWork W4239077547 @default.
- W3090893089 hasRelatedWork W4244972855 @default.
- W3090893089 hasVolume "4" @default.
- W3090893089 isParatext "false" @default.
- W3090893089 isRetracted "false" @default.
- W3090893089 magId "3090893089" @default.
- W3090893089 workType "article" @default.