Matches in SemOpenAlex for { <https://semopenalex.org/work/W4220853821> ?p ?o ?g. }
- W4220853821 endingPage "341" @default.
- W4220853821 startingPage "320" @default.
- W4220853821 abstract "A key limiting factor in organising and using information from physical specimens curated in natural science collections is making that information computable, with institutional digitization tending to focus more on imaging the specimens themselves than on efficiently capturing computable data about them. Label data are traditionally manually transcribed today with high cost and low throughput, rendering such a task constrained for many collection-holding institutions at current funding levels. We show how computer vision, optical character recognition, handwriting recognition, named entity recognition and language translation technologies can be implemented into canonical workflow component libraries with findable, accessible, interoperable, and reusable (FAIR) characteristics. These libraries are being developed in a cloud-based workflow platform, the ‘Specimen Data Refinery’ (SDR), founded on the Galaxy workflow engine, Common Workflow Language, Research Object Crates (RO-Crate) and WorkflowHub technologies. The SDR can be applied to specimens’ labels and other artefacts, offering the prospect of greatly accelerated and more accurate data capture in computable form. Two kinds of FAIR Digital Objects (FDO) are created by packaging outputs of SDR workflows and workflow components as digital objects with metadata, a persistent identifier, and a specific type definition. The first kind of FDO are computable Digital Specimen (DS) objects that can be consumed/produced by workflows and other applications. A single DS is the input data structure submitted to a workflow that is modified by each workflow component in turn to produce a refined DS at the end. The Specimen Data Refinery provides a library of such components that can be used individually, or in series. To cofunction, each library component describes the fields it requires from the DS and the fields it will in turn populate or enrich. The second kind of FDO, RO-Crates, gather and archive the diverse set of digital and real-world resources, configurations, and actions (the provenance) contributing to a unit of research work, allowing that work to be faithfully recorded and reproduced. Here we describe the Specimen Data Refinery with its motivating requirements, focusing on what is essential in the creation of canonical workflow component libraries and its conformance with the requirements of an emerging FDO Core Specification being developed by the FDO Forum." @default.
- W4220853821 created "2022-04-03" @default.
- W4220853821 creator A5004931104 @default.
- W4220853821 creator A5022818385 @default.
- W4220853821 creator A5027899893 @default.
- W4220853821 creator A5029941394 @default.
- W4220853821 creator A5057615744 @default.
- W4220853821 creator A5059103997 @default.
- W4220853821 creator A5082113801 @default.
- W4220853821 creator A5089943881 @default.
- W4220853821 date "2022-01-01" @default.
- W4220853821 modified "2023-10-16" @default.
- W4220853821 title "The Specimen Data Refinery: A Canonical Workflow Framework and FAIR Digital Object Approach to Speeding up Digital Mobilisation of Natural History Collections" @default.
- W4220853821 cites W1868943680 @default.
- W4220853821 cites W2019920685 @default.
- W4220853821 cites W2071908440 @default.
- W4220853821 cites W2128105693 @default.
- W4220853821 cites W2302501749 @default.
- W4220853821 cites W2537696795 @default.
- W4220853821 cites W2552011409 @default.
- W4220853821 cites W2618602062 @default.
- W4220853821 cites W2743583628 @default.
- W4220853821 cites W2901476362 @default.
- W4220853821 cites W2911663376 @default.
- W4220853821 cites W2917007534 @default.
- W4220853821 cites W2917207851 @default.
- W4220853821 cites W2920734838 @default.
- W4220853821 cites W2921506023 @default.
- W4220853821 cites W2951211197 @default.
- W4220853821 cites W2953916733 @default.
- W4220853821 cites W2961706560 @default.
- W4220853821 cites W2972030815 @default.
- W4220853821 cites W2986399651 @default.
- W4220853821 cites W3013553122 @default.
- W4220853821 cites W3020941647 @default.
- W4220853821 cites W3024414693 @default.
- W4220853821 cites W3039389083 @default.
- W4220853821 cites W3039579509 @default.
- W4220853821 cites W3039900714 @default.
- W4220853821 cites W3049349070 @default.
- W4220853821 cites W3121051245 @default.
- W4220853821 cites W3127613064 @default.
- W4220853821 cites W3139096123 @default.
- W4220853821 cites W4220999516 @default.
- W4220853821 cites W4241396795 @default.
- W4220853821 cites W4242710869 @default.
- W4220853821 doi "https://doi.org/10.1162/dint_a_00134" @default.
- W4220853821 hasPublicationYear "2022" @default.
- W4220853821 type Work @default.
- W4220853821 citedByCount "6" @default.
- W4220853821 countsByYear W42208538212022 @default.
- W4220853821 countsByYear W42208538212023 @default.
- W4220853821 crossrefType "journal-article" @default.
- W4220853821 hasAuthorship W4220853821A5004931104 @default.
- W4220853821 hasAuthorship W4220853821A5022818385 @default.
- W4220853821 hasAuthorship W4220853821A5027899893 @default.
- W4220853821 hasAuthorship W4220853821A5029941394 @default.
- W4220853821 hasAuthorship W4220853821A5057615744 @default.
- W4220853821 hasAuthorship W4220853821A5059103997 @default.
- W4220853821 hasAuthorship W4220853821A5082113801 @default.
- W4220853821 hasAuthorship W4220853821A5089943881 @default.
- W4220853821 hasBestOaLocation W42208538211 @default.
- W4220853821 hasConcept C111919701 @default.
- W4220853821 hasConcept C136764020 @default.
- W4220853821 hasConcept C177212765 @default.
- W4220853821 hasConcept C188220564 @default.
- W4220853821 hasConcept C20136886 @default.
- W4220853821 hasConcept C41008148 @default.
- W4220853821 hasConcept C77088390 @default.
- W4220853821 hasConcept C79974875 @default.
- W4220853821 hasConcept C93518851 @default.
- W4220853821 hasConceptScore W4220853821C111919701 @default.
- W4220853821 hasConceptScore W4220853821C136764020 @default.
- W4220853821 hasConceptScore W4220853821C177212765 @default.
- W4220853821 hasConceptScore W4220853821C188220564 @default.
- W4220853821 hasConceptScore W4220853821C20136886 @default.
- W4220853821 hasConceptScore W4220853821C41008148 @default.
- W4220853821 hasConceptScore W4220853821C77088390 @default.
- W4220853821 hasConceptScore W4220853821C79974875 @default.
- W4220853821 hasConceptScore W4220853821C93518851 @default.
- W4220853821 hasIssue "2" @default.
- W4220853821 hasLocation W42208538211 @default.
- W4220853821 hasLocation W42208538212 @default.
- W4220853821 hasLocation W42208538213 @default.
- W4220853821 hasLocation W42208538214 @default.
- W4220853821 hasLocation W42208538215 @default.
- W4220853821 hasOpenAccess W4220853821 @default.
- W4220853821 hasPrimaryLocation W42208538211 @default.
- W4220853821 hasRelatedWork W2001814116 @default.
- W4220853821 hasRelatedWork W2035400554 @default.
- W4220853821 hasRelatedWork W2050637807 @default.
- W4220853821 hasRelatedWork W2064470642 @default.
- W4220853821 hasRelatedWork W2074746251 @default.
- W4220853821 hasRelatedWork W2143887500 @default.
- W4220853821 hasRelatedWork W2274725732 @default.
- W4220853821 hasRelatedWork W2356938388 @default.
- W4220853821 hasRelatedWork W2377329191 @default.
- W4220853821 hasRelatedWork W2746179582 @default.