Matches in SemOpenAlex for { <https://semopenalex.org/work/W2040298461> ?p ?o ?g. }
- W2040298461 abstract "Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. The corpus, annotation guidelines, and other associated resources are freely available at http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml ." @default.
- W2040298461 created "2016-06-24" @default.
- W2040298461 creator A5015949255 @default.
- W2040298461 creator A5016349593 @default.
- W2040298461 creator A5020016552 @default.
- W2040298461 creator A5026930630 @default.
- W2040298461 creator A5030261399 @default.
- W2040298461 creator A5041860080 @default.
- W2040298461 creator A5062221063 @default.
- W2040298461 creator A5067354556 @default.
- W2040298461 creator A5083125835 @default.
- W2040298461 creator A5084069668 @default.
- W2040298461 creator A5090651928 @default.
- W2040298461 date "2012-07-09" @default.
- W2040298461 modified "2023-10-09" @default.
- W2040298461 title "Concept annotation in the CRAFT corpus" @default.
- W2040298461 cites W1757909678 @default.
- W2040298461 cites W1850865022 @default.
- W2040298461 cites W1976316416 @default.
- W2040298461 cites W1981492645 @default.
- W2040298461 cites W1985695385 @default.
- W2040298461 cites W1994306321 @default.
- W2040298461 cites W1994843778 @default.
- W2040298461 cites W2007367068 @default.
- W2040298461 cites W2010798628 @default.
- W2040298461 cites W2023806451 @default.
- W2040298461 cites W2044257395 @default.
- W2040298461 cites W2048140075 @default.
- W2040298461 cites W2055532234 @default.
- W2040298461 cites W2056616115 @default.
- W2040298461 cites W2071993998 @default.
- W2040298461 cites W2075322787 @default.
- W2040298461 cites W2077797310 @default.
- W2040298461 cites W2080948989 @default.
- W2040298461 cites W2085541230 @default.
- W2040298461 cites W2092419931 @default.
- W2040298461 cites W2097898123 @default.
- W2040298461 cites W2099369363 @default.
- W2040298461 cites W2101819947 @default.
- W2040298461 cites W2103017472 @default.
- W2040298461 cites W2107580398 @default.
- W2040298461 cites W2112589398 @default.
- W2040298461 cites W2113142309 @default.
- W2040298461 cites W2113443914 @default.
- W2040298461 cites W2121588207 @default.
- W2040298461 cites W2122904379 @default.
- W2040298461 cites W2123171788 @default.
- W2040298461 cites W2127241285 @default.
- W2040298461 cites W2127563440 @default.
- W2040298461 cites W2133465414 @default.
- W2040298461 cites W2135230352 @default.
- W2040298461 cites W2135940255 @default.
- W2040298461 cites W2149803936 @default.
- W2040298461 cites W2150392051 @default.
- W2040298461 cites W2159985898 @default.
- W2040298461 cites W2163107094 @default.
- W2040298461 cites W4237381349 @default.
- W2040298461 cites W4250297812 @default.
- W2040298461 doi "https://doi.org/10.1186/1471-2105-13-161" @default.
- W2040298461 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/3476437" @default.
- W2040298461 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/22776079" @default.
- W2040298461 hasPublicationYear "2012" @default.
- W2040298461 type Work @default.
- W2040298461 sameAs 2040298461 @default.
- W2040298461 citedByCount "205" @default.
- W2040298461 countsByYear W20402984612012 @default.
- W2040298461 countsByYear W20402984612013 @default.
- W2040298461 countsByYear W20402984612014 @default.
- W2040298461 countsByYear W20402984612015 @default.
- W2040298461 countsByYear W20402984612016 @default.
- W2040298461 countsByYear W20402984612017 @default.
- W2040298461 countsByYear W20402984612018 @default.
- W2040298461 countsByYear W20402984612019 @default.
- W2040298461 countsByYear W20402984612020 @default.
- W2040298461 countsByYear W20402984612021 @default.
- W2040298461 countsByYear W20402984612022 @default.
- W2040298461 countsByYear W20402984612023 @default.
- W2040298461 crossrefType "journal-article" @default.
- W2040298461 hasAuthorship W2040298461A5015949255 @default.
- W2040298461 hasAuthorship W2040298461A5016349593 @default.
- W2040298461 hasAuthorship W2040298461A5020016552 @default.
- W2040298461 hasAuthorship W2040298461A5026930630 @default.
- W2040298461 hasAuthorship W2040298461A5030261399 @default.
- W2040298461 hasAuthorship W2040298461A5041860080 @default.
- W2040298461 hasAuthorship W2040298461A5062221063 @default.
- W2040298461 hasAuthorship W2040298461A5067354556 @default.
- W2040298461 hasAuthorship W2040298461A5083125835 @default.
- W2040298461 hasAuthorship W2040298461A5084069668 @default.
- W2040298461 hasAuthorship W2040298461A5090651928 @default.
- W2040298461 hasBestOaLocation W20402984611 @default.
- W2040298461 hasConcept C111472728 @default.
- W2040298461 hasConcept C136764020 @default.
- W2040298461 hasConcept C137982476 @default.
- W2040298461 hasConcept C138885662 @default.
- W2040298461 hasConcept C154945302 @default.
- W2040298461 hasConcept C177264268 @default.
- W2040298461 hasConcept C199360897 @default.
- W2040298461 hasConcept C204321447 @default.
- W2040298461 hasConcept C2129575 @default.
- W2040298461 hasConcept C23123220 @default.