Matches in SemOpenAlex for { <https://semopenalex.org/work/W4221128994> ?p ?o ?g. }
- W4221128994 abstract "Databases are fundamental to advance biomedical science. However, most of them are populated and updated with a great deal of human effort. Biomedical Relation Extraction (BioRE) aims to shift this burden to machines. Among its different applications, the discovery of Gene-Disease Associations (GDAs) is one of BioRE most relevant tasks. Nevertheless, few resources have been developed to train models for GDA extraction. Besides, these resources are all limited in size-preventing models from scaling effectively to large amounts of data.To overcome this limitation, we have exploited the DisGeNET database to build a large-scale, semi-automatically annotated dataset for GDA extraction. DisGeNET stores one of the largest available collections of genes and variants involved in human diseases. Relying on DisGeNET, we developed TBGA: a GDA extraction dataset generated from more than 700K publications that consists of over 200K instances and 100K gene-disease pairs. Each instance consists of the sentence from which the GDA was extracted, the corresponding GDA, and the information about the gene-disease pair.TBGA is amongst the largest datasets for GDA extraction. We have evaluated state-of-the-art models for GDA extraction on TBGA, showing that it is a challenging and well-suited dataset for the task. We made the dataset publicly available to foster the development of state-of-the-art BioRE models for GDA extraction." @default.
- W4221128994 created "2022-04-03" @default.
- W4221128994 creator A5003820981 @default.
- W4221128994 creator A5078254809 @default.
- W4221128994 date "2022-03-31" @default.
- W4221128994 modified "2023-10-14" @default.
- W4221128994 title "TBGA: a large-scale Gene-Disease Association dataset for Biomedical Relation Extraction" @default.
- W4221128994 cites W1604644367 @default.
- W4221128994 cites W2007069550 @default.
- W4221128994 cites W2029302969 @default.
- W4221128994 cites W2031950862 @default.
- W4221128994 cites W2036935277 @default.
- W4221128994 cites W2048296798 @default.
- W4221128994 cites W2052217781 @default.
- W4221128994 cites W2061833373 @default.
- W4221128994 cites W2104107436 @default.
- W4221128994 cites W2107580398 @default.
- W4221128994 cites W2110119381 @default.
- W4221128994 cites W2116868464 @default.
- W4221128994 cites W2128768874 @default.
- W4221128994 cites W2136437513 @default.
- W4221128994 cites W2144427809 @default.
- W4221128994 cites W2147714160 @default.
- W4221128994 cites W2158419693 @default.
- W4221128994 cites W2159583324 @default.
- W4221128994 cites W2170146596 @default.
- W4221128994 cites W2251135946 @default.
- W4221128994 cites W2293938906 @default.
- W4221128994 cites W2515462165 @default.
- W4221128994 cites W2517194566 @default.
- W4221128994 cites W2582146834 @default.
- W4221128994 cites W2762091108 @default.
- W4221128994 cites W2770565711 @default.
- W4221128994 cites W2773309042 @default.
- W4221128994 cites W2787229753 @default.
- W4221128994 cites W2898789672 @default.
- W4221128994 cites W2901332105 @default.
- W4221128994 cites W2901527454 @default.
- W4221128994 cites W2970959783 @default.
- W4221128994 cites W2983166786 @default.
- W4221128994 cites W2998616814 @default.
- W4221128994 cites W3006508031 @default.
- W4221128994 cites W3033077186 @default.
- W4221128994 cites W3047366767 @default.
- W4221128994 cites W3060853552 @default.
- W4221128994 cites W3111301126 @default.
- W4221128994 cites W3175518369 @default.
- W4221128994 cites W4237381349 @default.
- W4221128994 cites W3114456282 @default.
- W4221128994 doi "https://doi.org/10.1186/s12859-022-04646-6" @default.
- W4221128994 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/35361129" @default.
- W4221128994 hasPublicationYear "2022" @default.
- W4221128994 type Work @default.
- W4221128994 citedByCount "5" @default.
- W4221128994 countsByYear W42211289942022 @default.
- W4221128994 countsByYear W42211289942023 @default.
- W4221128994 crossrefType "journal-article" @default.
- W4221128994 hasAuthorship W4221128994A5003820981 @default.
- W4221128994 hasAuthorship W4221128994A5078254809 @default.
- W4221128994 hasBestOaLocation W42211289941 @default.
- W4221128994 hasConcept C104317684 @default.
- W4221128994 hasConcept C124101348 @default.
- W4221128994 hasConcept C150194340 @default.
- W4221128994 hasConcept C153604712 @default.
- W4221128994 hasConcept C154945302 @default.
- W4221128994 hasConcept C162324750 @default.
- W4221128994 hasConcept C165141518 @default.
- W4221128994 hasConcept C187736073 @default.
- W4221128994 hasConcept C193524817 @default.
- W4221128994 hasConcept C195807954 @default.
- W4221128994 hasConcept C205649164 @default.
- W4221128994 hasConcept C2522767166 @default.
- W4221128994 hasConcept C2778755073 @default.
- W4221128994 hasConcept C2780451532 @default.
- W4221128994 hasConcept C41008148 @default.
- W4221128994 hasConcept C54355233 @default.
- W4221128994 hasConcept C58640448 @default.
- W4221128994 hasConcept C71472368 @default.
- W4221128994 hasConcept C86803240 @default.
- W4221128994 hasConcept C95371953 @default.
- W4221128994 hasConceptScore W4221128994C104317684 @default.
- W4221128994 hasConceptScore W4221128994C124101348 @default.
- W4221128994 hasConceptScore W4221128994C150194340 @default.
- W4221128994 hasConceptScore W4221128994C153604712 @default.
- W4221128994 hasConceptScore W4221128994C154945302 @default.
- W4221128994 hasConceptScore W4221128994C162324750 @default.
- W4221128994 hasConceptScore W4221128994C165141518 @default.
- W4221128994 hasConceptScore W4221128994C187736073 @default.
- W4221128994 hasConceptScore W4221128994C193524817 @default.
- W4221128994 hasConceptScore W4221128994C195807954 @default.
- W4221128994 hasConceptScore W4221128994C205649164 @default.
- W4221128994 hasConceptScore W4221128994C2522767166 @default.
- W4221128994 hasConceptScore W4221128994C2778755073 @default.
- W4221128994 hasConceptScore W4221128994C2780451532 @default.
- W4221128994 hasConceptScore W4221128994C41008148 @default.
- W4221128994 hasConceptScore W4221128994C54355233 @default.
- W4221128994 hasConceptScore W4221128994C58640448 @default.
- W4221128994 hasConceptScore W4221128994C71472368 @default.
- W4221128994 hasConceptScore W4221128994C86803240 @default.
- W4221128994 hasConceptScore W4221128994C95371953 @default.