Matches in SemOpenAlex for { <https://semopenalex.org/work/W3100185420> ?p ?o ?g. }
Showing items 1 to 90 of 90 with 100 items per page.
- W3100185420 abstract "Text mining is widely used within the life sciences as an evidence stream for inferring relationships between biological entities. In most cases, conventional string matching is used to identify cooccurrences of given entities within sentences. This limits the utility of text mining results, as they tend to contain significant noise due to weak inclusion criteria. We show that, in the indicative case of protein-protein interactions (PPIs), the majority of sentences containing cooccurrences (~75%) do not describe any causal relationship. We further demonstrate the feasibility of fine tuning a strong domain-specific language model, BioBERT, to analyse sentences containing cooccurrences and accurately (F1 score: 88.95%) identify functional links between proteins. These strong results come in spite of the deep complexity of the language involved, which limits the accuracy even of expert curators. We establish guidelines for best practices in data creation to this end, including an examination of inter-annotator agreement, of semisupervision, and of rules based alternatives to manual curation, and explore the potential for downstream use of the model to accelerate curation of interactions in the SIGNOR database of causal protein interactions and the IntAct database of experimental evidence for physical protein interactions." @default.
- W3100185420 created "2020-11-23" @default.
- W3100185420 creator A5001401686 @default.
- W3100185420 creator A5005184545 @default.
- W3100185420 creator A5005779459 @default.
- W3100185420 creator A5029338279 @default.
- W3100185420 creator A5031417057 @default.
- W3100185420 creator A5034929252 @default.
- W3100185420 creator A5049010848 @default.
- W3100185420 creator A5050128493 @default.
- W3100185420 creator A5081988749 @default.
- W3100185420 creator A5086705091 @default.
- W3100185420 date "2020-09-01" @default.
- W3100185420 modified "2023-09-27" @default.
- W3100185420 title "Optimising biomedical relationship extraction with BioBERT" @default.
- W3100185420 cites W1850865022 @default.
- W3100185420 cites W1996699506 @default.
- W3100185420 cites W2025658351 @default.
- W3100185420 cites W2043764521 @default.
- W3100185420 cites W2048296798 @default.
- W3100185420 cites W2062584010 @default.
- W3100185420 cites W2085752173 @default.
- W3100185420 cites W2138627627 @default.
- W3100185420 cites W2494967098 @default.
- W3100185420 cites W2558999090 @default.
- W3100185420 cites W2887377515 @default.
- W3100185420 cites W2900569176 @default.
- W3100185420 cites W2904726360 @default.
- W3100185420 cites W2911489562 @default.
- W3100185420 cites W2963341956 @default.
- W3100185420 cites W2964348125 @default.
- W3100185420 cites W3042444601 @default.
- W3100185420 hasPublicationYear "2020" @default.
- W3100185420 type Work @default.
- W3100185420 sameAs 3100185420 @default.
- W3100185420 citedByCount "3" @default.
- W3100185420 countsByYear W31001854202021 @default.
- W3100185420 crossrefType "posted-content" @default.
- W3100185420 hasAuthorship W3100185420A5001401686 @default.
- W3100185420 hasAuthorship W3100185420A5005184545 @default.
- W3100185420 hasAuthorship W3100185420A5005779459 @default.
- W3100185420 hasAuthorship W3100185420A5029338279 @default.
- W3100185420 hasAuthorship W3100185420A5031417057 @default.
- W3100185420 hasAuthorship W3100185420A5034929252 @default.
- W3100185420 hasAuthorship W3100185420A5049010848 @default.
- W3100185420 hasAuthorship W3100185420A5050128493 @default.
- W3100185420 hasAuthorship W3100185420A5081988749 @default.
- W3100185420 hasAuthorship W3100185420A5086705091 @default.
- W3100185420 hasBestOaLocation W31001854201 @default.
- W3100185420 hasConcept C105795698 @default.
- W3100185420 hasConcept C124101348 @default.
- W3100185420 hasConcept C134306372 @default.
- W3100185420 hasConcept C154945302 @default.
- W3100185420 hasConcept C157486923 @default.
- W3100185420 hasConcept C165064840 @default.
- W3100185420 hasConcept C204321447 @default.
- W3100185420 hasConcept C23123220 @default.
- W3100185420 hasConcept C33923547 @default.
- W3100185420 hasConcept C36503486 @default.
- W3100185420 hasConcept C37914503 @default.
- W3100185420 hasConcept C41008148 @default.
- W3100185420 hasConceptScore W3100185420C105795698 @default.
- W3100185420 hasConceptScore W3100185420C124101348 @default.
- W3100185420 hasConceptScore W3100185420C134306372 @default.
- W3100185420 hasConceptScore W3100185420C154945302 @default.
- W3100185420 hasConceptScore W3100185420C157486923 @default.
- W3100185420 hasConceptScore W3100185420C165064840 @default.
- W3100185420 hasConceptScore W3100185420C204321447 @default.
- W3100185420 hasConceptScore W3100185420C23123220 @default.
- W3100185420 hasConceptScore W3100185420C33923547 @default.
- W3100185420 hasConceptScore W3100185420C36503486 @default.
- W3100185420 hasConceptScore W3100185420C37914503 @default.
- W3100185420 hasConceptScore W3100185420C41008148 @default.
- W3100185420 hasLocation W31001854201 @default.
- W3100185420 hasOpenAccess W3100185420 @default.
- W3100185420 hasPrimaryLocation W31001854201 @default.
- W3100185420 hasRelatedWork W10647322 @default.
- W3100185420 hasRelatedWork W11643025 @default.
- W3100185420 hasRelatedWork W11991885 @default.
- W3100185420 hasRelatedWork W12452471 @default.
- W3100185420 hasRelatedWork W13752685 @default.
- W3100185420 hasRelatedWork W14808 @default.
- W3100185420 hasRelatedWork W3901497 @default.
- W3100185420 hasRelatedWork W4373349 @default.
- W3100185420 hasRelatedWork W5104570 @default.
- W3100185420 hasRelatedWork W8241849 @default.
- W3100185420 isParatext "false" @default.
- W3100185420 isRetracted "false" @default.
- W3100185420 magId "3100185420" @default.
- W3100185420 workType "article" @default.