Matches in SemOpenAlex for { <https://semopenalex.org/work/W3037573926> ?p ?o ?g. }
- W3037573926 abstract "Abstract Nullomers are minimal-length oligomers absent from a genome or proteome. Although research has shown that artificially synthesized nullomers have deleterious effects, there is still a lack of a strategy for the prioritisation and classification of non-occurring sequences as potentially malicious or benign. In this work, by using Markovian models with multiple-testing correction, we reveal significant absent oligomers which are statistically expected to exist. This strongly suggests that their absence is due to negative selection. We survey genomes and proteomes covering the diversity of life, and find thousands of significant absent sequences. Common significant nullomers are often mono- or dinucleotide tracts, or palindromic. Significant viral nullomers are often restriction sites, and may indicate unknown restriction motifs. Surprisingly, significant mammal genome nullomers are often present, but rare, in other mammals, suggesting that they are suppressed but not completely forbidden. Significant human nullomers are rarely present in human viruses, indicating viral mimicry of the host. More than 1/4 of human proteins are one substitution away from containing a significant nullomer. We provide a web-based, interactive database of significant nullomers across genomes and proteomes." @default.
- W3037573926 created "2020-07-02" @default.
- W3037573926 creator A5055829507 @default.
- W3037573926 creator A5082344085 @default.
- W3037573926 date "2020-06-26" @default.
- W3037573926 modified "2023-10-01" @default.
- W3037573926 title "Significant non-existence of sequences in genomes and proteomes" @default.
- W3037573926 cites W1560020441 @default.
- W3037573926 cites W1899868121 @default.
- W3037573926 cites W1992450378 @default.
- W3037573926 cites W2002420618 @default.
- W3037573926 cites W2006514054 @default.
- W3037573926 cites W2018650627 @default.
- W3037573926 cites W2043274355 @default.
- W3037573926 cites W2058832897 @default.
- W3037573926 cites W2063040470 @default.
- W3037573926 cites W2084241014 @default.
- W3037573926 cites W2088588015 @default.
- W3037573926 cites W2101904315 @default.
- W3037573926 cites W2107665951 @default.
- W3037573926 cites W2111326065 @default.
- W3037573926 cites W2114504162 @default.
- W3037573926 cites W2114820744 @default.
- W3037573926 cites W2121114391 @default.
- W3037573926 cites W2124636403 @default.
- W3037573926 cites W2127363306 @default.
- W3037573926 cites W2138565200 @default.
- W3037573926 cites W2141124151 @default.
- W3037573926 cites W2148171419 @default.
- W3037573926 cites W2150392267 @default.
- W3037573926 cites W2156001194 @default.
- W3037573926 cites W2158266834 @default.
- W3037573926 cites W2167080692 @default.
- W3037573926 cites W2188163004 @default.
- W3037573926 cites W2412026210 @default.
- W3037573926 cites W2513547424 @default.
- W3037573926 cites W2559258718 @default.
- W3037573926 cites W2607082035 @default.
- W3037573926 cites W2609272369 @default.
- W3037573926 cites W2744510038 @default.
- W3037573926 cites W2759289851 @default.
- W3037573926 cites W2766477897 @default.
- W3037573926 cites W2778568625 @default.
- W3037573926 cites W2805661590 @default.
- W3037573926 cites W2805967232 @default.
- W3037573926 cites W2809141626 @default.
- W3037573926 cites W2899317755 @default.
- W3037573926 cites W2901775589 @default.
- W3037573926 cites W2909516037 @default.
- W3037573926 cites W2912820434 @default.
- W3037573926 cites W2919121574 @default.
- W3037573926 cites W2938574745 @default.
- W3037573926 cites W2950778827 @default.
- W3037573926 cites W2953799871 @default.
- W3037573926 cites W2955039765 @default.
- W3037573926 cites W2956329239 @default.
- W3037573926 cites W2971539337 @default.
- W3037573926 cites W2979561860 @default.
- W3037573926 cites W2990420098 @default.
- W3037573926 cites W2992307477 @default.
- W3037573926 cites W2994464657 @default.
- W3037573926 cites W3001604492 @default.
- W3037573926 cites W3002528879 @default.
- W3037573926 cites W3004859565 @default.
- W3037573926 cites W3007039579 @default.
- W3037573926 cites W3012143441 @default.
- W3037573926 cites W3013093478 @default.
- W3037573926 cites W3016639056 @default.
- W3037573926 cites W3136918052 @default.
- W3037573926 cites W4205392594 @default.
- W3037573926 cites W4210974853 @default.
- W3037573926 cites W4238449263 @default.
- W3037573926 cites W4243601081 @default.
- W3037573926 cites W4247447075 @default.
- W3037573926 cites W4290305201 @default.
- W3037573926 cites W4297084841 @default.
- W3037573926 cites W2106704555 @default.
- W3037573926 doi "https://doi.org/10.1101/2020.06.25.170431" @default.
- W3037573926 hasPublicationYear "2020" @default.
- W3037573926 type Work @default.
- W3037573926 sameAs 3037573926 @default.
- W3037573926 citedByCount "1" @default.
- W3037573926 countsByYear W30375739262020 @default.
- W3037573926 crossrefType "posted-content" @default.
- W3037573926 hasAuthorship W3037573926A5055829507 @default.
- W3037573926 hasAuthorship W3037573926A5082344085 @default.
- W3037573926 hasBestOaLocation W30375739261 @default.
- W3037573926 hasConcept C104317684 @default.
- W3037573926 hasConcept C104397665 @default.
- W3037573926 hasConcept C141231307 @default.
- W3037573926 hasConcept C197077220 @default.
- W3037573926 hasConcept C44667518 @default.
- W3037573926 hasConcept C54355233 @default.
- W3037573926 hasConcept C70721500 @default.
- W3037573926 hasConcept C7386963 @default.
- W3037573926 hasConcept C78458016 @default.
- W3037573926 hasConcept C86803240 @default.
- W3037573926 hasConceptScore W3037573926C104317684 @default.
- W3037573926 hasConceptScore W3037573926C104397665 @default.
- W3037573926 hasConceptScore W3037573926C141231307 @default.