Matches in SemOpenAlex for { <https://semopenalex.org/work/W3100708393> ?p ?o ?g. }
- W3100708393 endingPage "026004" @default.
- W3100708393 startingPage "026004" @default.
- W3100708393 abstract "Minimal absent words (MAW) of a genomic sequence are subsequences that are absent themselves but the subwords of which are all present in the sequence. The characteristic distribution of genomic MAWs as a function of their length has been observed to be qualitatively similar for all living organisms, the bulk being rather short, and only relatively few being long. It has been an open issue whether the reason behind this phenomenon is statistical or reflects a biological mechanism, and what biological information is contained in absent words. In this work we demonstrate that the bulk can be described by a probabilistic model of sampling words from random sequences, while the tail of long MAWs is of biological origin. We introduce the novel concept of a core of a minimal absent word, which are sequences present in the genome and closest to a given MAW. We show that in bacteria and yeast the cores of the longest MAWs, which exist in two or more copies, are located in highly conserved regions the most prominent example being ribosomal RNAs (rRNAs). We also show that while the distribution of the cores of long MAWs is roughly uniform over these genomes on a coarse-grained level, on a more detailed level it is strongly enhanced in 3' untranslated regions (UTRs) and, to a lesser extent, also in 5' UTRs. This indicates that MAWs and associated MAW cores correspond to fine-tuned evolutionary relationships, and suggest that they can be more widely used as markers for genomic complexity." @default.
- W3100708393 created "2020-11-23" @default.
- W3100708393 creator A5037003272 @default.
- W3100708393 creator A5047019776 @default.
- W3100708393 creator A5015802277 @default.
- W3100708393 date "2016-04-04" @default.
- W3100708393 modified "2023-10-02" @default.
- W3100708393 title "The bulk and the tail of minimal absent words in genome sequences" @default.
- W3100708393 cites W1539969737 @default.
- W3100708393 cites W1890888469 @default.
- W3100708393 cites W1953234029 @default.
- W3100708393 cites W1963889669 @default.
- W3100708393 cites W1976022108 @default.
- W3100708393 cites W1977852064 @default.
- W3100708393 cites W1989596454 @default.
- W3100708393 cites W1998122628 @default.
- W3100708393 cites W2000331453 @default.
- W3100708393 cites W2006514054 @default.
- W3100708393 cites W2018650627 @default.
- W3100708393 cites W2021931962 @default.
- W3100708393 cites W2023034125 @default.
- W3100708393 cites W2029387763 @default.
- W3100708393 cites W2034153329 @default.
- W3100708393 cites W2043499324 @default.
- W3100708393 cites W2045895701 @default.
- W3100708393 cites W2048056758 @default.
- W3100708393 cites W2055353957 @default.
- W3100708393 cites W2055595349 @default.
- W3100708393 cites W2061983309 @default.
- W3100708393 cites W2072059848 @default.
- W3100708393 cites W2075847813 @default.
- W3100708393 cites W2097160089 @default.
- W3100708393 cites W2097493826 @default.
- W3100708393 cites W2099878244 @default.
- W3100708393 cites W2106365398 @default.
- W3100708393 cites W2114820744 @default.
- W3100708393 cites W2119404103 @default.
- W3100708393 cites W2119489506 @default.
- W3100708393 cites W2124479173 @default.
- W3100708393 cites W2136777606 @default.
- W3100708393 cites W2138565200 @default.
- W3100708393 cites W2138755637 @default.
- W3100708393 cites W2144386534 @default.
- W3100708393 cites W2147555013 @default.
- W3100708393 cites W2151614109 @default.
- W3100708393 cites W2161656148 @default.
- W3100708393 cites W2164277750 @default.
- W3100708393 cites W2166123725 @default.
- W3100708393 cites W2170509538 @default.
- W3100708393 cites W2170551349 @default.
- W3100708393 cites W2171196697 @default.
- W3100708393 cites W3101305142 @default.
- W3100708393 cites W4245668478 @default.
- W3100708393 doi "https://doi.org/10.1088/1478-3975/13/2/026004" @default.
- W3100708393 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/27043075" @default.
- W3100708393 hasPublicationYear "2016" @default.
- W3100708393 type Work @default.
- W3100708393 sameAs 3100708393 @default.
- W3100708393 citedByCount "3" @default.
- W3100708393 countsByYear W31007083932017 @default.
- W3100708393 countsByYear W31007083932018 @default.
- W3100708393 countsByYear W31007083932021 @default.
- W3100708393 crossrefType "journal-article" @default.
- W3100708393 hasAuthorship W3100708393A5015802277 @default.
- W3100708393 hasAuthorship W3100708393A5037003272 @default.
- W3100708393 hasAuthorship W3100708393A5047019776 @default.
- W3100708393 hasBestOaLocation W31007083932 @default.
- W3100708393 hasConcept C104317684 @default.
- W3100708393 hasConcept C14036430 @default.
- W3100708393 hasConcept C141231307 @default.
- W3100708393 hasConcept C2778112365 @default.
- W3100708393 hasConcept C54355233 @default.
- W3100708393 hasConcept C67705224 @default.
- W3100708393 hasConcept C70721500 @default.
- W3100708393 hasConcept C78458016 @default.
- W3100708393 hasConcept C86803240 @default.
- W3100708393 hasConcept C89604277 @default.
- W3100708393 hasConceptScore W3100708393C104317684 @default.
- W3100708393 hasConceptScore W3100708393C14036430 @default.
- W3100708393 hasConceptScore W3100708393C141231307 @default.
- W3100708393 hasConceptScore W3100708393C2778112365 @default.
- W3100708393 hasConceptScore W3100708393C54355233 @default.
- W3100708393 hasConceptScore W3100708393C67705224 @default.
- W3100708393 hasConceptScore W3100708393C70721500 @default.
- W3100708393 hasConceptScore W3100708393C78458016 @default.
- W3100708393 hasConceptScore W3100708393C86803240 @default.
- W3100708393 hasConceptScore W3100708393C89604277 @default.
- W3100708393 hasIssue "2" @default.
- W3100708393 hasLocation W31007083931 @default.
- W3100708393 hasLocation W31007083932 @default.
- W3100708393 hasLocation W31007083933 @default.
- W3100708393 hasLocation W31007083934 @default.
- W3100708393 hasOpenAccess W3100708393 @default.
- W3100708393 hasPrimaryLocation W31007083931 @default.
- W3100708393 hasRelatedWork W1872127137 @default.
- W3100708393 hasRelatedWork W1991523530 @default.
- W3100708393 hasRelatedWork W2002128513 @default.
- W3100708393 hasRelatedWork W2020824267 @default.