Matches in SemOpenAlex for { <https://semopenalex.org/work/W2099761707> ?p ?o ?g. }
- W2099761707 endingPage "640" @default.
- W2099761707 startingPage "632" @default.
- W2099761707 abstract "Large-scale cDNA sequencing projects and tiling array studies have revealed the presence of many unannotated genes. For protein coding genes, small coding sequences may not be identified by gene finders because of the conservative nature of prediction algorithms. In this study, we identified small open reading frames (sORFs) with high coding potential by a simple gene finding method (Coding Index, CI) based on the nucleotide composition bias found in most coding sequences. Applying this method to 18 Arabidopsis thaliana and 84 yeast sORF genes with evidence of expression at the protein level gives 100% accurate prediction. In the A. thaliana genome, we identified 7159 sORFs that are likely coding sequences (coding sORFs) with the CI measure at the 1% false-positive rate. To determine if these coding sORFs are parts of functional genes, we evaluated each coding sORF for evidence of transcription or evolutionary conservation. At the 5% false-positive rate, we found that 2996 coding sORFs are likely expressed in at least one experimental condition of the A. thaliana tiling array data. In addition, the evolutionary conservation of each A. thaliana sORF was examined within A. thaliana or between A. thaliana and five plants with complete or partial genome sequences. In 3997 coding sORFs with readily identifiable homologous sequences, 2376 are subject to purifying selection at the 1% false-positive rate. After eliminating coding sORFs with similarity to known transposable elements and those that are likely missing exons of known genes, the remaining 3241 coding sORFs with either evidence of transcription or purifying selection likely belong to novel coding genes in the A. thaliana genome." @default.
- W2099761707 created "2016-06-24" @default.
- W2099761707 creator A5045515728 @default.
- W2099761707 creator A5049358254 @default.
- W2099761707 creator A5059271655 @default.
- W2099761707 creator A5073568638 @default.
- W2099761707 creator A5081111897 @default.
- W2099761707 date "2007-03-29" @default.
- W2099761707 modified "2023-10-18" @default.
- W2099761707 title "A large number of novel coding small open reading frames in the intergenic regions of the <i>Arabidopsis thaliana</i> genome are transcribed and/or under purifying selection" @default.
- W2099761707 cites W1513332069 @default.
- W2099761707 cites W1523340433 @default.
- W2099761707 cites W1603411458 @default.
- W2099761707 cites W1882986874 @default.
- W2099761707 cites W1966134030 @default.
- W2099761707 cites W1970279759 @default.
- W2099761707 cites W1982115165 @default.
- W2099761707 cites W1991480269 @default.
- W2099761707 cites W2012363927 @default.
- W2099761707 cites W2013947447 @default.
- W2099761707 cites W2016927884 @default.
- W2099761707 cites W2025666214 @default.
- W2099761707 cites W2026726511 @default.
- W2099761707 cites W2027059027 @default.
- W2099761707 cites W2038395276 @default.
- W2099761707 cites W2065120930 @default.
- W2099761707 cites W2078146553 @default.
- W2099761707 cites W2081530255 @default.
- W2099761707 cites W2090831186 @default.
- W2099761707 cites W2096744087 @default.
- W2099761707 cites W2098891914 @default.
- W2099761707 cites W2107903949 @default.
- W2099761707 cites W2114112529 @default.
- W2099761707 cites W2118675686 @default.
- W2099761707 cites W2119632499 @default.
- W2099761707 cites W2125346198 @default.
- W2099761707 cites W2131581981 @default.
- W2099761707 cites W2131887570 @default.
- W2099761707 cites W2134666031 @default.
- W2099761707 cites W2135793841 @default.
- W2099761707 cites W2147170044 @default.
- W2099761707 cites W2148523609 @default.
- W2099761707 cites W2150001834 @default.
- W2099761707 cites W2152180708 @default.
- W2099761707 cites W2152873040 @default.
- W2099761707 cites W2153632714 @default.
- W2099761707 cites W2158714788 @default.
- W2099761707 cites W2161191296 @default.
- W2099761707 cites W2167505352 @default.
- W2099761707 cites W2167980479 @default.
- W2099761707 cites W2169805130 @default.
- W2099761707 cites W2169898794 @default.
- W2099761707 cites W2175948239 @default.
- W2099761707 cites W4230932396 @default.
- W2099761707 cites W4244241357 @default.
- W2099761707 cites W4251259245 @default.
- W2099761707 cites W4253368219 @default.
- W2099761707 cites W4256328342 @default.
- W2099761707 doi "https://doi.org/10.1101/gr.5836207" @default.
- W2099761707 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/1855179" @default.
- W2099761707 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/17395691" @default.
- W2099761707 hasPublicationYear "2007" @default.
- W2099761707 type Work @default.
- W2099761707 sameAs 2099761707 @default.
- W2099761707 citedByCount "157" @default.
- W2099761707 countsByYear W20997617072012 @default.
- W2099761707 countsByYear W20997617072013 @default.
- W2099761707 countsByYear W20997617072014 @default.
- W2099761707 countsByYear W20997617072015 @default.
- W2099761707 countsByYear W20997617072016 @default.
- W2099761707 countsByYear W20997617072017 @default.
- W2099761707 countsByYear W20997617072018 @default.
- W2099761707 countsByYear W20997617072019 @default.
- W2099761707 countsByYear W20997617072020 @default.
- W2099761707 countsByYear W20997617072021 @default.
- W2099761707 countsByYear W20997617072022 @default.
- W2099761707 countsByYear W20997617072023 @default.
- W2099761707 crossrefType "journal-article" @default.
- W2099761707 hasAuthorship W2099761707A5045515728 @default.
- W2099761707 hasAuthorship W2099761707A5049358254 @default.
- W2099761707 hasAuthorship W2099761707A5059271655 @default.
- W2099761707 hasAuthorship W2099761707A5073568638 @default.
- W2099761707 hasAuthorship W2099761707A5081111897 @default.
- W2099761707 hasBestOaLocation W20997617071 @default.
- W2099761707 hasConcept C104317684 @default.
- W2099761707 hasConcept C141231307 @default.
- W2099761707 hasConcept C167625842 @default.
- W2099761707 hasConcept C195139083 @default.
- W2099761707 hasConcept C47289529 @default.
- W2099761707 hasConcept C4918238 @default.
- W2099761707 hasConcept C54355233 @default.
- W2099761707 hasConcept C70721500 @default.
- W2099761707 hasConcept C7386963 @default.
- W2099761707 hasConcept C86803240 @default.
- W2099761707 hasConcept C91779695 @default.
- W2099761707 hasConceptScore W2099761707C104317684 @default.
- W2099761707 hasConceptScore W2099761707C141231307 @default.
- W2099761707 hasConceptScore W2099761707C167625842 @default.