Matches in SemOpenAlex for { <https://semopenalex.org/work/W2897300788> ?p ?o ?g. }
- W2897300788 endingPage "e0202513" @default.
- W2897300788 startingPage "e0202513" @default.
- W2897300788 abstract "Overlapping genes represent a fascinating evolutionary puzzle, since they encode two functionally unrelated proteins from the same DNA sequence. They originate by a mechanism of overprinting, in which point mutations in an existing frame allow the expression (the birth) of a completely new protein from a second frame. In viruses, in which overlapping genes are abundant, these new proteins often play a critical role in infection, yet they are frequently overlooked during genome annotation. This results in erroneous interpretation of mutational studies and in a significant waste of resources. Therefore, overlapping genes need to be correctly detected, especially since they are now thought to be abundant also in eukaryotes. Developing better detection methods and conducting systematic evolutionary studies require a large, reliable benchmark dataset of known cases. We thus assembled a high-quality dataset of 80 viral overlapping genes whose expression is experimentally proven. Many of them were not present in databases. We found that overall, overlapping genes differ significantly from non-overlapping genes in their nucleotide and amino acid composition. In particular, the proteins they encode are enriched in high-degeneracy amino acids and depleted in low-degeneracy ones, which may alleviate the evolutionary constraints acting on overlapping genes. Principal component analysis revealed that the vast majority of overlapping genes follow a similar composition bias, despite their heterogeneity in length and function. Six proven mammalian overlapping genes also followed this bias. We propose that this apparently near-universal composition bias may either favour the birth of overlapping genes, or/and result from selection pressure acting on them." @default.
- W2897300788 created "2018-10-26" @default.
- W2897300788 creator A5007404143 @default.
- W2897300788 creator A5008919272 @default.
- W2897300788 creator A5009291913 @default.
- W2897300788 creator A5022569324 @default.
- W2897300788 creator A5026976726 @default.
- W2897300788 creator A5047814803 @default.
- W2897300788 creator A5073252393 @default.
- W2897300788 creator A5075882065 @default.
- W2897300788 date "2018-10-19" @default.
- W2897300788 modified "2023-10-17" @default.
- W2897300788 title "Overlapping genes and the proteins they encode differ significantly in their sequence composition from non-overlapping genes" @default.
- W2897300788 cites W129574965 @default.
- W2897300788 cites W1483301458 @default.
- W2897300788 cites W1562259630 @default.
- W2897300788 cites W1769626217 @default.
- W2897300788 cites W179740174 @default.
- W2897300788 cites W1902451327 @default.
- W2897300788 cites W1926826741 @default.
- W2897300788 cites W1939831927 @default.
- W2897300788 cites W1966054897 @default.
- W2897300788 cites W1968277533 @default.
- W2897300788 cites W1968421852 @default.
- W2897300788 cites W1972892672 @default.
- W2897300788 cites W1976455895 @default.
- W2897300788 cites W1979607809 @default.
- W2897300788 cites W1983824673 @default.
- W2897300788 cites W1990397154 @default.
- W2897300788 cites W1992308820 @default.
- W2897300788 cites W1996002425 @default.
- W2897300788 cites W2000989493 @default.
- W2897300788 cites W2001788095 @default.
- W2897300788 cites W2002599176 @default.
- W2897300788 cites W2006082276 @default.
- W2897300788 cites W2010666698 @default.
- W2897300788 cites W2010690249 @default.
- W2897300788 cites W2011925077 @default.
- W2897300788 cites W2013586160 @default.
- W2897300788 cites W2018232804 @default.
- W2897300788 cites W2020243822 @default.
- W2897300788 cites W2025341678 @default.
- W2897300788 cites W2027757280 @default.
- W2897300788 cites W2029510750 @default.
- W2897300788 cites W2030844589 @default.
- W2897300788 cites W2031450374 @default.
- W2897300788 cites W2032639010 @default.
- W2897300788 cites W2034400748 @default.
- W2897300788 cites W2040415319 @default.
- W2897300788 cites W2043422501 @default.
- W2897300788 cites W2043648764 @default.
- W2897300788 cites W2050115925 @default.
- W2897300788 cites W2054225539 @default.
- W2897300788 cites W2057850601 @default.
- W2897300788 cites W2059135994 @default.
- W2897300788 cites W2059143988 @default.
- W2897300788 cites W2060969855 @default.
- W2897300788 cites W2061603739 @default.
- W2897300788 cites W2065951980 @default.
- W2897300788 cites W2065977014 @default.
- W2897300788 cites W2067127476 @default.
- W2897300788 cites W2067610638 @default.
- W2897300788 cites W2072187285 @default.
- W2897300788 cites W2072841471 @default.
- W2897300788 cites W2079656551 @default.
- W2897300788 cites W2080929480 @default.
- W2897300788 cites W2096286650 @default.
- W2897300788 cites W2096363048 @default.
- W2897300788 cites W2097736339 @default.
- W2897300788 cites W2097781409 @default.
- W2897300788 cites W2098792085 @default.
- W2897300788 cites W2100163761 @default.
- W2897300788 cites W2104690497 @default.
- W2897300788 cites W2107903949 @default.
- W2897300788 cites W2108982334 @default.
- W2897300788 cites W2110650347 @default.
- W2897300788 cites W2111284653 @default.
- W2897300788 cites W2112269135 @default.
- W2897300788 cites W2113180283 @default.
- W2897300788 cites W2113997129 @default.
- W2897300788 cites W2119863792 @default.
- W2897300788 cites W2120167569 @default.
- W2897300788 cites W2120334862 @default.
- W2897300788 cites W2127501111 @default.
- W2897300788 cites W2131848569 @default.
- W2897300788 cites W2136101247 @default.
- W2897300788 cites W2136620995 @default.
- W2897300788 cites W2139566973 @default.
- W2897300788 cites W2141838466 @default.
- W2897300788 cites W2142740146 @default.
- W2897300788 cites W2143519602 @default.
- W2897300788 cites W2146215397 @default.
- W2897300788 cites W2158714788 @default.
- W2897300788 cites W2160630690 @default.
- W2897300788 cites W2162833399 @default.
- W2897300788 cites W2164637416 @default.
- W2897300788 cites W2179178090 @default.
- W2897300788 cites W2216126184 @default.