Matches in SemOpenAlex for { <https://semopenalex.org/work/W4289755328> ?p ?o ?g. }
- W4289755328 abstract "ABSTRACT ‘Newly Born’ proteins, devoid of detectable homology to any other proteins, known as orphan proteins, occur in a single species or within a taxonomically restricted gene family. They are generated by expression of novel Open Reading Frames, and appear throughout evolution. We were curious if the three recently developed programs for predicting protein structures, viz., AlphaFold2, RoseTTAFold, and ESMFold, might be of value for comparison of such ‘Newly Born’ proteins to random polypeptides with amino acid content similar to that of native proteins, which have been called ‘Never Born’ proteins. The programs were used to compare the structures of two sets of ‘Never Born’ proteins that had been expressed – Group 1, which had been shown experimentally to possess substantial secondary structure, and Group 3, which had been shown to be intrinsically disordered. Overall, the models generated were scored as being of low quality but revealed some general principles. Specifically, all four members of Group 1 were predicted to be compact by all three algorithms. The members of Group 3 were predicted to be very extended, as would be expected for intrinsically disordered proteins. The three programs were then used to predict the structures of three orphan proteins whose crystal structures had been solved, two of which display novel folds. Finally, they were used to predict the structures of seven orphan proteins with well-identified biological functions, whose 3D structures are not known. Two proteins, which were predicted to be disordered based on their sequences, are predicted by all three structure algorithms to be extended structures. The other five were predicted to be compact structures with two exceptions in the case of AlphaFold2. All three prediction algorithms make remarkably similar and high-quality predictions for one large protein, HCO_11565, from a nematode.
It is conjectured that this is due to many homologs in the taxonomically restricted family of which it is a member and to the fact that the Dali server revealed several non-related proteins with similar folds. Overall, orphan and taxonomically restricted proteins are often predicted to have compact 3D structures, sometimes with a novel fold that is a consequence of their novel sequences, which are associated with the appearance of new biological functions." @default.
- W4289755328 created "2022-08-04" @default.
- W4289755328 creator A5025933576 @default.
- W4289755328 creator A5029048058 @default.
- W4289755328 creator A5035282947 @default.
- W4289755328 creator A5045275595 @default.
- W4289755328 creator A5050870388 @default.
- W4289755328 creator A5091713319 @default.
- W4289755328 date "2022-08-02" @default.
- W4289755328 modified "2023-09-26" @default.
- W4289755328 title "Do Newly Born Orphan Proteins Resemble Never Born Proteins? A Study Using Three Deep Learning Algorithms" @default.
- W4289755328 cites W1551404386 @default.
- W4289755328 cites W1579792554 @default.
- W4289755328 cites W1967090894 @default.
- W4289755328 cites W1967974247 @default.
- W4289755328 cites W1971301499 @default.
- W4289755328 cites W1972325703 @default.
- W4289755328 cites W1982145465 @default.
- W4289755328 cites W1982198083 @default.
- W4289755328 cites W1982583124 @default.
- W4289755328 cites W1985807928 @default.
- W4289755328 cites W1988985255 @default.
- W4289755328 cites W1991949383 @default.
- W4289755328 cites W2001335546 @default.
- W4289755328 cites W2003022823 @default.
- W4289755328 cites W2013460486 @default.
- W4289755328 cites W2019017190 @default.
- W4289755328 cites W2059223767 @default.
- W4289755328 cites W2065127848 @default.
- W4289755328 cites W2066053649 @default.
- W4289755328 cites W2068043995 @default.
- W4289755328 cites W2078964153 @default.
- W4289755328 cites W2080323035 @default.
- W4289755328 cites W2088201027 @default.
- W4289755328 cites W2090424274 @default.
- W4289755328 cites W2090785561 @default.
- W4289755328 cites W2094966626 @default.
- W4289755328 cites W2096476896 @default.
- W4289755328 cites W2096536772 @default.
- W4289755328 cites W2097484079 @default.
- W4289755328 cites W2097891553 @default.
- W4289755328 cites W2106612397 @default.
- W4289755328 cites W2114175948 @default.
- W4289755328 cites W2114823344 @default.
- W4289755328 cites W2115089435 @default.
- W4289755328 cites W2120507610 @default.
- W4289755328 cites W2127673162 @default.
- W4289755328 cites W2128643990 @default.
- W4289755328 cites W2129883957 @default.
- W4289755328 cites W2130479394 @default.
- W4289755328 cites W2131965976 @default.
- W4289755328 cites W2138668897 @default.
- W4289755328 cites W2140673705 @default.
- W4289755328 cites W2146729343 @default.
- W4289755328 cites W2155479906 @default.
- W4289755328 cites W2156697496 @default.
- W4289755328 cites W2158919079 @default.
- W4289755328 cites W2575382994 @default.
- W4289755328 cites W2767444993 @default.
- W4289755328 cites W2805582776 @default.
- W4289755328 cites W2912126500 @default.
- W4289755328 cites W2969287734 @default.
- W4289755328 cites W2980170804 @default.
- W4289755328 cites W3007397157 @default.
- W4289755328 cites W3105446189 @default.
- W4289755328 cites W3128724325 @default.
- W4289755328 cites W3132153121 @default.
- W4289755328 cites W3156051371 @default.
- W4289755328 cites W3177828909 @default.
- W4289755328 cites W3184265853 @default.
- W4289755328 cites W3186179742 @default.
- W4289755328 cites W3195375135 @default.
- W4289755328 cites W3198923619 @default.
- W4289755328 cites W3216341763 @default.
- W4289755328 cites W4206275111 @default.
- W4289755328 cites W4206563428 @default.
- W4289755328 cites W4206767430 @default.
- W4289755328 cites W4213208821 @default.
- W4289755328 cites W4281790889 @default.
- W4289755328 cites W4282922306 @default.
- W4289755328 cites W4296780589 @default.
- W4289755328 cites W4300861364 @default.
- W4289755328 cites W4306178575 @default.
- W4289755328 cites W4306412091 @default.
- W4289755328 cites W4312084900 @default.
- W4289755328 doi "https://doi.org/10.1101/2022.08.02.502493" @default.
- W4289755328 hasPublicationYear "2022" @default.
- W4289755328 type Work @default.
- W4289755328 citedByCount "1" @default.
- W4289755328 countsByYear W42897553282022 @default.
- W4289755328 crossrefType "posted-content" @default.
- W4289755328 hasAuthorship W4289755328A5025933576 @default.
- W4289755328 hasAuthorship W4289755328A5029048058 @default.
- W4289755328 hasAuthorship W4289755328A5035282947 @default.
- W4289755328 hasAuthorship W4289755328A5045275595 @default.
- W4289755328 hasAuthorship W4289755328A5050870388 @default.
- W4289755328 hasAuthorship W4289755328A5091713319 @default.
- W4289755328 hasBestOaLocation W42897553281 @default.
- W4289755328 hasConcept C104317684 @default.
- W4289755328 hasConcept C165525559 @default.