Matches in SemOpenAlex for { <https://semopenalex.org/work/W2894554333> ?p ?o ?g. }
- W2894554333 abstract "The relationships between polypeptide composition, sequence, structure and function have been puzzling biologists ever since first protein sequences were determined. Here, we study the statistics of occurrence of all possible pentapeptide sequences in known proteins. To compensate for the non-uniform distribution of individual amino acid residues in protein sequences, we investigate separately all possible permutations of every given amino acid composition. For the majority of permutation groups we find that pentapeptide occurrences deviate strongly from the expected binomial distributions, and that the observed distributions are also characterized by high numbers of outlier sequences. An analysis of identified outliers shows they often contain known motifs and rare amino acids, suggesting that they represent important functional elements. We further compare the pentapeptide composition of regions known to correspond to protein domains with that of non-domain regions. We find that a substantial number of pentapeptides is clearly strongly favored in protein domains. Finally, we show that over-represented pentapeptides are significantly related to known functional motifs and to predicted ancient structural peptides." @default.
- W2894554333 created "2018-10-12" @default.
- W2894554333 creator A5016232823 @default.
- W2894554333 creator A5021021856 @default.
- W2894554333 creator A5035111563 @default.
- W2894554333 creator A5043804688 @default.
- W2894554333 creator A5062372955 @default.
- W2894554333 creator A5072526330 @default.
- W2894554333 creator A5077637095 @default.
- W2894554333 date "2018-10-11" @default.
- W2894554333 modified "2023-10-05" @default.
- W2894554333 title "Global pentapeptide statistics are far away from expected distributions" @default.
- W2894554333 cites W1508885706 @default.
- W2894554333 cites W1592237925 @default.
- W2894554333 cites W1594390190 @default.
- W2894554333 cites W1875580551 @default.
- W2894554333 cites W1969425147 @default.
- W2894554333 cites W1999545669 @default.
- W2894554333 cites W2001755249 @default.
- W2894554333 cites W2006465073 @default.
- W2894554333 cites W2013460486 @default.
- W2894554333 cites W2056972503 @default.
- W2894554333 cites W2071996119 @default.
- W2894554333 cites W2072187305 @default.
- W2894554333 cites W2077235131 @default.
- W2894554333 cites W2080221828 @default.
- W2894554333 cites W2085825757 @default.
- W2894554333 cites W2088588015 @default.
- W2894554333 cites W2091710329 @default.
- W2894554333 cites W2094778744 @default.
- W2894554333 cites W2097606916 @default.
- W2894554333 cites W2098129156 @default.
- W2894554333 cites W2107158607 @default.
- W2894554333 cites W2115089435 @default.
- W2894554333 cites W2120717049 @default.
- W2894554333 cites W2123076448 @default.
- W2894554333 cites W2128560492 @default.
- W2894554333 cites W2131322954 @default.
- W2894554333 cites W2139621307 @default.
- W2894554333 cites W2141335473 @default.
- W2894554333 cites W2153544371 @default.
- W2894554333 cites W2159959024 @default.
- W2894554333 cites W2174184943 @default.
- W2894554333 cites W2206815873 @default.
- W2894554333 cites W2211953232 @default.
- W2894554333 cites W4295216797 @default.
- W2894554333 cites W64749174 @default.
- W2894554333 doi "https://doi.org/10.1038/s41598-018-33433-8" @default.
- W2894554333 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/6181984" @default.
- W2894554333 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/30310110" @default.
- W2894554333 hasPublicationYear "2018" @default.
- W2894554333 type Work @default.
- W2894554333 sameAs 2894554333 @default.
- W2894554333 citedByCount "8" @default.
- W2894554333 countsByYear W28945543332019 @default.
- W2894554333 countsByYear W28945543332020 @default.
- W2894554333 countsByYear W28945543332021 @default.
- W2894554333 countsByYear W28945543332023 @default.
- W2894554333 crossrefType "journal-article" @default.
- W2894554333 hasAuthorship W2894554333A5016232823 @default.
- W2894554333 hasAuthorship W2894554333A5021021856 @default.
- W2894554333 hasAuthorship W2894554333A5035111563 @default.
- W2894554333 hasAuthorship W2894554333A5043804688 @default.
- W2894554333 hasAuthorship W2894554333A5062372955 @default.
- W2894554333 hasAuthorship W2894554333A5072526330 @default.
- W2894554333 hasAuthorship W2894554333A5077637095 @default.
- W2894554333 hasBestOaLocation W28945543331 @default.
- W2894554333 hasConcept C105795698 @default.
- W2894554333 hasConcept C121332964 @default.
- W2894554333 hasConcept C21308566 @default.
- W2894554333 hasConcept C24890656 @default.
- W2894554333 hasConcept C2779281246 @default.
- W2894554333 hasConcept C33923547 @default.
- W2894554333 hasConcept C45197812 @default.
- W2894554333 hasConcept C515207424 @default.
- W2894554333 hasConcept C54355233 @default.
- W2894554333 hasConcept C55493867 @default.
- W2894554333 hasConcept C70721500 @default.
- W2894554333 hasConcept C79337645 @default.
- W2894554333 hasConcept C86803240 @default.
- W2894554333 hasConceptScore W2894554333C105795698 @default.
- W2894554333 hasConceptScore W2894554333C121332964 @default.
- W2894554333 hasConceptScore W2894554333C21308566 @default.
- W2894554333 hasConceptScore W2894554333C24890656 @default.
- W2894554333 hasConceptScore W2894554333C2779281246 @default.
- W2894554333 hasConceptScore W2894554333C33923547 @default.
- W2894554333 hasConceptScore W2894554333C45197812 @default.
- W2894554333 hasConceptScore W2894554333C515207424 @default.
- W2894554333 hasConceptScore W2894554333C54355233 @default.
- W2894554333 hasConceptScore W2894554333C55493867 @default.
- W2894554333 hasConceptScore W2894554333C70721500 @default.
- W2894554333 hasConceptScore W2894554333C79337645 @default.
- W2894554333 hasConceptScore W2894554333C86803240 @default.
- W2894554333 hasIssue "1" @default.
- W2894554333 hasLocation W28945543331 @default.
- W2894554333 hasLocation W28945543332 @default.
- W2894554333 hasLocation W28945543333 @default.
- W2894554333 hasLocation W28945543334 @default.
- W2894554333 hasLocation W28945543335 @default.
- W2894554333 hasOpenAccess W2894554333 @default.
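
The abstract of this work describes counting every pentapeptide in known proteins and then comparing occurrences only among permutations of the same amino acid composition. A minimal Python sketch of that grouping step follows. It is an illustration under stated assumptions, not the authors' pipeline: the function names (`pentapeptide_counts`, `group_by_composition`, `distinct_permutations`) and the toy sequences are hypothetical, and the statistical test against the expected distribution is not reproduced here.

```python
from collections import Counter, defaultdict
from math import factorial


def pentapeptide_counts(sequences, k=5):
    """Count every overlapping window of length k across all sequences."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def group_by_composition(counts):
    """Group peptides that are permutations of the same residues.

    The group key is the sorted residue string, so 'MKVLA' and 'AKVLM'
    both land under 'AKLMV'.
    """
    groups = defaultdict(dict)
    for peptide, n in counts.items():
        groups["".join(sorted(peptide))][peptide] = n
    return groups


def distinct_permutations(composition):
    """Number of distinct orderings of a multiset of residues."""
    total = factorial(len(composition))
    for multiplicity in Counter(composition).values():
        total //= factorial(multiplicity)
    return total


# Toy input (hypothetical sequences, not real proteins).
toy_sequences = ["MKVLAAGKVLA", "AKVLMKVLA"]
counts = pentapeptide_counts(toy_sequences)
groups = group_by_composition(counts)

for key, members in sorted(groups.items()):
    print(key, distinct_permutations(key), members)
```

Comparing counts within a composition group, rather than across all pentapeptides, is what removes the effect of some residues simply being more common than others: every member of a group uses exactly the same residues, so any imbalance among members reflects residue order only.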