Matches in SemOpenAlex for { <https://semopenalex.org/work/W4317716364> ?p ?o ?g. }
- W4317716364 abstract "Motivation: Evolutionary inferences depend crucially on the quality of multiple sequence alignments (MSA), which is problematic for distantly related proteins. Since protein structure is more conserved than protein sequence, it seems natural to use structure alignments for distant homologs. However, structure alignments may not be suitable for inferring evolutionary relationships at the sequence level. Results: Here we investigate the mutual relationships between four protein similarity measures that depend on sequence and structure (fraction of aligned residues, sequence similarity, fraction of superimposed backbones and contact overlap) and the corresponding alignments. Changes in protein sequences and structures are intimately correlated, but our results suggest that no individual measure can provide a complete and unbiased picture of changes in protein sequences and structure. Therefore, we propose a new hybrid measure of protein sequence and structure similarity based on Principal Components (PC_sim). Starting from an MSA, we obtain modified pairwise alignments (PA) based on PC_sim, and from them we construct a new MSA based on the maximal cliques of the PA graph. These alignments yield larger protein similarities and agree better with the Balibase “reference” MSA and with consensus MSA than alignments that target individual similarity measures. Moreover, PC_sim is associated with a divergence measure that correlates strongest with divergences obtained from individual similarities, which suggests that it can infer more accurate evolutionary divergences for the reconstruction of phylogenetic trees with distance methods. Availability: https://github.com/ugobas/Evol_div Contact: ubastolla@cbm.csic.es" @default.
- W4317716364 created "2023-01-23" @default.
- W4317716364 creator A5013334733 @default.
- W4317716364 creator A5023588121 @default.
- W4317716364 creator A5025838254 @default.
- W4317716364 date "2023-01-22" @default.
- W4317716364 modified "2023-09-26" @default.
- W4317716364 title "PC_sim: An integrated measure of protein sequence and structure similarity for improved alignments and evolutionary inference" @default.
- W4317716364 cites W1842186545 @default.
- W4317716364 cites W1977538576 @default.
- W4317716364 cites W1979762151 @default.
- W4317716364 cites W1992579032 @default.
- W4317716364 cites W1997631042 @default.
- W4317716364 cites W2022058405 @default.
- W4317716364 cites W2041338685 @default.
- W4317716364 cites W2065921821 @default.
- W4317716364 cites W2066142272 @default.
- W4317716364 cites W2067068892 @default.
- W4317716364 cites W2069777230 @default.
- W4317716364 cites W2085488365 @default.
- W4317716364 cites W2097270746 @default.
- W4317716364 cites W2097767833 @default.
- W4317716364 cites W2100561059 @default.
- W4317716364 cites W2102245393 @default.
- W4317716364 cites W2107237710 @default.
- W4317716364 cites W2108067237 @default.
- W4317716364 cites W2109580010 @default.
- W4317716364 cites W2117486996 @default.
- W4317716364 cites W2127322768 @default.
- W4317716364 cites W2127688908 @default.
- W4317716364 cites W2127860913 @default.
- W4317716364 cites W2130479394 @default.
- W4317716364 cites W2131408740 @default.
- W4317716364 cites W2132926880 @default.
- W4317716364 cites W2140771555 @default.
- W4317716364 cites W2140889378 @default.
- W4317716364 cites W2141411672 @default.
- W4317716364 cites W2144362290 @default.
- W4317716364 cites W2150566133 @default.
- W4317716364 cites W2157661039 @default.
- W4317716364 cites W2160378127 @default.
- W4317716364 cites W2161151688 @default.
- W4317716364 cites W2171641243 @default.
- W4317716364 cites W2266439690 @default.
- W4317716364 cites W2938574745 @default.
- W4317716364 cites W2941366761 @default.
- W4317716364 cites W2950873526 @default.
- W4317716364 cites W3177828909 @default.
- W4317716364 cites W4256395558 @default.
- W4317716364 doi "https://doi.org/10.1101/2023.01.22.525078" @default.
- W4317716364 hasPublicationYear "2023" @default.
- W4317716364 type Work @default.
- W4317716364 citedByCount "0" @default.
- W4317716364 crossrefType "posted-content" @default.
- W4317716364 hasAuthorship W4317716364A5013334733 @default.
- W4317716364 hasAuthorship W4317716364A5023588121 @default.
- W4317716364 hasAuthorship W4317716364A5025838254 @default.
- W4317716364 hasBestOaLocation W43177163641 @default.
- W4317716364 hasConcept C10010492 @default.
- W4317716364 hasConcept C103278499 @default.
- W4317716364 hasConcept C104317684 @default.
- W4317716364 hasConcept C115961682 @default.
- W4317716364 hasConcept C124101348 @default.
- W4317716364 hasConcept C138885662 @default.
- W4317716364 hasConcept C154945302 @default.
- W4317716364 hasConcept C167625842 @default.
- W4317716364 hasConcept C171897839 @default.
- W4317716364 hasConcept C180384323 @default.
- W4317716364 hasConcept C184898388 @default.
- W4317716364 hasConcept C193252679 @default.
- W4317716364 hasConcept C207390915 @default.
- W4317716364 hasConcept C2776214188 @default.
- W4317716364 hasConcept C2776517306 @default.
- W4317716364 hasConcept C2778112365 @default.
- W4317716364 hasConcept C2780009758 @default.
- W4317716364 hasConcept C41008148 @default.
- W4317716364 hasConcept C41895202 @default.
- W4317716364 hasConcept C45484198 @default.
- W4317716364 hasConcept C4668613 @default.
- W4317716364 hasConcept C54355233 @default.
- W4317716364 hasConcept C70721500 @default.
- W4317716364 hasConcept C78458016 @default.
- W4317716364 hasConcept C86803240 @default.
- W4317716364 hasConcept C88031987 @default.
- W4317716364 hasConceptScore W4317716364C10010492 @default.
- W4317716364 hasConceptScore W4317716364C103278499 @default.
- W4317716364 hasConceptScore W4317716364C104317684 @default.
- W4317716364 hasConceptScore W4317716364C115961682 @default.
- W4317716364 hasConceptScore W4317716364C124101348 @default.
- W4317716364 hasConceptScore W4317716364C138885662 @default.
- W4317716364 hasConceptScore W4317716364C154945302 @default.
- W4317716364 hasConceptScore W4317716364C167625842 @default.
- W4317716364 hasConceptScore W4317716364C171897839 @default.
- W4317716364 hasConceptScore W4317716364C180384323 @default.
- W4317716364 hasConceptScore W4317716364C184898388 @default.
- W4317716364 hasConceptScore W4317716364C193252679 @default.
- W4317716364 hasConceptScore W4317716364C207390915 @default.
- W4317716364 hasConceptScore W4317716364C2776214188 @default.
- W4317716364 hasConceptScore W4317716364C2776517306 @default.
- W4317716364 hasConceptScore W4317716364C2778112365 @default.