Matches in SemOpenAlex for { <https://semopenalex.org/work/W2973498290> ?p ?o ?g. }
- W2973498290 abstract "Statistical models for families of evolutionary related proteins have recently gained interest: In particular, pairwise Potts models as those inferred by the direct-coupling analysis have been able to extract information about the three-dimensional structure of folded proteins and about the effect of amino acid substitutions in proteins. These models are typically requested to reproduce the one- and two-point statistics of the amino acid usage in a protein family, i.e., to capture the so-called residue conservation and covariation statistics of proteins of common evolutionary origin. Pairwise Potts models are the maximum-entropy models achieving this. Although being successful, these models depend on huge numbers of ad hoc introduced parameters, which have to be estimated from finite amounts of data and whose biophysical interpretation remains unclear. Here, we propose an approach to parameter reduction, which is based on selecting collective sequence motifs. It naturally leads to the formulation of statistical sequence models in terms of Hopfield-Potts models. These models can be accurately inferred using a mapping to restricted Boltzmann machines and persistent contrastive divergence. We show that, when applied to protein data, even 20-40 patterns are sufficient to obtain statistically close-to-generative models. The Hopfield patterns form interpretable sequence motifs and may be used to clusterize amino acid sequences into functional subfamilies. However, the distributed collective nature of these motifs intrinsically limits the ability of Hopfield-Potts models in predicting contact maps, showing the necessity of developing models going beyond the Hopfield-Potts models discussed here." @default.
- W2973498290 created "2019-09-26" @default.
- W2973498290 creator A5035771024 @default.
- W2973498290 creator A5047371913 @default.
- W2973498290 date "2019-09-19" @default.
- W2973498290 modified "2023-10-18" @default.
- W2973498290 title "Selection of sequence motifs and generative Hopfield-Potts models for protein families" @default.
- W2973498290 cites W1861406683 @default.
- W2973498290 cites W1935237622 @default.
- W2973498290 cites W1951660422 @default.
- W2973498290 cites W1979762151 @default.
- W2973498290 cites W1985252748 @default.
- W2973498290 cites W1994261741 @default.
- W2973498290 cites W1995924392 @default.
- W2973498290 cites W2001438084 @default.
- W2973498290 cites W2008545402 @default.
- W2973498290 cites W2014706477 @default.
- W2973498290 cites W2031600071 @default.
- W2973498290 cites W2032558547 @default.
- W2973498290 cites W2051210555 @default.
- W2973498290 cites W2053671774 @default.
- W2973498290 cites W2057000344 @default.
- W2973498290 cites W2060809301 @default.
- W2973498290 cites W2061042699 @default.
- W2973498290 cites W2065921821 @default.
- W2973498290 cites W2092572492 @default.
- W2973498290 cites W2094782385 @default.
- W2973498290 cites W2100495367 @default.
- W2973498290 cites W2121444044 @default.
- W2973498290 cites W2125150881 @default.
- W2973498290 cites W2137566700 @default.
- W2973498290 cites W2140697343 @default.
- W2973498290 cites W2151457629 @default.
- W2973498290 cites W2151831732 @default.
- W2973498290 cites W2166701319 @default.
- W2973498290 cites W2169478909 @default.
- W2973498290 cites W2224056471 @default.
- W2973498290 cites W2284132962 @default.
- W2973498290 cites W2416642098 @default.
- W2973498290 cites W2470360319 @default.
- W2973498290 cites W2574496196 @default.
- W2973498290 cites W2593619857 @default.
- W2973498290 cites W2766246727 @default.
- W2973498290 cites W2772248591 @default.
- W2973498290 cites W2783644078 @default.
- W2973498290 cites W2949706399 @default.
- W2973498290 cites W2963640180 @default.
- W2973498290 cites W3100163799 @default.
- W2973498290 cites W3102678975 @default.
- W2973498290 cites W3136918052 @default.
- W2973498290 cites W4211156111 @default.
- W2973498290 cites W4245668478 @default.
- W2973498290 doi "https://doi.org/10.1103/physreve.100.032128" @default.
- W2973498290 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/31639992" @default.
- W2973498290 hasPublicationYear "2019" @default.
- W2973498290 type Work @default.
- W2973498290 sameAs 2973498290 @default.
- W2973498290 citedByCount "22" @default.
- W2973498290 countsByYear W29734982902019 @default.
- W2973498290 countsByYear W29734982902020 @default.
- W2973498290 countsByYear W29734982902021 @default.
- W2973498290 countsByYear W29734982902022 @default.
- W2973498290 countsByYear W29734982902023 @default.
- W2973498290 crossrefType "journal-article" @default.
- W2973498290 hasAuthorship W2973498290A5035771024 @default.
- W2973498290 hasAuthorship W2973498290A5047371913 @default.
- W2973498290 hasBestOaLocation W29734982902 @default.
- W2973498290 hasConcept C121332964 @default.
- W2973498290 hasConcept C121864883 @default.
- W2973498290 hasConcept C154945302 @default.
- W2973498290 hasConcept C167966045 @default.
- W2973498290 hasConcept C184898388 @default.
- W2973498290 hasConcept C192576344 @default.
- W2973498290 hasConcept C2778112365 @default.
- W2973498290 hasConcept C39890363 @default.
- W2973498290 hasConcept C41008148 @default.
- W2973498290 hasConcept C50644808 @default.
- W2973498290 hasConcept C51329190 @default.
- W2973498290 hasConcept C54355233 @default.
- W2973498290 hasConcept C70721500 @default.
- W2973498290 hasConcept C86803240 @default.
- W2973498290 hasConcept C93959086 @default.
- W2973498290 hasConcept C98925819 @default.
- W2973498290 hasConceptScore W2973498290C121332964 @default.
- W2973498290 hasConceptScore W2973498290C121864883 @default.
- W2973498290 hasConceptScore W2973498290C154945302 @default.
- W2973498290 hasConceptScore W2973498290C167966045 @default.
- W2973498290 hasConceptScore W2973498290C184898388 @default.
- W2973498290 hasConceptScore W2973498290C192576344 @default.
- W2973498290 hasConceptScore W2973498290C2778112365 @default.
- W2973498290 hasConceptScore W2973498290C39890363 @default.
- W2973498290 hasConceptScore W2973498290C41008148 @default.
- W2973498290 hasConceptScore W2973498290C50644808 @default.
- W2973498290 hasConceptScore W2973498290C51329190 @default.
- W2973498290 hasConceptScore W2973498290C54355233 @default.
- W2973498290 hasConceptScore W2973498290C70721500 @default.
- W2973498290 hasConceptScore W2973498290C86803240 @default.
- W2973498290 hasConceptScore W2973498290C93959086 @default.
- W2973498290 hasConceptScore W2973498290C98925819 @default.
- W2973498290 hasFunder F4320323538 @default.