Matches in SemOpenAlex for { <https://semopenalex.org/work/W2031141877> ?p ?o ?g. }
- W2031141877 abstract "Protein sequence data is abundant, yet derivation of structural features from sequence alone is generally restricted to prediction of domain architecture, secondary structure elements and motifs. Precise feature boundaries cannot be determined reliably, and it is unknown to what extent these features constitute fundamental building blocks of protein sequences, a question with particular relevance to protein folding. Here we propose a statistical approach using mutual information, a measure of association, to predict feature boundaries. In this approach, proteins are viewed as strings of adjacent, non-overlapping features, where each feature is a subsequence of the protein, and the union of the features is the entire protein. Mutual information values are measured between nearby amino acids along sequences, and low values are indicators for feature boundaries. These boundaries are then predicted using a flexible partitioning algorithm. The algorithms presented in this paper were tested on the GPCR protein family and subfamilies. A comparison with segment boundaries implied indirectly from secondary structure prediction and expert knowledge demonstrates that the algorithm can be used to statistically predict feature positions in protein sequences generically, without assumptions on the feature type to be detected. The data used and the algorithms presented in this paper are available at flan.blm.cs.cmu.edu." @default.
- W2031141877 created "2016-06-24" @default.
- W2031141877 creator A5065265042 @default.
- W2031141877 creator A5077128720 @default.
- W2031141877 date "2004-01-01" @default.
- W2031141877 modified "2023-09-26" @default.
- W2031141877 title "Identification of fundamental building blocks in protein sequences using statistical association measures" @default.
- W2031141877 cites W1539255723 @default.
- W2031141877 cites W1554807519 @default.
- W2031141877 cites W1729771463 @default.
- W2031141877 cites W1971024387 @default.
- W2031141877 cites W1975304761 @default.
- W2031141877 cites W1985546344 @default.
- W2031141877 cites W1985899857 @default.
- W2031141877 cites W2016014327 @default.
- W2031141877 cites W2017431704 @default.
- W2031141877 cites W2026907061 @default.
- W2031141877 cites W2028761382 @default.
- W2031141877 cites W2034274945 @default.
- W2031141877 cites W2054333829 @default.
- W2031141877 cites W2055043387 @default.
- W2031141877 cites W2057289558 @default.
- W2031141877 cites W2059846299 @default.
- W2031141877 cites W2073110681 @default.
- W2031141877 cites W2074673068 @default.
- W2031141877 cites W2078122870 @default.
- W2031141877 cites W2078149659 @default.
- W2031141877 cites W2089606003 @default.
- W2031141877 cites W2092023575 @default.
- W2031141877 cites W2097365951 @default.
- W2031141877 cites W2100271852 @default.
- W2031141877 cites W2102471334 @default.
- W2031141877 cites W2104972430 @default.
- W2031141877 cites W2110190189 @default.
- W2031141877 cites W2114083522 @default.
- W2031141877 cites W2117919289 @default.
- W2031141877 cites W2120684043 @default.
- W2031141877 cites W2121082582 @default.
- W2031141877 cites W2123832393 @default.
- W2031141877 cites W2126405180 @default.
- W2031141877 cites W2132062642 @default.
- W2031141877 cites W2146691282 @default.
- W2031141877 cites W2149963584 @default.
- W2031141877 cites W2152303410 @default.
- W2031141877 cites W2152770371 @default.
- W2031141877 cites W2158906453 @default.
- W2031141877 cites W2161911785 @default.
- W2031141877 cites W2170471837 @default.
- W2031141877 cites W2421648494 @default.
- W2031141877 cites W262216995 @default.
- W2031141877 cites W51161921 @default.
- W2031141877 cites W72690559 @default.
- W2031141877 doi "https://doi.org/10.1145/967900.967933" @default.
- W2031141877 hasPublicationYear "2004" @default.
- W2031141877 type Work @default.
- W2031141877 sameAs 2031141877 @default.
- W2031141877 citedByCount "8" @default.
- W2031141877 crossrefType "proceedings-article" @default.
- W2031141877 hasAuthorship W2031141877A5065265042 @default.
- W2031141877 hasAuthorship W2031141877A5077128720 @default.
- W2031141877 hasConcept C104317684 @default.
- W2031141877 hasConcept C124101348 @default.
- W2031141877 hasConcept C134306372 @default.
- W2031141877 hasConcept C137877099 @default.
- W2031141877 hasConcept C138885662 @default.
- W2031141877 hasConcept C144292202 @default.
- W2031141877 hasConcept C152139883 @default.
- W2031141877 hasConcept C153180895 @default.
- W2031141877 hasConcept C154945302 @default.
- W2031141877 hasConcept C2776401178 @default.
- W2031141877 hasConcept C2778112365 @default.
- W2031141877 hasConcept C33923547 @default.
- W2031141877 hasConcept C34388435 @default.
- W2031141877 hasConcept C41008148 @default.
- W2031141877 hasConcept C41895202 @default.
- W2031141877 hasConcept C47701112 @default.
- W2031141877 hasConcept C54355233 @default.
- W2031141877 hasConcept C55493867 @default.
- W2031141877 hasConcept C58773245 @default.
- W2031141877 hasConcept C86803240 @default.
- W2031141877 hasConceptScore W2031141877C104317684 @default.
- W2031141877 hasConceptScore W2031141877C124101348 @default.
- W2031141877 hasConceptScore W2031141877C134306372 @default.
- W2031141877 hasConceptScore W2031141877C137877099 @default.
- W2031141877 hasConceptScore W2031141877C138885662 @default.
- W2031141877 hasConceptScore W2031141877C144292202 @default.
- W2031141877 hasConceptScore W2031141877C152139883 @default.
- W2031141877 hasConceptScore W2031141877C153180895 @default.
- W2031141877 hasConceptScore W2031141877C154945302 @default.
- W2031141877 hasConceptScore W2031141877C2776401178 @default.
- W2031141877 hasConceptScore W2031141877C2778112365 @default.
- W2031141877 hasConceptScore W2031141877C33923547 @default.
- W2031141877 hasConceptScore W2031141877C34388435 @default.
- W2031141877 hasConceptScore W2031141877C41008148 @default.
- W2031141877 hasConceptScore W2031141877C41895202 @default.
- W2031141877 hasConceptScore W2031141877C47701112 @default.
- W2031141877 hasConceptScore W2031141877C54355233 @default.
- W2031141877 hasConceptScore W2031141877C55493867 @default.
- W2031141877 hasConceptScore W2031141877C58773245 @default.
- W2031141877 hasConceptScore W2031141877C86803240 @default.
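
The abstract above describes measuring mutual information between nearby amino acids and treating low values as indicators of feature boundaries. A minimal Python sketch of that idea follows. It is not the authors' method. It assumes equal-length (for example, pre-aligned) sequences, compares only immediately adjacent positions, and substitutes a plain threshold for the paper's flexible partitioning algorithm, which the abstract does not specify. The function names and the toy sequences are hypothetical.

```python
import math
from collections import Counter


def mutual_information(pairs):
    """Mutual information, in bits, of a list of (a, b) symbol pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)
    right = Counter(b for _, b in pairs)
    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        # p_ab / (p_a * p_b), written with raw counts
        mi += p_ab * math.log2(p_ab * n * n / (left[a] * right[b]))
    return mi


def adjacent_mi(sequences):
    """MI between positions i and i+1, estimated across all sequences.

    Assumption: every sequence has the same length.
    """
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("sequences must have equal length")
    return [
        mutual_information([(s[i], s[i + 1]) for s in sequences])
        for i in range(length - 1)
    ]


def candidate_boundaries(mi_values, threshold):
    """Indices i where MI between positions i and i+1 is below threshold.

    Assumption: a simple cutoff stands in for the paper's partitioning step.
    """
    return [i for i, value in enumerate(mi_values) if value < threshold]


# Toy data (hypothetical): positions 0-1 co-vary, positions 2-3 co-vary,
# and the two halves are independent of each other.
toy = ["AAGG", "CCGG", "AATT", "CCTT"]
mi = adjacent_mi(toy)
print(mi)                              # [1.0, 0.0, 1.0]
print(candidate_boundaries(mi, 0.5))   # [1]
```

In the toy data the only low value falls between positions 1 and 2, so that is the single candidate boundary. On real protein families the estimates depend heavily on sample size and on the chosen threshold, so the output is only a rough indication.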