Matches in SemOpenAlex for { <https://semopenalex.org/work/W4210826824> ?p ?o ?g. }
- W4210826824 endingPage "n/a" @default.
- W4210826824 startingPage "n/a" @default.
- W4210826824 abstract "Elucidating the principles of sequence–structure relationships of proteins is a long-standing issue in biology. The nature of a short segment of a protein is determined by both the subsequence of the segment itself and its environment. For example, a type of subsequence, the so-called chameleon sequences, can form different secondary structures depending on its environments. Chameleon sequences are considered to have a weak tendency to form a specific structure. Although many chameleon sequences have been identified, they are only a small part of all possible subsequences in the proteome. The strength of the tendency to take a specific structure for each subsequence has not been fully quantified. In this study, we comprehensively analyzed subsequences consisting of four to nine amino acid residues, or N-gram (4≤N≤9), observed in non-redundant sequences in the Protein Data Bank (PDB). Tendencies to form a specific structure in terms of the secondary structure and accessible surface area are quantified as information quantities for each N-gram. Although the majority of observed subsequences have low information quantity due to lack of samples in the current PDB, thousands of N-grams with strong tendencies, including known structural motifs, were found. In addition, machine learning partially predicted the tendency of unknown N-grams, and thus, this technique helps to extract knowledge from the limited number of samples in the PDB." @default.
- W4210826824 created "2022-02-09" @default.
- W4210826824 creator A5030758619 @default.
- W4210826824 creator A5042029425 @default.
- W4210826824 creator A5063034206 @default.
- W4210826824 date "2022-01-01" @default.
- W4210826824 modified "2023-10-14" @default.
- W4210826824 title "Information quantity for secondary structure propensities of protein subsequences in the Protein Data Bank" @default.
- W4210826824 cites W1530397352 @default.
- W4210826824 cites W1600959429 @default.
- W4210826824 cites W1970829756 @default.
- W4210826824 cites W1976005460 @default.
- W4210826824 cites W1987328021 @default.
- W4210826824 cites W1990412107 @default.
- W4210826824 cites W1996073320 @default.
- W4210826824 cites W2003419027 @default.
- W4210826824 cites W2008708467 @default.
- W4210826824 cites W2014050554 @default.
- W4210826824 cites W2016381774 @default.
- W4210826824 cites W2021041804 @default.
- W4210826824 cites W2042705901 @default.
- W4210826824 cites W2045777307 @default.
- W4210826824 cites W2049695588 @default.
- W4210826824 cites W2062602834 @default.
- W4210826824 cites W2109872885 @default.
- W4210826824 cites W2111705855 @default.
- W4210826824 cites W2133142974 @default.
- W4210826824 cites W2141128950 @default.
- W4210826824 cites W2150306348 @default.
- W4210826824 cites W2154872292 @default.
- W4210826824 cites W2158266834 @default.
- W4210826824 cites W2162980545 @default.
- W4210826824 cites W2169054000 @default.
- W4210826824 cites W2170747616 @default.
- W4210826824 cites W2288234278 @default.
- W4210826824 cites W2413508140 @default.
- W4210826824 cites W2463717668 @default.
- W4210826824 cites W2567587907 @default.
- W4210826824 cites W2898320067 @default.
- W4210826824 cites W2904005668 @default.
- W4210826824 cites W4233044915 @default.
- W4210826824 doi "https://doi.org/10.2142/biophysico.bppb-v19.0002" @default.
- W4210826824 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/35532457" @default.
- W4210826824 hasPublicationYear "2022" @default.
- W4210826824 type Work @default.
- W4210826824 citedByCount "0" @default.
- W4210826824 crossrefType "journal-article" @default.
- W4210826824 hasAuthorship W4210826824A5030758619 @default.
- W4210826824 hasAuthorship W4210826824A5042029425 @default.
- W4210826824 hasAuthorship W4210826824A5063034206 @default.
- W4210826824 hasBestOaLocation W42108268241 @default.
- W4210826824 hasConcept C104397665 @default.
- W4210826824 hasConcept C119145174 @default.
- W4210826824 hasConcept C134306372 @default.
- W4210826824 hasConcept C137877099 @default.
- W4210826824 hasConcept C2778112365 @default.
- W4210826824 hasConcept C33923547 @default.
- W4210826824 hasConcept C34388435 @default.
- W4210826824 hasConcept C41008148 @default.
- W4210826824 hasConcept C47701112 @default.
- W4210826824 hasConcept C54355233 @default.
- W4210826824 hasConcept C55493867 @default.
- W4210826824 hasConcept C60644358 @default.
- W4210826824 hasConcept C62614982 @default.
- W4210826824 hasConcept C65556437 @default.
- W4210826824 hasConcept C70721500 @default.
- W4210826824 hasConcept C86803240 @default.
- W4210826824 hasConceptScore W4210826824C104397665 @default.
- W4210826824 hasConceptScore W4210826824C119145174 @default.
- W4210826824 hasConceptScore W4210826824C134306372 @default.
- W4210826824 hasConceptScore W4210826824C137877099 @default.
- W4210826824 hasConceptScore W4210826824C2778112365 @default.
- W4210826824 hasConceptScore W4210826824C33923547 @default.
- W4210826824 hasConceptScore W4210826824C34388435 @default.
- W4210826824 hasConceptScore W4210826824C41008148 @default.
- W4210826824 hasConceptScore W4210826824C47701112 @default.
- W4210826824 hasConceptScore W4210826824C54355233 @default.
- W4210826824 hasConceptScore W4210826824C55493867 @default.
- W4210826824 hasConceptScore W4210826824C60644358 @default.
- W4210826824 hasConceptScore W4210826824C62614982 @default.
- W4210826824 hasConceptScore W4210826824C65556437 @default.
- W4210826824 hasConceptScore W4210826824C70721500 @default.
- W4210826824 hasConceptScore W4210826824C86803240 @default.
- W4210826824 hasIssue "0" @default.
- W4210826824 hasLocation W42108268241 @default.
- W4210826824 hasLocation W42108268242 @default.
- W4210826824 hasLocation W42108268243 @default.
- W4210826824 hasLocation W42108268244 @default.
- W4210826824 hasOpenAccess W4210826824 @default.
- W4210826824 hasPrimaryLocation W42108268241 @default.
- W4210826824 hasRelatedWork W1979707045 @default.
- W4210826824 hasRelatedWork W2017505838 @default.
- W4210826824 hasRelatedWork W2045062364 @default.
- W4210826824 hasRelatedWork W2584773272 @default.
- W4210826824 hasRelatedWork W2728406523 @default.
- W4210826824 hasRelatedWork W2731937419 @default.
- W4210826824 hasRelatedWork W2732388178 @default.
- W4210826824 hasRelatedWork W4210826824 @default.