Matches in SemOpenAlex for { <https://semopenalex.org/work/W2119409375> ?p ?o ?g. }
- W2119409375 endingPage "e201302010" @default.
- W2119409375 startingPage "e201302010" @default.
- W2119409375 abstract "Protein structure and function information is coded in amino acid sequences. However, the relationship between primary sequences and three-dimensional structures and functions remains enigmatic. Our approach to this fundamental biochemistry problem is based on the frequencies of short constituent sequences (SCSs) or words. A protein amino acid sequence is considered analogous to an English sentence, where SCSs are equivalent to words. Availability scores, which are defined as real SCS frequencies in the non-redundant amino acid database relative to their probabilistically expected frequencies, demonstrate the biological usage bias of SCSs. As a result, this frequency-based linguistic approach is expected to have diverse applications, such as secondary structure specifications by structure-specific SCSs and immunological adjuvants with rare or non-existent SCSs. Linguistic similarities (e.g., wide ranges of scale-free distributions) and dissimilarities (e.g., behaviors of low-rank samples) between proteins and the natural English language have been revealed in the rank-frequency relationships of SCSs or words. We have developed a web server, the SCS Package, which contains five applications for analyzing protein sequences based on the linguistic concept. These tools have the potential to assist researchers in deciphering structurally and functionally important protein sites, species-specific sequences, and functional relationships between SCSs. The SCS Package also provides researchers with a tool to construct amino acid sequences de novo based on the idiomatic usage of SCSs." @default.
- W2119409375 created "2016-06-24" @default.
- W2119409375 creator A5025591755 @default.
- W2119409375 creator A5041964842 @default.
- W2119409375 creator A5066616566 @default.
- W2119409375 date "2013-02-01" @default.
- W2119409375 modified "2023-10-17" @default.
- W2119409375 title "A FREQUENCY-BASED LINGUISTIC APPROACH TO PROTEIN DECODING AND DESIGN: SIMPLE CONCEPTS, DIVERSE APPLICATIONS, AND THE SCS PACKAGE" @default.
- W2119409375 cites W1594390190 @default.
- W2119409375 cites W1969474339 @default.
- W2119409375 cites W1975304761 @default.
- W2119409375 cites W1978172310 @default.
- W2119409375 cites W1980176289 @default.
- W2119409375 cites W1985854865 @default.
- W2119409375 cites W1990412107 @default.
- W2119409375 cites W1991216560 @default.
- W2119409375 cites W1991771602 @default.
- W2119409375 cites W1999545669 @default.
- W2119409375 cites W2002566401 @default.
- W2119409375 cites W2003938648 @default.
- W2119409375 cites W2006465073 @default.
- W2119409375 cites W2009047999 @default.
- W2119409375 cites W2010904931 @default.
- W2119409375 cites W2013460486 @default.
- W2119409375 cites W2018879187 @default.
- W2119409375 cites W2028913470 @default.
- W2119409375 cites W2030124080 @default.
- W2119409375 cites W2045777307 @default.
- W2119409375 cites W2050102340 @default.
- W2119409375 cites W2050139756 @default.
- W2119409375 cites W2055214702 @default.
- W2119409375 cites W2056366317 @default.
- W2119409375 cites W2060321728 @default.
- W2119409375 cites W2071996119 @default.
- W2119409375 cites W2078778136 @default.
- W2119409375 cites W2078963930 @default.
- W2119409375 cites W2089909578 @default.
- W2119409375 cites W2094931921 @default.
- W2119409375 cites W2127479412 @default.
- W2119409375 cites W2130479394 @default.
- W2119409375 cites W2135768017 @default.
- W2119409375 cites W2140992384 @default.
- W2119409375 cites W2141111918 @default.
- W2119409375 cites W2141335473 @default.
- W2119409375 cites W2158714788 @default.
- W2119409375 cites W2169856792 @default.
- W2119409375 cites W2171963266 @default.
- W2119409375 cites W2207671224 @default.
- W2119409375 cites W3047703793 @default.
- W2119409375 doi "https://doi.org/10.5936/csbj.201302010" @default.
- W2119409375 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/3962227" @default.
- W2119409375 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/24688703" @default.
- W2119409375 hasPublicationYear "2013" @default.
- W2119409375 type Work @default.
- W2119409375 sameAs 2119409375 @default.
- W2119409375 citedByCount "10" @default.
- W2119409375 countsByYear W21194093752014 @default.
- W2119409375 countsByYear W21194093752015 @default.
- W2119409375 countsByYear W21194093752017 @default.
- W2119409375 countsByYear W21194093752020 @default.
- W2119409375 countsByYear W21194093752021 @default.
- W2119409375 countsByYear W21194093752022 @default.
- W2119409375 crossrefType "journal-article" @default.
- W2119409375 hasAuthorship W2119409375A5025591755 @default.
- W2119409375 hasAuthorship W2119409375A5041964842 @default.
- W2119409375 hasAuthorship W2119409375A5066616566 @default.
- W2119409375 hasBestOaLocation W21194093751 @default.
- W2119409375 hasConcept C111472728 @default.
- W2119409375 hasConcept C11413529 @default.
- W2119409375 hasConcept C114614502 @default.
- W2119409375 hasConcept C138885662 @default.
- W2119409375 hasConcept C14036430 @default.
- W2119409375 hasConcept C164226766 @default.
- W2119409375 hasConcept C199360897 @default.
- W2119409375 hasConcept C204321447 @default.
- W2119409375 hasConcept C2777530160 @default.
- W2119409375 hasConcept C2778112365 @default.
- W2119409375 hasConcept C2780586882 @default.
- W2119409375 hasConcept C2780801425 @default.
- W2119409375 hasConcept C33923547 @default.
- W2119409375 hasConcept C41008148 @default.
- W2119409375 hasConcept C54355233 @default.
- W2119409375 hasConcept C57273362 @default.
- W2119409375 hasConcept C70721500 @default.
- W2119409375 hasConcept C86803240 @default.
- W2119409375 hasConceptScore W2119409375C111472728 @default.
- W2119409375 hasConceptScore W2119409375C11413529 @default.
- W2119409375 hasConceptScore W2119409375C114614502 @default.
- W2119409375 hasConceptScore W2119409375C138885662 @default.
- W2119409375 hasConceptScore W2119409375C14036430 @default.
- W2119409375 hasConceptScore W2119409375C164226766 @default.
- W2119409375 hasConceptScore W2119409375C199360897 @default.
- W2119409375 hasConceptScore W2119409375C204321447 @default.
- W2119409375 hasConceptScore W2119409375C2777530160 @default.
- W2119409375 hasConceptScore W2119409375C2778112365 @default.
- W2119409375 hasConceptScore W2119409375C2780586882 @default.
- W2119409375 hasConceptScore W2119409375C2780801425 @default.
- W2119409375 hasConceptScore W2119409375C33923547 @default.