Matches in SemOpenAlex for { <https://semopenalex.org/work/W2625898396> ?p ?o ?g. }
- W2625898396 abstract "DNA-binding proteins perform important functions in a great number of biological activities. DNA-binding proteins can interact with ssDNA (single-stranded DNA) or dsDNA (double-stranded DNA), and DNA-binding proteins can be categorized as single-stranded DNA-binding proteins (SSBs) and double-stranded DNA-binding proteins (DSBs). The identification of DNA-binding proteins from amino acid sequences can help to annotate protein functions and understand the binding specificity. In this study, we systematically consider a variety of schemes to represent protein sequences: OAAC (overall amino acid composition) features, dipeptide compositions, PSSM (position-specific scoring matrix profiles) and split amino acid composition (SAA), and then we adopt SVM (support vector machine) and RF (random forest) classification model to distinguish SSBs from DSBs.Our results suggest that some sequence features can significantly differentiate DSBs and SSBs. Evaluated by 10 fold cross-validation on the benchmark datasets, our prediction method can achieve the accuracy of 88.7% and AUC (area under the curve) of 0.919. Moreover, our method has good performance in independent testing.Using various sequence-derived features, a novel method is proposed to distinguish DSBs and SSBs accurately. The method also explores novel features, which could be helpful to discover the binding specificity of DNA-binding proteins." @default.
- W2625898396 created "2017-06-23" @default.
- W2625898396 creator A5009032111 @default.
- W2625898396 creator A5021696313 @default.
- W2625898396 creator A5054667546 @default.
- W2625898396 creator A5058970337 @default.
- W2625898396 creator A5068925622 @default.
- W2625898396 creator A5072476383 @default.
- W2625898396 creator A5082201226 @default.
- W2625898396 date "2017-06-12" @default.
- W2625898396 modified "2023-10-11" @default.
- W2625898396 title "Analysis and prediction of single-stranded and double-stranded DNA binding proteins based on protein sequences" @default.
- W2625898396 cites W1491553314 @default.
- W2625898396 cites W1543773936 @default.
- W2625898396 cites W1559579542 @default.
- W2625898396 cites W1581453124 @default.
- W2625898396 cites W17408266 @default.
- W2625898396 cites W1818938668 @default.
- W2625898396 cites W1966427711 @default.
- W2625898396 cites W1974225312 @default.
- W2625898396 cites W1980121140 @default.
- W2625898396 cites W1985582731 @default.
- W2625898396 cites W1986088879 @default.
- W2625898396 cites W1987667986 @default.
- W2625898396 cites W1994977138 @default.
- W2625898396 cites W1998518361 @default.
- W2625898396 cites W2012627860 @default.
- W2625898396 cites W2016051619 @default.
- W2625898396 cites W2016789453 @default.
- W2625898396 cites W2022630461 @default.
- W2625898396 cites W2024572834 @default.
- W2625898396 cites W2026690931 @default.
- W2625898396 cites W2042459925 @default.
- W2625898396 cites W2048851100 @default.
- W2625898396 cites W2051808158 @default.
- W2625898396 cites W2052490288 @default.
- W2625898396 cites W2058918675 @default.
- W2625898396 cites W2061892659 @default.
- W2625898396 cites W2071286719 @default.
- W2625898396 cites W2079230093 @default.
- W2625898396 cites W2079741453 @default.
- W2625898396 cites W2081998091 @default.
- W2625898396 cites W2083812665 @default.
- W2625898396 cites W2084985871 @default.
- W2625898396 cites W2087518504 @default.
- W2625898396 cites W2090557416 @default.
- W2625898396 cites W2090561245 @default.
- W2625898396 cites W2097606916 @default.
- W2625898396 cites W2108733892 @default.
- W2625898396 cites W2112585401 @default.
- W2625898396 cites W2114669670 @default.
- W2625898396 cites W2117519369 @default.
- W2625898396 cites W2118933796 @default.
- W2625898396 cites W2129410760 @default.
- W2625898396 cites W2134923281 @default.
- W2625898396 cites W2135217050 @default.
- W2625898396 cites W2136637035 @default.
- W2625898396 cites W2136975656 @default.
- W2625898396 cites W2150859192 @default.
- W2625898396 cites W2155770902 @default.
- W2625898396 cites W2158714788 @default.
- W2625898396 cites W2170560345 @default.
- W2625898396 cites W2234451305 @default.
- W2625898396 cites W2317960638 @default.
- W2625898396 cites W2339812959 @default.
- W2625898396 cites W2508414719 @default.
- W2625898396 cites W2570516417 @default.
- W2625898396 doi "https://doi.org/10.1186/s12859-017-1715-8" @default.
- W2625898396 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/5469069" @default.
- W2625898396 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/28606086" @default.
- W2625898396 hasPublicationYear "2017" @default.
- W2625898396 type Work @default.
- W2625898396 sameAs 2625898396 @default.
- W2625898396 citedByCount "11" @default.
- W2625898396 countsByYear W26258983962017 @default.
- W2625898396 countsByYear W26258983962018 @default.
- W2625898396 countsByYear W26258983962019 @default.
- W2625898396 countsByYear W26258983962020 @default.
- W2625898396 countsByYear W26258983962021 @default.
- W2625898396 countsByYear W26258983962022 @default.
- W2625898396 countsByYear W26258983962023 @default.
- W2625898396 crossrefType "journal-article" @default.
- W2625898396 hasAuthorship W2625898396A5009032111 @default.
- W2625898396 hasAuthorship W2625898396A5021696313 @default.
- W2625898396 hasAuthorship W2625898396A5054667546 @default.
- W2625898396 hasAuthorship W2625898396A5058970337 @default.
- W2625898396 hasAuthorship W2625898396A5068925622 @default.
- W2625898396 hasAuthorship W2625898396A5072476383 @default.
- W2625898396 hasAuthorship W2625898396A5082201226 @default.
- W2625898396 hasBestOaLocation W26258983961 @default.
- W2625898396 hasConcept C101762097 @default.
- W2625898396 hasConcept C104317684 @default.
- W2625898396 hasConcept C119857082 @default.
- W2625898396 hasConcept C12267149 @default.
- W2625898396 hasConcept C129312508 @default.
- W2625898396 hasConcept C150194340 @default.
- W2625898396 hasConcept C3662595 @default.
- W2625898396 hasConcept C41008148 @default.
- W2625898396 hasConcept C51679486 @default.
- W2625898396 hasConcept C5179208 @default.