Matches in SemOpenAlex for { <https://semopenalex.org/work/W2896720233> ?p ?o ?g. }
- W2896720233 endingPage "66556" @default.
- W2896720233 startingPage "66545" @default.
- W2896720233 abstract "DNA-binding proteins play critical roles in various cellular biological processes, such as gene expression and transcription. However, the experimental methods to identify these proteins like ChIP-sequencing are expensive and time-consuming, which presents the need for in silico methods, especially machine learning-based methods. In recent years, the accuracy of machine learning-based DNA-binding protein prediction has been increasing significantly. However, there are still some critical problems to be solved like how to convert protein sequences into an appropriate discrete model or vector. In this paper, we propose a novel feature construction method based on a position-specific scoring matrix (PSSM) named K-PSSM-Composition. The proposed features can efficiently capture the information about 20 amino acid residues and the local information of a given sequence during the evolutionary process. We perform a recursive feature elimination to extract the optimal set of features, which are used to train the support vector machine model for predicting DNA-binding proteins. We evaluate and compare our proposed predictor with other advanced predictors via two standard benchmark data sets. The proposed method achieves the accuracy values of 89.77% and 88.71% for the jackknife test and independent test respectively, outperforming the compared methods. This finding demonstrates the efficacy and effectiveness of the proposed method in predicting the DNA-binding proteins. The source code and data are available at https://github.com/Excelsior511/DNA-Binding-Proteins ." @default.
- W2896720233 created "2018-10-26" @default.
- W2896720233 creator A5007339758 @default.
- W2896720233 creator A5009921747 @default.
- W2896720233 creator A5028692805 @default.
- W2896720233 creator A5031510017 @default.
- W2896720233 creator A5044283271 @default.
- W2896720233 creator A5055132508 @default.
- W2896720233 date "2018-01-01" @default.
- W2896720233 modified "2023-10-16" @default.
- W2896720233 title "Improved DNA-Binding Protein Identification by Incorporating Evolutionary Information Into the Chou’s PseAAC" @default.
- W2896720233 cites W1494484168 @default.
- W2896720233 cites W1892469892 @default.
- W2896720233 cites W1974480392 @default.
- W2896720233 cites W1981091069 @default.
- W2896720233 cites W1991141438 @default.
- W2896720233 cites W1992116297 @default.
- W2896720233 cites W2010688088 @default.
- W2896720233 cites W2016579482 @default.
- W2896720233 cites W2020089616 @default.
- W2896720233 cites W2027364181 @default.
- W2896720233 cites W2028549892 @default.
- W2896720233 cites W2030922238 @default.
- W2896720233 cites W2034070267 @default.
- W2896720233 cites W2038873127 @default.
- W2896720233 cites W2061293154 @default.
- W2896720233 cites W2061680337 @default.
- W2896720233 cites W2061833373 @default.
- W2896720233 cites W2062296203 @default.
- W2896720233 cites W2070980386 @default.
- W2896720233 cites W2079741453 @default.
- W2896720233 cites W2090561245 @default.
- W2896720233 cites W2094764962 @default.
- W2896720233 cites W2096223584 @default.
- W2896720233 cites W2102551551 @default.
- W2896720233 cites W2104707079 @default.
- W2896720233 cites W2106141559 @default.
- W2896720233 cites W2114024619 @default.
- W2896720233 cites W2114042714 @default.
- W2896720233 cites W2114535505 @default.
- W2896720233 cites W2124266758 @default.
- W2896720233 cites W2132292391 @default.
- W2896720233 cites W2136637035 @default.
- W2896720233 cites W2137565410 @default.
- W2896720233 cites W2138045300 @default.
- W2896720233 cites W2138769522 @default.
- W2896720233 cites W2143426320 @default.
- W2896720233 cites W2144347309 @default.
- W2896720233 cites W2145957695 @default.
- W2896720233 cites W2155601193 @default.
- W2896720233 cites W2156332695 @default.
- W2896720233 cites W2156690214 @default.
- W2896720233 cites W2158714788 @default.
- W2896720233 cites W2161621183 @default.
- W2896720233 cites W2167666169 @default.
- W2896720233 cites W2173801226 @default.
- W2896720233 cites W2196507012 @default.
- W2896720233 cites W2248769182 @default.
- W2896720233 cites W2278741011 @default.
- W2896720233 cites W2313411748 @default.
- W2896720233 cites W2322691988 @default.
- W2896720233 cites W2337731955 @default.
- W2896720233 cites W2340970647 @default.
- W2896720233 cites W2415834705 @default.
- W2896720233 cites W2470414691 @default.
- W2896720233 cites W2472513547 @default.
- W2896720233 cites W2514430732 @default.
- W2896720233 cites W2516173072 @default.
- W2896720233 cites W2520682509 @default.
- W2896720233 cites W2530181556 @default.
- W2896720233 cites W2557383173 @default.
- W2896720233 cites W2559209493 @default.
- W2896720233 cites W2599457435 @default.
- W2896720233 cites W2605763657 @default.
- W2896720233 cites W2607357445 @default.
- W2896720233 cites W2608035254 @default.
- W2896720233 cites W2609394459 @default.
- W2896720233 cites W2735158968 @default.
- W2896720233 cites W2735428840 @default.
- W2896720233 cites W2736742854 @default.
- W2896720233 cites W2748005921 @default.
- W2896720233 cites W2749697459 @default.
- W2896720233 cites W2754289562 @default.
- W2896720233 cites W2757522837 @default.
- W2896720233 cites W2759893831 @default.
- W2896720233 cites W2762890495 @default.
- W2896720233 cites W2766430481 @default.
- W2896720233 cites W2767196078 @default.
- W2896720233 cites W2769306988 @default.
- W2896720233 cites W2769456202 @default.
- W2896720233 cites W2770445408 @default.
- W2896720233 cites W2784083480 @default.
- W2896720233 cites W2794797435 @default.
- W2896720233 cites W2804489036 @default.
- W2896720233 cites W2804549231 @default.
- W2896720233 cites W2805791355 @default.
- W2896720233 cites W2807186140 @default.
- W2896720233 cites W2808950870 @default.