Matches in SemOpenAlex for { <https://semopenalex.org/work/W3141691617> ?p ?o ?g. }
- W3141691617 abstract "Abstract Background Protein post-translational modification (PTM) is a key issue to investigate the mechanism of protein’s function. With the rapid development of proteomics technology, a large amount of protein sequence data has been generated, which highlights the importance of the in-depth study and analysis of PTMs in proteins. Method We proposed a new multi-classification machine learning pipeline MultiLyGAN to identity seven types of lysine modified sites. Using eight different sequential and five structural construction methods, 1497 valid features were remained after the filtering by Pearson correlation coefficient. To solve the data imbalance problem, Conditional Generative Adversarial Network (CGAN) and Conditional Wasserstein Generative Adversarial Network (CWGAN), two influential deep generative methods were leveraged and compared to generate new samples for the types with fewer samples. Finally, random forest algorithm was utilized to predict seven categories. Results In the tenfold cross-validation, accuracy (Acc) and Matthews correlation coefficient (MCC) were 0.8589 and 0.8376, respectively. In the independent test, Acc and MCC were 0.8549 and 0.8330, respectively. The results indicated that CWGAN better solved the existing data imbalance and stabilized the training error. Alternatively, an accumulated feature importance analysis reported that CKSAAP, PWM and structural features were the three most important feature-encoding schemes. MultiLyGAN can be found at https://github.com/Lab-Xu/MultiLyGAN . Conclusions The CWGAN greatly improved the predictive performance in all experiments. Features derived from CKSAAP, PWM and structure schemes are the most informative and had the greatest contribution to the prediction of PTM." @default.
- W3141691617 created "2021-04-13" @default.
- W3141691617 creator A5001969147 @default.
- W3141691617 creator A5002586536 @default.
- W3141691617 creator A5014559891 @default.
- W3141691617 creator A5036384831 @default.
- W3141691617 creator A5057292712 @default.
- W3141691617 creator A5070676893 @default.
- W3141691617 creator A5090366405 @default.
- W3141691617 date "2021-03-31" @default.
- W3141691617 modified "2023-10-14" @default.
- W3141691617 title "Prediction and analysis of multiple protein lysine modified sites based on conditional wasserstein generative adversarial networks" @default.
- W3141691617 cites W1494484168 @default.
- W3141691617 cites W1973253766 @default.
- W3141691617 cites W1991199546 @default.
- W3141691617 cites W1999118478 @default.
- W3141691617 cites W2015376870 @default.
- W3141691617 cites W2022446852 @default.
- W3141691617 cites W2023540360 @default.
- W3141691617 cites W2046534253 @default.
- W3141691617 cites W2052929547 @default.
- W3141691617 cites W2064016106 @default.
- W3141691617 cites W2086831875 @default.
- W3141691617 cites W2087158205 @default.
- W3141691617 cites W2089980937 @default.
- W3141691617 cites W2095900655 @default.
- W3141691617 cites W2102621836 @default.
- W3141691617 cites W2106413110 @default.
- W3141691617 cites W2116062594 @default.
- W3141691617 cites W2121417968 @default.
- W3141691617 cites W2128538135 @default.
- W3141691617 cites W2130277691 @default.
- W3141691617 cites W2132292391 @default.
- W3141691617 cites W2142479026 @default.
- W3141691617 cites W2144864517 @default.
- W3141691617 cites W2145786566 @default.
- W3141691617 cites W2145957695 @default.
- W3141691617 cites W2152339342 @default.
- W3141691617 cites W2153456067 @default.
- W3141691617 cites W2155853203 @default.
- W3141691617 cites W2157873167 @default.
- W3141691617 cites W2158797467 @default.
- W3141691617 cites W2169543816 @default.
- W3141691617 cites W2207892701 @default.
- W3141691617 cites W2209329607 @default.
- W3141691617 cites W2278741011 @default.
- W3141691617 cites W2344452825 @default.
- W3141691617 cites W2397556949 @default.
- W3141691617 cites W2399634880 @default.
- W3141691617 cites W2467352723 @default.
- W3141691617 cites W2518156938 @default.
- W3141691617 cites W2555891876 @default.
- W3141691617 cites W2558091742 @default.
- W3141691617 cites W2764095180 @default.
- W3141691617 cites W2765093153 @default.
- W3141691617 cites W2775998237 @default.
- W3141691617 cites W2790664092 @default.
- W3141691617 cites W2791621240 @default.
- W3141691617 cites W2801931817 @default.
- W3141691617 cites W2805952346 @default.
- W3141691617 cites W2810675213 @default.
- W3141691617 cites W2898320067 @default.
- W3141691617 cites W2919709896 @default.
- W3141691617 cites W2920338801 @default.
- W3141691617 cites W2991113051 @default.
- W3141691617 cites W3020440546 @default.
- W3141691617 doi "https://doi.org/10.1186/s12859-021-04101-y" @default.
- W3141691617 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/8010967" @default.
- W3141691617 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/33789579" @default.
- W3141691617 hasPublicationYear "2021" @default.
- W3141691617 type Work @default.
- W3141691617 sameAs 3141691617 @default.
- W3141691617 citedByCount "9" @default.
- W3141691617 countsByYear W31416916172021 @default.
- W3141691617 countsByYear W31416916172022 @default.
- W3141691617 crossrefType "journal-article" @default.
- W3141691617 hasAuthorship W3141691617A5001969147 @default.
- W3141691617 hasAuthorship W3141691617A5002586536 @default.
- W3141691617 hasAuthorship W3141691617A5014559891 @default.
- W3141691617 hasAuthorship W3141691617A5036384831 @default.
- W3141691617 hasAuthorship W3141691617A5057292712 @default.
- W3141691617 hasAuthorship W3141691617A5070676893 @default.
- W3141691617 hasAuthorship W3141691617A5090366405 @default.
- W3141691617 hasBestOaLocation W31416916171 @default.
- W3141691617 hasConcept C105795698 @default.
- W3141691617 hasConcept C108583219 @default.
- W3141691617 hasConcept C11413529 @default.
- W3141691617 hasConcept C119857082 @default.
- W3141691617 hasConcept C12267149 @default.
- W3141691617 hasConcept C124101348 @default.
- W3141691617 hasConcept C125411270 @default.
- W3141691617 hasConcept C138885662 @default.
- W3141691617 hasConcept C153180895 @default.
- W3141691617 hasConcept C154945302 @default.
- W3141691617 hasConcept C164085508 @default.
- W3141691617 hasConcept C199360897 @default.
- W3141691617 hasConcept C2776401178 @default.
- W3141691617 hasConcept C2988773926 @default.
- W3141691617 hasConcept C33923547 @default.
- W3141691617 hasConcept C39890363 @default.