Matches in SemOpenAlex for { <https://semopenalex.org/work/W3048991675> ?p ?o ?g. }
- W3048991675 endingPage "113903" @default.
- W3048991675 startingPage "113903" @default.
- W3048991675 abstract "Lysine crotonylation is an important protein post-translational modification, which plays an important role in the process of chromosome organization and nucleic acid metabolism. Recognition of crotonylation sites is important to understand the function and mechanism of proteins. Traditional experimental methods are time-consuming and expensive, and cannot predict crotonylation sites quickly and accurately. Therefore, this paper proposes a novel crotonylation site prediction method called LightGBM-CroSite. First, binary encoding (BE), position weight amino acid composition (PWAA), encoding based on grouped weight (EBGW), k nearest neighbors (KNN), and pseudo-position specific scoring matrix (PsePSSM) are used to extract features of protein sequences and obtain the original feature space. Second, the elastic net is used to remove redundant information and select the optimal feature subset. Third, the synthetic minority oversampling technique (SMOTE) is used to balance the samples. Finally, the balanced feature vectors are input into LightGBM to predict the crotonylation sites. According to the results of the jackknife test, the accuracy (ACC), Matthews correlation coefficient (MCC), and area under the ROC curve (AUC) are 98.99%, 0.9798 and 0.9996, respectively. Compared with other state-of-the-art methods, the results show that our method performs better at crotonylation site prediction. The source code and all datasets are available at https://github.com/QUST-AIBBDRC/LightGBM-CroSite/." @default.
- W3048991675 created "2020-08-21" @default.
- W3048991675 creator A5003331350 @default.
- W3048991675 creator A5023705567 @default.
- W3048991675 creator A5031849574 @default.
- W3048991675 creator A5038117190 @default.
- W3048991675 creator A5075590567 @default.
- W3048991675 date "2020-11-01" @default.
- W3048991675 modified "2023-10-18" @default.
- W3048991675 title "Prediction of protein crotonylation sites through LightGBM classifier based on SMOTE and elastic net" @default.
- W3048991675 cites W1538162617 @default.
- W3048991675 cites W1817561967 @default.
- W3048991675 cites W1982136724 @default.
- W3048991675 cites W1987570579 @default.
- W3048991675 cites W1988790447 @default.
- W3048991675 cites W2000371303 @default.
- W3048991675 cites W2015780677 @default.
- W3048991675 cites W2053186076 @default.
- W3048991675 cites W2056132907 @default.
- W3048991675 cites W2067752346 @default.
- W3048991675 cites W2080922998 @default.
- W3048991675 cites W2103404876 @default.
- W3048991675 cites W2119387367 @default.
- W3048991675 cites W2122825543 @default.
- W3048991675 cites W2128538135 @default.
- W3048991675 cites W2141818629 @default.
- W3048991675 cites W2148143831 @default.
- W3048991675 cites W2153187042 @default.
- W3048991675 cites W2158714788 @default.
- W3048991675 cites W2169937061 @default.
- W3048991675 cites W2209329607 @default.
- W3048991675 cites W2278741011 @default.
- W3048991675 cites W2297784337 @default.
- W3048991675 cites W2395084017 @default.
- W3048991675 cites W2472513547 @default.
- W3048991675 cites W2590067952 @default.
- W3048991675 cites W2593128459 @default.
- W3048991675 cites W2614370829 @default.
- W3048991675 cites W2736251917 @default.
- W3048991675 cites W2748860630 @default.
- W3048991675 cites W2768705872 @default.
- W3048991675 cites W2797330243 @default.
- W3048991675 cites W2803011470 @default.
- W3048991675 cites W2807846544 @default.
- W3048991675 cites W2810225085 @default.
- W3048991675 cites W2810614960 @default.
- W3048991675 cites W2896899595 @default.
- W3048991675 cites W2900868822 @default.
- W3048991675 cites W2904742480 @default.
- W3048991675 cites W2905744876 @default.
- W3048991675 cites W2907063321 @default.
- W3048991675 cites W2911378181 @default.
- W3048991675 cites W2950389803 @default.
- W3048991675 cites W2980057552 @default.
- W3048991675 cites W3010877467 @default.
- W3048991675 cites W3012353262 @default.
- W3048991675 doi "https://doi.org/10.1016/j.ab.2020.113903" @default.
- W3048991675 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/32805274" @default.
- W3048991675 hasPublicationYear "2020" @default.
- W3048991675 type Work @default.
- W3048991675 sameAs 3048991675 @default.
- W3048991675 citedByCount "42" @default.
- W3048991675 countsByYear W30489916752020 @default.
- W3048991675 countsByYear W30489916752021 @default.
- W3048991675 countsByYear W30489916752022 @default.
- W3048991675 countsByYear W30489916752023 @default.
- W3048991675 crossrefType "journal-article" @default.
- W3048991675 hasAuthorship W3048991675A5003331350 @default.
- W3048991675 hasAuthorship W3048991675A5023705567 @default.
- W3048991675 hasAuthorship W3048991675A5031849574 @default.
- W3048991675 hasAuthorship W3048991675A5038117190 @default.
- W3048991675 hasAuthorship W3048991675A5075590567 @default.
- W3048991675 hasConcept C119857082 @default.
- W3048991675 hasConcept C14166107 @default.
- W3048991675 hasConcept C148483581 @default.
- W3048991675 hasConcept C153180895 @default.
- W3048991675 hasConcept C154945302 @default.
- W3048991675 hasConcept C203868755 @default.
- W3048991675 hasConcept C2524010 @default.
- W3048991675 hasConcept C33923547 @default.
- W3048991675 hasConcept C41008148 @default.
- W3048991675 hasConcept C95623464 @default.
- W3048991675 hasConceptScore W3048991675C119857082 @default.
- W3048991675 hasConceptScore W3048991675C14166107 @default.
- W3048991675 hasConceptScore W3048991675C148483581 @default.
- W3048991675 hasConceptScore W3048991675C153180895 @default.
- W3048991675 hasConceptScore W3048991675C154945302 @default.
- W3048991675 hasConceptScore W3048991675C203868755 @default.
- W3048991675 hasConceptScore W3048991675C2524010 @default.
- W3048991675 hasConceptScore W3048991675C33923547 @default.
- W3048991675 hasConceptScore W3048991675C41008148 @default.
- W3048991675 hasConceptScore W3048991675C95623464 @default.
- W3048991675 hasFunder F4320321001 @default.
- W3048991675 hasFunder F4320324174 @default.
- W3048991675 hasFunder F4320325655 @default.
- W3048991675 hasFunder F4320333596 @default.
- W3048991675 hasLocation W30489916751 @default.
- W3048991675 hasOpenAccess W3048991675 @default.
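
The abstract in this record names binary encoding (BE) as one feature extractor and reports ACC and MCC as evaluation metrics. The following is a minimal illustrative sketch of those two ideas only, not the authors' implementation (their code is at the repository linked in the abstract). The residue ordering, the all-zero handling of unknown symbols such as a padding "X", and the function names `binary_encode`, `accuracy`, and `mcc` are assumptions made for this example.

```python
import math

# Assumption: the 20 standard amino acids in alphabetical one-letter order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def binary_encode(peptide):
    """One-hot encode each residue of a peptide window.

    Assumption: symbols outside AMINO_ACIDS (e.g. a padding 'X')
    are encoded as an all-zero block of length 20.
    """
    vector = []
    for residue in peptide:
        block = [0] * len(AMINO_ACIDS)
        idx = AMINO_ACIDS.find(residue)
        if idx >= 0:
            block[idx] = 1
        vector.extend(block)
    return vector


def accuracy(tp, tn, fp, fn):
    """ACC = (TP + TN) / (TP + TN + FP + FN)."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns 0.0 when the denominator is zero, a common convention.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom
```

A window of length n produces a vector of length 20n under this encoding. In the pipeline the abstract describes, such vectors would be concatenated with the other feature types before elastic net selection, SMOTE balancing, and LightGBM classification.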