Matches in SemOpenAlex for { <https://semopenalex.org/work/W4315864628> ?p ?o ?g. }
- W4315864628 endingPage "123180" @default.
- W4315864628 startingPage "123180" @default.
- W4315864628 abstract "N4-methylcytosine (4mC) is an important DNA chemical modification, a methylation pattern discovered in recent years that plays critical roles in gene expression regulation, defense against invading genetic elements, genomic imprinting, and so on. Identifying 4mC sites from DNA sequence segments contributes to discovering more novel modification patterns. In this paper, we present a model called 4mCBERT that encodes DNA sequence segments by sequence characteristics, including one-hot, electron-ion interaction pseudopotential, nucleotide chemical property, and word2vec, and by chemical information, including physicochemical properties (PCP) and chemical bidirectional encoder representations from transformers (chemical BERT), and employs an ensemble learning framework to develop a prediction model. PCP and chemical BERT features are constructed and applied to 4mC site prediction for the first time and show positive contributions to identifying 4mC. For the Matthews correlation coefficient, 4mCBERT significantly outperformed other state-of-the-art models on six independent benchmark datasets, including A. thaliana, C. elegans, D. melanogaster, E. coli, G. pickeringii, and G. subterraneus, by 4.32 % to 24.39 %, 2.52 % to 31.65 %, 2 % to 16.49 %, 6.63 % to 35.15 %, 8.59 % to 61.85 %, and 8.45 % to 34.45 %, respectively. Moreover, 4mCBERT is designed to allow users to predict 4mC sites and retrain 4mC prediction models. In brief, 4mCBERT shows higher performance on six benchmark datasets by incorporating sequence- and chemical-driven information and is available at http://cczubio.top/4mCBERT and https://github.com/abcair/4mCBERT." @default.
- W4315864628 created "2023-01-13" @default.
- W4315864628 creator A5016638896 @default.
- W4315864628 creator A5038760444 @default.
- W4315864628 creator A5059423948 @default.
- W4315864628 date "2023-03-01" @default.
- W4315864628 modified "2023-10-17" @default.
- W4315864628 title "4mCBERT: A computing tool for the identification of DNA N4-methylcytosine sites by sequence- and chemical-derived information based on ensemble learning strategies" @default.
- W4315864628 cites W1715444199 @default.
- W4315864628 cites W1997866877 @default.
- W4315864628 cites W2150920145 @default.
- W4315864628 cites W2170747616 @default.
- W4315864628 cites W2463031885 @default.
- W4315864628 cites W2537227711 @default.
- W4315864628 cites W2549518011 @default.
- W4315864628 cites W2631752818 @default.
- W4315864628 cites W2737592062 @default.
- W4315864628 cites W2747948838 @default.
- W4315864628 cites W2785534246 @default.
- W4315864628 cites W2792451020 @default.
- W4315864628 cites W2800742848 @default.
- W4315864628 cites W2805701650 @default.
- W4315864628 cites W2883534252 @default.
- W4315864628 cites W2886223143 @default.
- W4315864628 cites W2890517686 @default.
- W4315864628 cites W2892079508 @default.
- W4315864628 cites W2932679380 @default.
- W4315864628 cites W2944851425 @default.
- W4315864628 cites W2951845617 @default.
- W4315864628 cites W2954633061 @default.
- W4315864628 cites W2963037859 @default.
- W4315864628 cites W2963537251 @default.
- W4315864628 cites W2972632059 @default.
- W4315864628 cites W2981572887 @default.
- W4315864628 cites W3004591935 @default.
- W4315864628 cites W3010162374 @default.
- W4315864628 cites W3011537067 @default.
- W4315864628 cites W3017583060 @default.
- W4315864628 cites W3021941048 @default.
- W4315864628 cites W3037668068 @default.
- W4315864628 cites W3086030569 @default.
- W4315864628 cites W3088538819 @default.
- W4315864628 cites W3094312164 @default.
- W4315864628 cites W3094948551 @default.
- W4315864628 cites W3097145107 @default.
- W4315864628 cites W3110491319 @default.
- W4315864628 cites W3116664634 @default.
- W4315864628 cites W3120063029 @default.
- W4315864628 cites W3120223152 @default.
- W4315864628 cites W3132403896 @default.
- W4315864628 cites W3159427491 @default.
- W4315864628 cites W3160425403 @default.
- W4315864628 cites W3174619174 @default.
- W4315864628 cites W3174714560 @default.
- W4315864628 cites W3190763963 @default.
- W4315864628 cites W3200137755 @default.
- W4315864628 cites W3210257016 @default.
- W4315864628 cites W4207028073 @default.
- W4315864628 cites W4220690199 @default.
- W4315864628 cites W4221105426 @default.
- W4315864628 cites W4224298700 @default.
- W4315864628 cites W4283706978 @default.
- W4315864628 cites W3173302775 @default.
- W4315864628 doi "https://doi.org/10.1016/j.ijbiomac.2023.123180" @default.
- W4315864628 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/36646347" @default.
- W4315864628 hasPublicationYear "2023" @default.
- W4315864628 type Work @default.
- W4315864628 citedByCount "5" @default.
- W4315864628 countsByYear W43158646282023 @default.
- W4315864628 crossrefType "journal-article" @default.
- W4315864628 hasAuthorship W4315864628A5016638896 @default.
- W4315864628 hasAuthorship W4315864628A5038760444 @default.
- W4315864628 hasAuthorship W4315864628A5059423948 @default.
- W4315864628 hasConcept C13280743 @default.
- W4315864628 hasConcept C154945302 @default.
- W4315864628 hasConcept C185798385 @default.
- W4315864628 hasConcept C205649164 @default.
- W4315864628 hasConcept C41008148 @default.
- W4315864628 hasConcept C45942800 @default.
- W4315864628 hasConcept C70721500 @default.
- W4315864628 hasConcept C86803240 @default.
- W4315864628 hasConceptScore W4315864628C13280743 @default.
- W4315864628 hasConceptScore W4315864628C154945302 @default.
- W4315864628 hasConceptScore W4315864628C185798385 @default.
- W4315864628 hasConceptScore W4315864628C205649164 @default.
- W4315864628 hasConceptScore W4315864628C41008148 @default.
- W4315864628 hasConceptScore W4315864628C45942800 @default.
- W4315864628 hasConceptScore W4315864628C70721500 @default.
- W4315864628 hasConceptScore W4315864628C86803240 @default.
- W4315864628 hasLocation W43158646281 @default.
- W4315864628 hasLocation W43158646282 @default.
- W4315864628 hasOpenAccess W4315864628 @default.
- W4315864628 hasPrimaryLocation W43158646281 @default.
- W4315864628 hasRelatedWork W2028665553 @default.
- W4315864628 hasRelatedWork W2086519370 @default.
- W4315864628 hasRelatedWork W2087343574 @default.
- W4315864628 hasRelatedWork W2130974462 @default.
- W4315864628 hasRelatedWork W2378211422 @default.
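
The abstract of W4315864628 names one-hot, electron-ion interaction pseudopotential (EIIP), and nucleotide chemical property (NCP) among the sequence encodings. The following is a minimal sketch of how such per-base encodings are commonly built, not the 4mCBERT implementation: the function name `encode_segment` is hypothetical, the EIIP and NCP values are those widely used in the literature rather than values confirmed by this record, and the word2vec, PCP, chemical BERT, and ensemble steps are omitted.

```python
from typing import List

# One-hot: one indicator position per nucleotide, in A, C, G, T order.
ONE_HOT = {
    "A": [1, 0, 0, 0],
    "C": [0, 1, 0, 0],
    "G": [0, 0, 1, 0],
    "T": [0, 0, 0, 1],
}

# EIIP values commonly cited for DNA nucleotides (assumed, not taken from this record).
EIIP = {"A": 0.1260, "C": 0.1340, "G": 0.0806, "T": 0.1335}

# NCP as (ring structure, functional group, hydrogen bond strength):
# purine = 1, amino = 1, weak hydrogen bonding = 1 (a commonly used convention, assumed here).
NCP = {
    "A": [1, 1, 1],
    "C": [0, 1, 0],
    "G": [1, 0, 0],
    "T": [0, 0, 1],
}


def encode_segment(seq: str) -> List[float]:
    """Concatenate one-hot (4), EIIP (1), and NCP (3) values for each base."""
    features: List[float] = []
    for base in seq.upper():
        if base not in ONE_HOT:
            raise ValueError(f"unsupported nucleotide: {base!r}")
        features.extend(ONE_HOT[base])
        features.append(EIIP[base])
        features.extend(NCP[base])
    return features


if __name__ == "__main__":
    vector = encode_segment("ACGT")
    print(len(vector))  # 8 features per base
```

Each base contributes eight values, so a segment of length n yields a flat vector of length 8n that a downstream classifier could consume.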