Matches in SemOpenAlex for { <https://semopenalex.org/work/W4387637108> ?p ?o ?g. }
- W4387637108 abstract "Protein remote homology detection is essential for structure prediction, function prediction, disease mechanism understanding, etc. The remote homology relationship depends on multiple protein properties, such as structural information and local sequence patterns. Previous studies have shown the challenges for predicting remote homology relationship by protein features at sequence level (e.g. position-specific score matrix). Protein motifs have been used in structure and function analysis due to their unique sequence patterns and implied structural information. Therefore, designing a usable architecture to fuse multiple protein properties based on motifs is urgently needed to improve protein remote homology detection performance. To make full use of the characteristics of motifs, we employed the language model called the protein cubic language model (PCLM). It combines multiple properties by constructing a motif-based neural network. Based on the PCLM, we proposed a predictor called PreHom-PCLM by extracting and fusing multiple motif features for protein remote homology detection. PreHom-PCLM outperforms the other state-of-the-art methods on the test set and independent test set. Experimental results further prove the effectiveness of multiple features fused by PreHom-PCLM for remote homology detection. Furthermore, the protein features derived from the PreHom-PCLM show strong discriminative power for proteins from different structural classes in the high-dimensional space. Availability and Implementation: http://bliulab.net/PreHom-PCLM." @default.
- W4387637108 created "2023-10-15" @default.
- W4387637108 creator A5039611523 @default.
- W4387637108 creator A5042037784 @default.
- W4387637108 creator A5051460401 @default.
- W4387637108 creator A5091016333 @default.
- W4387637108 date "2023-09-22" @default.
- W4387637108 modified "2023-10-15" @default.
- W4387637108 title "PreHom-PCLM: protein remote homology detection by combing motifs and protein cubic language model" @default.
- W4387637108 cites W1990534247 @default.
- W4387637108 cites W2015180542 @default.
- W4387637108 cites W2051210555 @default.
- W4387637108 cites W2062296203 @default.
- W4387637108 cites W2084787613 @default.
- W4387637108 cites W2085277871 @default.
- W4387637108 cites W2118543988 @default.
- W4387637108 cites W2127322768 @default.
- W4387637108 cites W2135621733 @default.
- W4387637108 cites W2138122982 @default.
- W4387637108 cites W2142678478 @default.
- W4387637108 cites W2147660582 @default.
- W4387637108 cites W2156125289 @default.
- W4387637108 cites W2158714788 @default.
- W4387637108 cites W2169150541 @default.
- W4387637108 cites W2726870159 @default.
- W4387637108 cites W2747370968 @default.
- W4387637108 cites W2761495217 @default.
- W4387637108 cites W2902353954 @default.
- W4387637108 cites W2914272550 @default.
- W4387637108 cites W2950954328 @default.
- W4387637108 cites W2951149542 @default.
- W4387637108 cites W2953008890 @default.
- W4387637108 cites W2963579612 @default.
- W4387637108 cites W2964110616 @default.
- W4387637108 cites W2972411752 @default.
- W4387637108 cites W2989977616 @default.
- W4387637108 cites W2997234557 @default.
- W4387637108 cites W2999481648 @default.
- W4387637108 cites W3027689783 @default.
- W4387637108 cites W3084310327 @default.
- W4387637108 cites W3104285666 @default.
- W4387637108 cites W3164046276 @default.
- W4387637108 cites W3166142427 @default.
- W4387637108 cites W3174724403 @default.
- W4387637108 cites W3177500196 @default.
- W4387637108 cites W3177828909 @default.
- W4387637108 cites W3186179742 @default.
- W4387637108 cites W3203741962 @default.
- W4387637108 cites W3209492740 @default.
- W4387637108 cites W3212854871 @default.
- W4387637108 cites W4200166788 @default.
- W4387637108 cites W4220991280 @default.
- W4387637108 cites W4225665657 @default.
- W4387637108 cites W4242765109 @default.
- W4387637108 cites W4281790889 @default.
- W4387637108 cites W4307698996 @default.
- W4387637108 cites W4308370073 @default.
- W4387637108 cites W4313430582 @default.
- W4387637108 doi "https://doi.org/10.1093/bib/bbad347" @default.
- W4387637108 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/37833837" @default.
- W4387637108 hasPublicationYear "2023" @default.
- W4387637108 type Work @default.
- W4387637108 citedByCount "0" @default.
- W4387637108 crossrefType "journal-article" @default.
- W4387637108 hasAuthorship W4387637108A5039611523 @default.
- W4387637108 hasAuthorship W4387637108A5042037784 @default.
- W4387637108 hasAuthorship W4387637108A5051460401 @default.
- W4387637108 hasAuthorship W4387637108A5091016333 @default.
- W4387637108 hasConcept C104317684 @default.
- W4387637108 hasConcept C11413529 @default.
- W4387637108 hasConcept C117745874 @default.
- W4387637108 hasConcept C132677234 @default.
- W4387637108 hasConcept C153180895 @default.
- W4387637108 hasConcept C154945302 @default.
- W4387637108 hasConcept C165525559 @default.
- W4387637108 hasConcept C169627665 @default.
- W4387637108 hasConcept C181199279 @default.
- W4387637108 hasConcept C2874115 @default.
- W4387637108 hasConcept C41008148 @default.
- W4387637108 hasConcept C47701112 @default.
- W4387637108 hasConcept C54355233 @default.
- W4387637108 hasConcept C55493867 @default.
- W4387637108 hasConcept C70721500 @default.
- W4387637108 hasConcept C86803240 @default.
- W4387637108 hasConcept C97931131 @default.
- W4387637108 hasConceptScore W4387637108C104317684 @default.
- W4387637108 hasConceptScore W4387637108C11413529 @default.
- W4387637108 hasConceptScore W4387637108C117745874 @default.
- W4387637108 hasConceptScore W4387637108C132677234 @default.
- W4387637108 hasConceptScore W4387637108C153180895 @default.
- W4387637108 hasConceptScore W4387637108C154945302 @default.
- W4387637108 hasConceptScore W4387637108C165525559 @default.
- W4387637108 hasConceptScore W4387637108C169627665 @default.
- W4387637108 hasConceptScore W4387637108C181199279 @default.
- W4387637108 hasConceptScore W4387637108C2874115 @default.
- W4387637108 hasConceptScore W4387637108C41008148 @default.
- W4387637108 hasConceptScore W4387637108C47701112 @default.
- W4387637108 hasConceptScore W4387637108C54355233 @default.
- W4387637108 hasConceptScore W4387637108C55493867 @default.
- W4387637108 hasConceptScore W4387637108C70721500 @default.