Matches in SemOpenAlex for { <https://semopenalex.org/work/W4229451461> ?p ?o ?g. }
- W4229451461 abstract "Protein language models have emerged as an alternative to multiple sequence alignment for enriching sequence information and improving downstream prediction tasks such as biophysical, structural, and functional properties. Here we show that a method called SPOT-1D-LM combines traditional one-hot encoding with the embeddings from two different language models (ProtTrans and ESM-1b) for the input and yields a leap in accuracy over single-sequence-based techniques in predicting protein 1D secondary and tertiary structural properties, including backbone torsion angles, solvent accessibility and contact numbers for all six test sets (TEST2018, TEST2020, Neff1-2020, CASP12-FM, CASP13-FM and CASP14-FM). More significantly, it has a performance comparable to profile-based methods for those proteins with homologous sequences. For example, the accuracies for three-state secondary structure (SS3) prediction for TEST2018 and TEST2020 proteins are 86.7% and 79.8% by SPOT-1D-LM, compared to 74.3% and 73.4% by the single-sequence-based method SPOT-1D-Single and 86.2% and 80.5% by the profile-based method SPOT-1D, respectively. For proteins without homologous sequences (Neff1-2020), SS3 is 80.41% by SPOT-1D-LM, which is 3.8% and 8.3% higher than SPOT-1D-Single and SPOT-1D, respectively. SPOT-1D-LM is expected to be useful for genome-wide analysis given its fast performance. Moreover, high-accuracy prediction of both secondary and tertiary structural properties such as backbone angles and solvent accessibility without sequence alignment suggests that highly accurate prediction of protein structures may be made without homologous sequences, the remaining obstacle in the post-AlphaFold2 era." @default.
- W4229451461 created "2022-05-11" @default.
- W4229451461 creator A5013626970 @default.
- W4229451461 creator A5032724151 @default.
- W4229451461 creator A5046089538 @default.
- W4229451461 creator A5048999222 @default.
- W4229451461 creator A5056571102 @default.
- W4229451461 date "2022-05-09" @default.
- W4229451461 modified "2023-10-01" @default.
- W4229451461 title "Reaching alignment-profile-based accuracy in predicting protein secondary and tertiary structural properties without alignment" @default.
- W4229451461 cites W1574447377 @default.
- W4229451461 cites W1901129140 @default.
- W4229451461 cites W1999724806 @default.
- W4229451461 cites W2008708467 @default.
- W4229451461 cites W2076048958 @default.
- W4229451461 cites W2111374837 @default.
- W4229451461 cites W2116160060 @default.
- W4229451461 cites W2131774270 @default.
- W4229451461 cites W2141885858 @default.
- W4229451461 cites W2142529984 @default.
- W4229451461 cites W2166637863 @default.
- W4229451461 cites W2288234278 @default.
- W4229451461 cites W2549976854 @default.
- W4229451461 cites W2557595285 @default.
- W4229451461 cites W2791790018 @default.
- W4229451461 cites W2808950571 @default.
- W4229451461 cites W2898320067 @default.
- W4229451461 cites W2905446269 @default.
- W4229451461 cites W2949342052 @default.
- W4229451461 cites W2949867299 @default.
- W4229451461 cites W2950374603 @default.
- W4229451461 cites W2950954328 @default.
- W4229451461 cites W2952317511 @default.
- W4229451461 cites W2953008890 @default.
- W4229451461 cites W2956081200 @default.
- W4229451461 cites W2963457143 @default.
- W4229451461 cites W2968295962 @default.
- W4229451461 cites W2971227267 @default.
- W4229451461 cites W2995514860 @default.
- W4229451461 cites W3011698511 @default.
- W4229451461 cites W3042567394 @default.
- W4229451461 cites W3093108887 @default.
- W4229451461 cites W3111174583 @default.
- W4229451461 cites W3127535326 @default.
- W4229451461 cites W3158236124 @default.
- W4229451461 cites W3160452636 @default.
- W4229451461 cites W3177500196 @default.
- W4229451461 cites W3177828909 @default.
- W4229451461 cites W3186179742 @default.
- W4229451461 cites W3191761521 @default.
- W4229451461 cites W4205806554 @default.
- W4229451461 cites W4206563428 @default.
- W4229451461 cites W4210400672 @default.
- W4229451461 doi "https://doi.org/10.1038/s41598-022-11684-w" @default.
- W4229451461 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/35534620" @default.
- W4229451461 hasPublicationYear "2022" @default.
- W4229451461 type Work @default.
- W4229451461 citedByCount "8" @default.
- W4229451461 countsByYear W42294514612022 @default.
- W4229451461 countsByYear W42294514612023 @default.
- W4229451461 crossrefType "journal-article" @default.
- W4229451461 hasAuthorship W4229451461A5013626970 @default.
- W4229451461 hasAuthorship W4229451461A5032724151 @default.
- W4229451461 hasAuthorship W4229451461A5046089538 @default.
- W4229451461 hasAuthorship W4229451461A5048999222 @default.
- W4229451461 hasAuthorship W4229451461A5056571102 @default.
- W4229451461 hasBestOaLocation W42294514611 @default.
- W4229451461 hasConcept C104317684 @default.
- W4229451461 hasConcept C111919701 @default.
- W4229451461 hasConcept C11413529 @default.
- W4229451461 hasConcept C153180895 @default.
- W4229451461 hasConcept C154945302 @default.
- W4229451461 hasConcept C167625842 @default.
- W4229451461 hasConcept C185592680 @default.
- W4229451461 hasConcept C186060115 @default.
- W4229451461 hasConcept C199672914 @default.
- W4229451461 hasConcept C2778112365 @default.
- W4229451461 hasConcept C41008148 @default.
- W4229451461 hasConcept C45484198 @default.
- W4229451461 hasConcept C4668613 @default.
- W4229451461 hasConcept C54355233 @default.
- W4229451461 hasConcept C55493867 @default.
- W4229451461 hasConcept C62614982 @default.
- W4229451461 hasConcept C70721500 @default.
- W4229451461 hasConcept C86803240 @default.
- W4229451461 hasConcept C88031987 @default.
- W4229451461 hasConceptScore W4229451461C104317684 @default.
- W4229451461 hasConceptScore W4229451461C111919701 @default.
- W4229451461 hasConceptScore W4229451461C11413529 @default.
- W4229451461 hasConceptScore W4229451461C153180895 @default.
- W4229451461 hasConceptScore W4229451461C154945302 @default.
- W4229451461 hasConceptScore W4229451461C167625842 @default.
- W4229451461 hasConceptScore W4229451461C185592680 @default.
- W4229451461 hasConceptScore W4229451461C186060115 @default.
- W4229451461 hasConceptScore W4229451461C199672914 @default.
- W4229451461 hasConceptScore W4229451461C2778112365 @default.
- W4229451461 hasConceptScore W4229451461C41008148 @default.
- W4229451461 hasConceptScore W4229451461C45484198 @default.
- W4229451461 hasConceptScore W4229451461C4668613 @default.
- W4229451461 hasConceptScore W4229451461C54355233 @default.