Matches in SemOpenAlex for { <https://semopenalex.org/work/W3178922277> ?p ?o ?g. }
- W3178922277 abstract "Abstract Glycosyltransferases (GTs) play fundamental roles in nearly all cellular processes through the biosynthesis of complex carbohydrates and glycosylation of diverse protein and small molecule substrates. The extensive structural and functional diversification of GTs presents a major challenge in mapping the relationships connecting sequence, structure, fold and function using traditional bioinformatics approaches. Here, we present a convolutional neural network with attention (CNN-attention) based deep learning model that leverages simple secondary structure representations generated from primary sequences to provide GT fold prediction with high accuracy. The model learned distinguishing features free of primary sequence alignment constraints and, unlike other models, is highly interpretable and helped identify common secondary structural features shared by divergent families. The model delineated sequence and structural features characteristic of individual fold types, while classifying them into distinct clusters that group evolutionarily divergent families based on shared secondary structural features. We further extend our model to classify GT families of unknown folds and variants of known folds. By identifying families that are likely to adopt novel folds such as GT91, GT96 and GT97, our studies identify targets for future structural studies and expand the GT fold landscape." @default.
- W3178922277 created "2021-07-19" @default.
- W3178922277 creator A5009581435 @default.
- W3178922277 creator A5033384464 @default.
- W3178922277 creator A5050749778 @default.
- W3178922277 creator A5066141945 @default.
- W3178922277 creator A5071400741 @default.
- W3178922277 creator A5084948037 @default.
- W3178922277 date "2021-07-06" @default.
- W3178922277 modified "2023-09-25" @default.
- W3178922277 title "Mapping the glycosyltransferase fold landscape using deep learning" @default.
- W3178922277 cites W1508885706 @default.
- W3178922277 cites W1832693441 @default.
- W3178922277 cites W1994095141 @default.
- W3178922277 cites W1999147304 @default.
- W3178922277 cites W2008708467 @default.
- W3178922277 cites W2018454041 @default.
- W3178922277 cites W2037993016 @default.
- W3178922277 cites W2038564024 @default.
- W3178922277 cites W2049149162 @default.
- W3178922277 cites W2064675550 @default.
- W3178922277 cites W2112921913 @default.
- W3178922277 cites W2122559203 @default.
- W3178922277 cites W2124351063 @default.
- W3178922277 cites W2127963615 @default.
- W3178922277 cites W2132632000 @default.
- W3178922277 cites W2135465741 @default.
- W3178922277 cites W2150869803 @default.
- W3178922277 cites W2152997840 @default.
- W3178922277 cites W2157387409 @default.
- W3178922277 cites W2172024836 @default.
- W3178922277 cites W2253808368 @default.
- W3178922277 cites W2300549074 @default.
- W3178922277 cites W2404417962 @default.
- W3178922277 cites W2511687775 @default.
- W3178922277 cites W2607268717 @default.
- W3178922277 cites W2761250517 @default.
- W3178922277 cites W2794004073 @default.
- W3178922277 cites W2800781359 @default.
- W3178922277 cites W2901114541 @default.
- W3178922277 cites W2918296775 @default.
- W3178922277 cites W2943203634 @default.
- W3178922277 cites W2950374603 @default.
- W3178922277 cites W2952073529 @default.
- W3178922277 cites W2963614249 @default.
- W3178922277 cites W2968256998 @default.
- W3178922277 cites W2972596570 @default.
- W3178922277 cites W2973218493 @default.
- W3178922277 cites W2976491270 @default.
- W3178922277 cites W2997234557 @default.
- W3178922277 cites W2998499403 @default.
- W3178922277 cites W2999044305 @default.
- W3178922277 cites W3010453736 @default.
- W3178922277 cites W3014163652 @default.
- W3178922277 cites W3039647328 @default.
- W3178922277 cites W3102564565 @default.
- W3178922277 cites W3103145119 @default.
- W3178922277 cites W3119905388 @default.
- W3178922277 cites W3146944767 @default.
- W3178922277 cites W3199799076 @default.
- W3178922277 cites W4206671406 @default.
- W3178922277 cites W4249977334 @default.
- W3178922277 cites W4255659575 @default.
- W3178922277 cites W4300672471 @default.
- W3178922277 doi "https://doi.org/10.1101/2021.07.05.451183" @default.
- W3178922277 hasPublicationYear "2021" @default.
- W3178922277 type Work @default.
- W3178922277 sameAs 3178922277 @default.
- W3178922277 citedByCount "0" @default.
- W3178922277 crossrefType "posted-content" @default.
- W3178922277 hasAuthorship W3178922277A5009581435 @default.
- W3178922277 hasAuthorship W3178922277A5033384464 @default.
- W3178922277 hasAuthorship W3178922277A5050749778 @default.
- W3178922277 hasAuthorship W3178922277A5066141945 @default.
- W3178922277 hasAuthorship W3178922277A5071400741 @default.
- W3178922277 hasAuthorship W3178922277A5084948037 @default.
- W3178922277 hasBestOaLocation W31789222771 @default.
- W3178922277 hasConcept C104317684 @default.
- W3178922277 hasConcept C108583219 @default.
- W3178922277 hasConcept C117745874 @default.
- W3178922277 hasConcept C132677234 @default.
- W3178922277 hasConcept C154945302 @default.
- W3178922277 hasConcept C199360897 @default.
- W3178922277 hasConcept C2778112365 @default.
- W3178922277 hasConcept C41008148 @default.
- W3178922277 hasConcept C53942344 @default.
- W3178922277 hasConcept C54355233 @default.
- W3178922277 hasConcept C55493867 @default.
- W3178922277 hasConcept C62614982 @default.
- W3178922277 hasConcept C70721500 @default.
- W3178922277 hasConcept C78458016 @default.
- W3178922277 hasConcept C81363708 @default.
- W3178922277 hasConcept C86803240 @default.
- W3178922277 hasConceptScore W3178922277C104317684 @default.
- W3178922277 hasConceptScore W3178922277C108583219 @default.
- W3178922277 hasConceptScore W3178922277C117745874 @default.
- W3178922277 hasConceptScore W3178922277C132677234 @default.
- W3178922277 hasConceptScore W3178922277C154945302 @default.
- W3178922277 hasConceptScore W3178922277C199360897 @default.
- W3178922277 hasConceptScore W3178922277C2778112365 @default.