Matches in SemOpenAlex for { <https://semopenalex.org/work/W2023994620> ?p ?o ?g. }
- W2023994620 endingPage "310" @default.
- W2023994620 startingPage "299" @default.
- W2023994620 abstract "Identification and classification of G-protein coupled receptors (GPCRs) using protein sequences is an important computational challenge, given that experimental screening of thousands of ligands is an expensive proposition. There are two distinct but complementary approaches to GPCR classification: machine learning and sequence motif analysis. Machine learning methodologies typically suffer from problems of class imbalance and lack of multi-class classification. Many sequence motif methods, meanwhile, are too dependent on the similarity of the primary sequence alignments. It is desirable to have a motif discovery and application methodology that is not strongly dependent on primary sequence similarity and that also overcomes the limitations of machine learning. We propose and evaluate the effectiveness of a simple methodology that uses a reduced protein functional alphabet representation, where functionally similar residues share the same symbol. Regular expression motifs can then be obtained by ClustalW-based multiple sequence alignment, using an identity matrix. Since evolutionary matrices like BLOSUM and PAM are not used, this method can be useful for any set of sequences that do not necessarily share a common ancestry. Reduced alphabet motifs can accurately classify known GPCR proteins and the results are comparable to PRINTS and PROSITE. For well-known GPCR proteins from SWISS-PROT, there were no false negatives and only a few false positives. This methodology covers most currently known classes of GPCRs, even if there are very few representative sequences. It also predicts more than one class for certain sequences, thus overcoming the limitation of machine learning methods. We also annotated 695 orphan receptors, of which 121 were identified as belonging to Family A. A simple JavaScript-based web interface has been developed to predict GPCR families and subfamilies (www.insilico-consulting.com/gpcrmotif.html)." @default.
- W2023994620 created "2016-06-24" @default.
- W2023994620 creator A5073320415 @default.
- W2023994620 creator A5089258038 @default.
- W2023994620 date "2007-12-01" @default.
- W2023994620 modified "2023-09-25" @default.
- W2023994620 title "Reduced Alphabet Motif Methodology for GPCR Annotation" @default.
- W2023994620 cites W1603274157 @default.
- W2023994620 cites W1853093403 @default.
- W2023994620 cites W1975670646 @default.
- W2023994620 cites W1987703405 @default.
- W2023994620 cites W1996378214 @default.
- W2023994620 cites W2002566401 @default.
- W2023994620 cites W2018782485 @default.
- W2023994620 cites W2022247199 @default.
- W2023994620 cites W2031198307 @default.
- W2023994620 cites W2035066314 @default.
- W2023994620 cites W2048497859 @default.
- W2023994620 cites W2052376307 @default.
- W2023994620 cites W2055481969 @default.
- W2023994620 cites W2063430651 @default.
- W2023994620 cites W2065161235 @default.
- W2023994620 cites W2074370114 @default.
- W2023994620 cites W2090669407 @default.
- W2023994620 cites W2094384279 @default.
- W2023994620 cites W2097892623 @default.
- W2023994620 cites W2102122585 @default.
- W2023994620 cites W2102151457 @default.
- W2023994620 cites W2106882534 @default.
- W2023994620 cites W2107427637 @default.
- W2023994620 cites W2108401170 @default.
- W2023994620 cites W2109014958 @default.
- W2023994620 cites W2117249420 @default.
- W2023994620 cites W2119313530 @default.
- W2023994620 cites W2123151657 @default.
- W2023994620 cites W2125652418 @default.
- W2023994620 cites W2133540788 @default.
- W2023994620 cites W2143173841 @default.
- W2023994620 cites W2145549598 @default.
- W2023994620 cites W2146423415 @default.
- W2023994620 cites W2150803024 @default.
- W2023994620 cites W2157959080 @default.
- W2023994620 cites W2158714788 @default.
- W2023994620 cites W2160438089 @default.
- W2023994620 cites W2397024508 @default.
- W2023994620 cites W4210400672 @default.
- W2023994620 cites W4210531204 @default.
- W2023994620 cites W4213149192 @default.
- W2023994620 cites W4234228545 @default.
- W2023994620 cites W4251828057 @default.
- W2023994620 doi "https://doi.org/10.1080/07391102.2007.10507178" @default.
- W2023994620 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/17937491" @default.
- W2023994620 hasPublicationYear "2007" @default.
- W2023994620 type Work @default.
- W2023994620 sameAs 2023994620 @default.
- W2023994620 citedByCount "8" @default.
- W2023994620 countsByYear W20239946202012 @default.
- W2023994620 countsByYear W20239946202013 @default.
- W2023994620 countsByYear W20239946202021 @default.
- W2023994620 countsByYear W20239946202022 @default.
- W2023994620 crossrefType "journal-article" @default.
- W2023994620 hasAuthorship W2023994620A5073320415 @default.
- W2023994620 hasAuthorship W2023994620A5089258038 @default.
- W2023994620 hasConcept C10010492 @default.
- W2023994620 hasConcept C104317684 @default.
- W2023994620 hasConcept C112876837 @default.
- W2023994620 hasConcept C117745874 @default.
- W2023994620 hasConcept C119857082 @default.
- W2023994620 hasConcept C121332964 @default.
- W2023994620 hasConcept C132677234 @default.
- W2023994620 hasConcept C135285700 @default.
- W2023994620 hasConcept C138885662 @default.
- W2023994620 hasConcept C154945302 @default.
- W2023994620 hasConcept C167625842 @default.
- W2023994620 hasConcept C170493617 @default.
- W2023994620 hasConcept C24890656 @default.
- W2023994620 hasConcept C32276052 @default.
- W2023994620 hasConcept C41008148 @default.
- W2023994620 hasConcept C41895202 @default.
- W2023994620 hasConcept C45484198 @default.
- W2023994620 hasConcept C54355233 @default.
- W2023994620 hasConcept C552990157 @default.
- W2023994620 hasConcept C55493867 @default.
- W2023994620 hasConcept C60644358 @default.
- W2023994620 hasConcept C64869954 @default.
- W2023994620 hasConcept C70721500 @default.
- W2023994620 hasConcept C86803240 @default.
- W2023994620 hasConceptScore W2023994620C10010492 @default.
- W2023994620 hasConceptScore W2023994620C104317684 @default.
- W2023994620 hasConceptScore W2023994620C112876837 @default.
- W2023994620 hasConceptScore W2023994620C117745874 @default.
- W2023994620 hasConceptScore W2023994620C119857082 @default.
- W2023994620 hasConceptScore W2023994620C121332964 @default.
- W2023994620 hasConceptScore W2023994620C132677234 @default.
- W2023994620 hasConceptScore W2023994620C135285700 @default.
- W2023994620 hasConceptScore W2023994620C138885662 @default.
- W2023994620 hasConceptScore W2023994620C154945302 @default.
- W2023994620 hasConceptScore W2023994620C167625842 @default.