Matches in SemOpenAlex for { <https://semopenalex.org/work/W2024270124> ?p ?o ?g. }
- W2024270124 endingPage "255" @default.
- W2024270124 startingPage "243" @default.
- W2024270124 abstract "In this paper, an efficient K-medians clustering (unsupervised) algorithm for prototype selection and a Supervised K-medians (SKM) classification technique for protein sequences are presented. For sequence data sets, a median string/sequence can be used as the cluster/group representative. In the K-medians clustering technique, a desired number of clusters, K, each represented by a median string/sequence, is generated, and these median sequences are used as prototypes for classifying the new/test sequence. In the SKM classification technique, the median sequence in each group/class of labelled protein sequences is determined, and the set of median sequences is used as prototypes for classification. It is found that the K-medians clustering technique outperforms the leader-based technique, and that the SKM classification technique performs better than the motif-based approach for the data sets used. We further use a simple technique to reduce time and space requirements during protein sequence clustering and classification. During the training and testing phases, the similarity score between a pair of sequences is determined from a selected portion of each sequence instead of the entire sequence. This is like selecting a subset of features for sequence data sets. The experimental results of the proposed method on the K-medians, SKM and Nearest Neighbour Classifier (NNC) techniques show that the Classification Accuracy (CA) using the prototypes generated/used does not degrade much, while the training and testing times are reduced significantly. Thus the experimental results indicate that the similarity score does not need to be calculated over the entire length of the sequence to achieve a good CA. The space requirement is also reduced during both training and classification." @default.
- W2024270124 created "2016-06-24" @default.
- W2024270124 creator A5028996052 @default.
- W2024270124 creator A5045828691 @default.
- W2024270124 creator A5064702508 @default.
- W2024270124 date "2006-08-22" @default.
- W2024270124 modified "2023-09-27" @default.
- W2024270124 title "Efficient median based clustering and classification techniques for protein sequences" @default.
- W2024270124 cites W129807151 @default.
- W2024270124 cites W1493454437 @default.
- W2024270124 cites W1505171204 @default.
- W2024270124 cites W1513710090 @default.
- W2024270124 cites W1537571504 @default.
- W2024270124 cites W154269568 @default.
- W2024270124 cites W1544578519 @default.
- W2024270124 cites W1546647560 @default.
- W2024270124 cites W1554135376 @default.
- W2024270124 cites W1566159240 @default.
- W2024270124 cites W1592614241 @default.
- W2024270124 cites W1769049279 @default.
- W2024270124 cites W1971784203 @default.
- W2024270124 cites W1988122146 @default.
- W2024270124 cites W1992419399 @default.
- W2024270124 cites W1992648037 @default.
- W2024270124 cites W1997268601 @default.
- W2024270124 cites W2008841836 @default.
- W2024270124 cites W2037201833 @default.
- W2024270124 cites W2048813300 @default.
- W2024270124 cites W2052598245 @default.
- W2024270124 cites W2056841067 @default.
- W2024270124 cites W2069873681 @default.
- W2024270124 cites W2074231493 @default.
- W2024270124 cites W2076223365 @default.
- W2024270124 cites W2077243496 @default.
- W2024270124 cites W2086403919 @default.
- W2024270124 cites W2086770662 @default.
- W2024270124 cites W2087064593 @default.
- W2024270124 cites W2088333696 @default.
- W2024270124 cites W2096635897 @default.
- W2024270124 cites W2105649185 @default.
- W2024270124 cites W2117077088 @default.
- W2024270124 cites W2122111042 @default.
- W2024270124 cites W2123297508 @default.
- W2024270124 cites W2126626732 @default.
- W2024270124 cites W2129430460 @default.
- W2024270124 cites W2146827656 @default.
- W2024270124 cites W2152596143 @default.
- W2024270124 cites W2167130990 @default.
- W2024270124 cites W2176645417 @default.
- W2024270124 cites W2799061466 @default.
- W2024270124 cites W2913660196 @default.
- W2024270124 cites W2999729612 @default.
- W2024270124 cites W3203824546 @default.
- W2024270124 cites W2890040444 @default.
- W2024270124 doi "https://doi.org/10.1007/s10044-006-0040-z" @default.
- W2024270124 hasPublicationYear "2006" @default.
- W2024270124 type Work @default.
- W2024270124 sameAs 2024270124 @default.
- W2024270124 citedByCount "7" @default.
- W2024270124 countsByYear W20242701242012 @default.
- W2024270124 countsByYear W20242701242017 @default.
- W2024270124 countsByYear W20242701242021 @default.
- W2024270124 crossrefType "journal-article" @default.
- W2024270124 hasAuthorship W2024270124A5028996052 @default.
- W2024270124 hasAuthorship W2024270124A5045828691 @default.
- W2024270124 hasAuthorship W2024270124A5064702508 @default.
- W2024270124 hasBestOaLocation W20242701242 @default.
- W2024270124 hasConcept C103278499 @default.
- W2024270124 hasConcept C115961682 @default.
- W2024270124 hasConcept C124101348 @default.
- W2024270124 hasConcept C153180895 @default.
- W2024270124 hasConcept C154945302 @default.
- W2024270124 hasConcept C157486923 @default.
- W2024270124 hasConcept C177264268 @default.
- W2024270124 hasConcept C199360897 @default.
- W2024270124 hasConcept C2778112365 @default.
- W2024270124 hasConcept C33923547 @default.
- W2024270124 hasConcept C37914503 @default.
- W2024270124 hasConcept C41008148 @default.
- W2024270124 hasConcept C54355233 @default.
- W2024270124 hasConcept C73555534 @default.
- W2024270124 hasConcept C86803240 @default.
- W2024270124 hasConcept C95623464 @default.
- W2024270124 hasConceptScore W2024270124C103278499 @default.
- W2024270124 hasConceptScore W2024270124C115961682 @default.
- W2024270124 hasConceptScore W2024270124C124101348 @default.
- W2024270124 hasConceptScore W2024270124C153180895 @default.
- W2024270124 hasConceptScore W2024270124C154945302 @default.
- W2024270124 hasConceptScore W2024270124C157486923 @default.
- W2024270124 hasConceptScore W2024270124C177264268 @default.
- W2024270124 hasConceptScore W2024270124C199360897 @default.
- W2024270124 hasConceptScore W2024270124C2778112365 @default.
- W2024270124 hasConceptScore W2024270124C33923547 @default.
- W2024270124 hasConceptScore W2024270124C37914503 @default.
- W2024270124 hasConceptScore W2024270124C41008148 @default.
- W2024270124 hasConceptScore W2024270124C54355233 @default.
- W2024270124 hasConceptScore W2024270124C73555534 @default.
- W2024270124 hasConceptScore W2024270124C86803240 @default.