Matches in SemOpenAlex for { <https://semopenalex.org/work/W3020031777> ?p ?o ?g. }
- W3020031777 abstract "Counting frequent k-mers as patterns in a long sequence is a typical data-mining problem and contributes to broad applications in bioinformatics. Existing methods of k-mer counting usually produce all the possible contiguous subsequences of length k that are contained in a given sequence. However, these methods often spawn numerous redundant and possibly meaningless k-mers. This inherently gives rise to excessive runtime and space overhead at sequence analysis. Moreover, most of the k-mer counting algorithms consider only single-length contiguous subsequences, which limits the diversity of data mining and knowledge discovery. Previous studies have demonstrated that closed patterns contain the complete information regarding the corresponding patterns. Inspired by the application of the upperclosure property of sequential patterns, we propose an efficient algorithm, called CloKmer, to find closed k-mers of various lengths that are compact yet lossless representation of k-mers. CloKmer utilizes the inverted index to project the original sequence onto an equivalent set and then mines closed k-mers by exploiting space pruning and upper-closure checking. Our experimental results demonstrate that CloKmer generates average 44% less patters but still achieves a comparable accuracy when classifying transcription factor binding sites (TFBSs), compared to traditional (k-mer-based classification) methods." @default.
- W3020031777 created "2020-05-01" @default.
- W3020031777 creator A5012283696 @default.
- W3020031777 creator A5026321844 @default.
- W3020031777 creator A5031510017 @default.
- W3020031777 creator A5037475370 @default.
- W3020031777 creator A5038053049 @default.
- W3020031777 creator A5044728743 @default.
- W3020031777 date "2020-02-01" @default.
- W3020031777 modified "2023-10-15" @default.
- W3020031777 title "Efficient Mining Closed k-Mers from DNA and Protein Sequences" @default.
- W3020031777 cites W1572096077 @default.
- W3020031777 cites W1572541440 @default.
- W3020031777 cites W1676985236 @default.
- W3020031777 cites W1983595396 @default.
- W3020031777 cites W2000993284 @default.
- W3020031777 cites W2015536647 @default.
- W3020031777 cites W2018434937 @default.
- W3020031777 cites W2034589438 @default.
- W3020031777 cites W2058009774 @default.
- W3020031777 cites W2084014179 @default.
- W3020031777 cites W2096128575 @default.
- W3020031777 cites W2112971308 @default.
- W3020031777 cites W2122182354 @default.
- W3020031777 cites W2125266506 @default.
- W3020031777 cites W2170036644 @default.
- W3020031777 cites W2171003081 @default.
- W3020031777 cites W2197164549 @default.
- W3020031777 cites W2291469010 @default.
- W3020031777 cites W2343816076 @default.
- W3020031777 cites W2344493264 @default.
- W3020031777 cites W2411730464 @default.
- W3020031777 cites W2478908476 @default.
- W3020031777 cites W2617103607 @default.
- W3020031777 cites W2631063318 @default.
- W3020031777 cites W2789843538 @default.
- W3020031777 cites W2891436740 @default.
- W3020031777 cites W2911213273 @default.
- W3020031777 cites W2918973339 @default.
- W3020031777 cites W2950326881 @default.
- W3020031777 cites W2950964375 @default.
- W3020031777 cites W2952342008 @default.
- W3020031777 cites W2952363540 @default.
- W3020031777 cites W2967923991 @default.
- W3020031777 cites W758607154 @default.
- W3020031777 cites W952339689 @default.
- W3020031777 doi "https://doi.org/10.1109/bigcomp48618.2020.00-51" @default.
- W3020031777 hasPublicationYear "2020" @default.
- W3020031777 type Work @default.
- W3020031777 sameAs 3020031777 @default.
- W3020031777 citedByCount "3" @default.
- W3020031777 countsByYear W30200317772021 @default.
- W3020031777 countsByYear W30200317772022 @default.
- W3020031777 crossrefType "proceedings-article" @default.
- W3020031777 hasAuthorship W3020031777A5012283696 @default.
- W3020031777 hasAuthorship W3020031777A5026321844 @default.
- W3020031777 hasAuthorship W3020031777A5031510017 @default.
- W3020031777 hasAuthorship W3020031777A5037475370 @default.
- W3020031777 hasAuthorship W3020031777A5038053049 @default.
- W3020031777 hasAuthorship W3020031777A5044728743 @default.
- W3020031777 hasConcept C108010975 @default.
- W3020031777 hasConcept C111472728 @default.
- W3020031777 hasConcept C111919701 @default.
- W3020031777 hasConcept C11413529 @default.
- W3020031777 hasConcept C124101348 @default.
- W3020031777 hasConcept C138885662 @default.
- W3020031777 hasConcept C177264268 @default.
- W3020031777 hasConcept C189950617 @default.
- W3020031777 hasConcept C199360897 @default.
- W3020031777 hasConcept C2279292 @default.
- W3020031777 hasConcept C2778112365 @default.
- W3020031777 hasConcept C2779960059 @default.
- W3020031777 hasConcept C41008148 @default.
- W3020031777 hasConcept C51679486 @default.
- W3020031777 hasConcept C54355233 @default.
- W3020031777 hasConcept C552990157 @default.
- W3020031777 hasConcept C6557445 @default.
- W3020031777 hasConcept C78548338 @default.
- W3020031777 hasConcept C80444323 @default.
- W3020031777 hasConcept C81081738 @default.
- W3020031777 hasConcept C86803240 @default.
- W3020031777 hasConceptScore W3020031777C108010975 @default.
- W3020031777 hasConceptScore W3020031777C111472728 @default.
- W3020031777 hasConceptScore W3020031777C111919701 @default.
- W3020031777 hasConceptScore W3020031777C11413529 @default.
- W3020031777 hasConceptScore W3020031777C124101348 @default.
- W3020031777 hasConceptScore W3020031777C138885662 @default.
- W3020031777 hasConceptScore W3020031777C177264268 @default.
- W3020031777 hasConceptScore W3020031777C189950617 @default.
- W3020031777 hasConceptScore W3020031777C199360897 @default.
- W3020031777 hasConceptScore W3020031777C2279292 @default.
- W3020031777 hasConceptScore W3020031777C2778112365 @default.
- W3020031777 hasConceptScore W3020031777C2779960059 @default.
- W3020031777 hasConceptScore W3020031777C41008148 @default.
- W3020031777 hasConceptScore W3020031777C51679486 @default.
- W3020031777 hasConceptScore W3020031777C54355233 @default.
- W3020031777 hasConceptScore W3020031777C552990157 @default.
- W3020031777 hasConceptScore W3020031777C6557445 @default.
- W3020031777 hasConceptScore W3020031777C78548338 @default.
- W3020031777 hasConceptScore W3020031777C80444323 @default.