Matches in SemOpenAlex for { <https://semopenalex.org/work/W2057804984> ?p ?o ?g. }
- W2057804984 endingPage "492" @default.
- W2057804984 startingPage "481" @default.
- W2057804984 abstract "Patterns/subsequences frequently appearing in sequences provide essential knowledge for domain experts, such as molecular biologists, to discover rules or patterns hidden behind the data. Due to the inherent complex nature of the biological data, patterns rarely exactly reproduce and repeat themselves, but rather appear with a slightly different form in each of its appearances. A gap constraint (In this paper, a gap constraint (also referred to as a wildcard) is a character that can be substituted for any character predefined in an alphabet.) provides flexibility for users to capture useful patterns even if their appearances vary in the sequences. In order to find patterns, existing tools require users to explicitly specify gap constraints beforehand. In reality, it is often nontrivial or time-consuming for users to provide proper gap constraint values. In addition, a change made to the gap values may give completely different results, and require a separate time-consuming re-mining procedure. Therefore, it is desirable to automatically and efficiently find patterns without involving user-specified gap requirements. In this paper, we study the problem of frequent pattern mining without user-specified gap constraints and propose PMBC (namely P̲atternM̲ining from B̲iological sequences with wildcard C onstraints) to solve the problem. Given a sequence and a support threshold value (i.e. pattern frequency threshold), PMBC intends to discover all subsequences with their support values equal to or greater than the given threshold value. The frequent subsequences then form patterns later on. Two heuristic methods (one-way vs. two-way scans) are proposed to discover frequent subsequences and estimate their frequency in the sequences. Experimental results on both synthetic and real-world DNA sequences demonstrate the performance of both methods for frequent pattern mining and pattern frequency estimation." @default.
- W2057804984 created "2016-06-24" @default.
- W2057804984 creator A5042628946 @default.
- W2057804984 creator A5072963545 @default.
- W2057804984 creator A5080738591 @default.
- W2057804984 creator A5084641325 @default.
- W2057804984 date "2013-06-01" @default.
- W2057804984 modified "2023-09-24" @default.
- W2057804984 title "PMBC: Pattern mining from biological sequences with wildcard constraints" @default.
- W2057804984 cites W1532612911 @default.
- W2057804984 cites W1608194207 @default.
- W2057804984 cites W1973846222 @default.
- W2057804984 cites W1979180881 @default.
- W2057804984 cites W2003957650 @default.
- W2057804984 cites W2006013660 @default.
- W2057804984 cites W2008254093 @default.
- W2057804984 cites W2012924720 @default.
- W2057804984 cites W2022148737 @default.
- W2057804984 cites W2030132106 @default.
- W2057804984 cites W2046399516 @default.
- W2057804984 cites W2052496634 @default.
- W2057804984 cites W2053767428 @default.
- W2057804984 cites W2056829615 @default.
- W2057804984 cites W2072783440 @default.
- W2057804984 cites W2073661767 @default.
- W2057804984 cites W2077065215 @default.
- W2057804984 cites W2077378918 @default.
- W2057804984 cites W2089737197 @default.
- W2057804984 cites W2099085937 @default.
- W2057804984 cites W2108419107 @default.
- W2057804984 cites W2114272547 @default.
- W2057804984 cites W2115990786 @default.
- W2057804984 cites W2119423166 @default.
- W2057804984 cites W2129436231 @default.
- W2057804984 cites W2141408320 @default.
- W2057804984 cites W2143417222 @default.
- W2057804984 cites W2145091349 @default.
- W2057804984 cites W2145635625 @default.
- W2057804984 cites W2151077154 @default.
- W2057804984 cites W2151198139 @default.
- W2057804984 cites W2151831732 @default.
- W2057804984 cites W2156026066 @default.
- W2057804984 cites W2158086655 @default.
- W2057804984 cites W2158454296 @default.
- W2057804984 doi "https://doi.org/10.1016/j.compbiomed.2013.02.006" @default.
- W2057804984 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/23566394" @default.
- W2057804984 hasPublicationYear "2013" @default.
- W2057804984 type Work @default.
- W2057804984 sameAs 2057804984 @default.
- W2057804984 citedByCount "40" @default.
- W2057804984 countsByYear W20578049842013 @default.
- W2057804984 countsByYear W20578049842014 @default.
- W2057804984 countsByYear W20578049842015 @default.
- W2057804984 countsByYear W20578049842016 @default.
- W2057804984 countsByYear W20578049842017 @default.
- W2057804984 countsByYear W20578049842018 @default.
- W2057804984 countsByYear W20578049842019 @default.
- W2057804984 countsByYear W20578049842020 @default.
- W2057804984 countsByYear W20578049842021 @default.
- W2057804984 countsByYear W20578049842022 @default.
- W2057804984 crossrefType "journal-article" @default.
- W2057804984 hasAuthorship W2057804984A5042628946 @default.
- W2057804984 hasAuthorship W2057804984A5072963545 @default.
- W2057804984 hasAuthorship W2057804984A5080738591 @default.
- W2057804984 hasAuthorship W2057804984A5084641325 @default.
- W2057804984 hasConcept C105795698 @default.
- W2057804984 hasConcept C11413529 @default.
- W2057804984 hasConcept C119857082 @default.
- W2057804984 hasConcept C124101348 @default.
- W2057804984 hasConcept C134306372 @default.
- W2057804984 hasConcept C154945302 @default.
- W2057804984 hasConcept C173801870 @default.
- W2057804984 hasConcept C2524010 @default.
- W2057804984 hasConcept C2776036281 @default.
- W2057804984 hasConcept C2776291640 @default.
- W2057804984 hasConcept C2778112365 @default.
- W2057804984 hasConcept C2780598303 @default.
- W2057804984 hasConcept C2780861071 @default.
- W2057804984 hasConcept C33923547 @default.
- W2057804984 hasConcept C36503486 @default.
- W2057804984 hasConcept C41008148 @default.
- W2057804984 hasConcept C54355233 @default.
- W2057804984 hasConcept C80444323 @default.
- W2057804984 hasConcept C86803240 @default.
- W2057804984 hasConceptScore W2057804984C105795698 @default.
- W2057804984 hasConceptScore W2057804984C11413529 @default.
- W2057804984 hasConceptScore W2057804984C119857082 @default.
- W2057804984 hasConceptScore W2057804984C124101348 @default.
- W2057804984 hasConceptScore W2057804984C134306372 @default.
- W2057804984 hasConceptScore W2057804984C154945302 @default.
- W2057804984 hasConceptScore W2057804984C173801870 @default.
- W2057804984 hasConceptScore W2057804984C2524010 @default.
- W2057804984 hasConceptScore W2057804984C2776036281 @default.
- W2057804984 hasConceptScore W2057804984C2776291640 @default.
- W2057804984 hasConceptScore W2057804984C2778112365 @default.
- W2057804984 hasConceptScore W2057804984C2780598303 @default.
- W2057804984 hasConceptScore W2057804984C2780861071 @default.
- W2057804984 hasConceptScore W2057804984C33923547 @default.