Matches in SemOpenAlex for { <https://semopenalex.org/work/W161731104> ?p ?o ?g. }
- W161731104 endingPage "481" @default.
- W161731104 startingPage "470" @default.
- W161731104 abstract "AbstractCorrelated protein-DNA interaction (binding cores) between transcription factor (TFs) and transcription factor binding sites (TFBSs) are usually identified by costly 3D structural experiments. To avoid numerous unsuccessful trials, we are motivated to develop a cheap and efficient sequence-based computational method for providing testable novel binding cores with high confidence to accelerate the experiments. Although there are abundant sequence-based motif discovery algorithms, few directly address associating both TF and TFBS core motifs which are both verifiable on 3D structures. In this paper, we formally define the problem of discovering correlated TF-TFBS binding cores, and apply association rule mining techniques over existing real sequence data (TRANSFAC). The proposed algorithm first builds two frequent sequence tree (FS-Tree) structures storing condensed information for association rule mining. Association rules are then generated by depth-first traversal on the structures. FS-Trees have several advantages to support further applications, including efficient calculation of the support and confidence, simple generation of candidate rules, and applicability of effective pruning techniques. As a result, the FS-Trees serve as a useful basis for more general extensions related to biological binding core identification. We tested our algorithm on real sequence data from the biological database TRANSFAC and focus on efficiency comparisons with the recent work employing association rule mining. The rules discovered reveal real TF-TFBS binding cores in independent 3D verifications on Protein Data Bank (PDB).KeywordsBioinformaticsassociation rule miningsequencealgorithm" @default.
- W161731104 created "2016-06-24" @default.
- W161731104 creator A5010336848 @default.
- W161731104 creator A5034812768 @default.
- W161731104 creator A5077261151 @default.
- W161731104 creator A5088763749 @default.
- W161731104 date "2012-01-01" @default.
- W161731104 modified "2023-09-24" @default.
- W161731104 title "Efficient Algorithm for Mining Correlated Protein-DNA Binding Cores" @default.
- W161731104 cites W1482810056 @default.
- W161731104 cites W1486659038 @default.
- W161731104 cites W1641039719 @default.
- W161731104 cites W1965267927 @default.
- W161731104 cites W1965702297 @default.
- W161731104 cites W1981482769 @default.
- W161731104 cites W1989080074 @default.
- W161731104 cites W2016661940 @default.
- W161731104 cites W2032226242 @default.
- W161731104 cites W2053743126 @default.
- W161731104 cites W2064853889 @default.
- W161731104 cites W2068383400 @default.
- W161731104 cites W2099404336 @default.
- W161731104 cites W2114066176 @default.
- W161731104 cites W2135030836 @default.
- W161731104 cites W2135572605 @default.
- W161731104 cites W2148014281 @default.
- W161731104 cites W2157076973 @default.
- W161731104 cites W2998574808 @default.
- W161731104 cites W4248966671 @default.
- W161731104 cites W4253336088 @default.
- W161731104 doi "https://doi.org/10.1007/978-3-642-29038-1_34" @default.
- W161731104 hasPublicationYear "2012" @default.
- W161731104 type Work @default.
- W161731104 sameAs 161731104 @default.
- W161731104 citedByCount "3" @default.
- W161731104 countsByYear W1617311042013 @default.
- W161731104 countsByYear W1617311042014 @default.
- W161731104 countsByYear W1617311042015 @default.
- W161731104 crossrefType "book-chapter" @default.
- W161731104 hasAuthorship W161731104A5010336848 @default.
- W161731104 hasAuthorship W161731104A5034812768 @default.
- W161731104 hasAuthorship W161731104A5077261151 @default.
- W161731104 hasAuthorship W161731104A5088763749 @default.
- W161731104 hasConcept C101762097 @default.
- W161731104 hasConcept C104317684 @default.
- W161731104 hasConcept C108010975 @default.
- W161731104 hasConcept C113174947 @default.
- W161731104 hasConcept C11413529 @default.
- W161731104 hasConcept C124101348 @default.
- W161731104 hasConcept C134306372 @default.
- W161731104 hasConcept C150194340 @default.
- W161731104 hasConcept C193524817 @default.
- W161731104 hasConcept C2778112365 @default.
- W161731104 hasConcept C33923547 @default.
- W161731104 hasConcept C3662595 @default.
- W161731104 hasConcept C41008148 @default.
- W161731104 hasConcept C54355233 @default.
- W161731104 hasConcept C6557445 @default.
- W161731104 hasConcept C70721500 @default.
- W161731104 hasConcept C80444323 @default.
- W161731104 hasConcept C86803240 @default.
- W161731104 hasConceptScore W161731104C101762097 @default.
- W161731104 hasConceptScore W161731104C104317684 @default.
- W161731104 hasConceptScore W161731104C108010975 @default.
- W161731104 hasConceptScore W161731104C113174947 @default.
- W161731104 hasConceptScore W161731104C11413529 @default.
- W161731104 hasConceptScore W161731104C124101348 @default.
- W161731104 hasConceptScore W161731104C134306372 @default.
- W161731104 hasConceptScore W161731104C150194340 @default.
- W161731104 hasConceptScore W161731104C193524817 @default.
- W161731104 hasConceptScore W161731104C2778112365 @default.
- W161731104 hasConceptScore W161731104C33923547 @default.
- W161731104 hasConceptScore W161731104C3662595 @default.
- W161731104 hasConceptScore W161731104C41008148 @default.
- W161731104 hasConceptScore W161731104C54355233 @default.
- W161731104 hasConceptScore W161731104C6557445 @default.
- W161731104 hasConceptScore W161731104C70721500 @default.
- W161731104 hasConceptScore W161731104C80444323 @default.
- W161731104 hasConceptScore W161731104C86803240 @default.
- W161731104 hasLocation W1617311041 @default.
- W161731104 hasOpenAccess W161731104 @default.
- W161731104 hasPrimaryLocation W1617311041 @default.
- W161731104 hasRelatedWork W1510549104 @default.
- W161731104 hasRelatedWork W1912213889 @default.
- W161731104 hasRelatedWork W1986105390 @default.
- W161731104 hasRelatedWork W1992748956 @default.
- W161731104 hasRelatedWork W2036749221 @default.
- W161731104 hasRelatedWork W2038093918 @default.
- W161731104 hasRelatedWork W2048088032 @default.
- W161731104 hasRelatedWork W205473438 @default.
- W161731104 hasRelatedWork W2091572769 @default.
- W161731104 hasRelatedWork W2096210973 @default.
- W161731104 hasRelatedWork W2126080985 @default.
- W161731104 hasRelatedWork W2141409759 @default.
- W161731104 hasRelatedWork W2156543648 @default.
- W161731104 hasRelatedWork W2167445572 @default.
- W161731104 hasRelatedWork W2167895467 @default.
- W161731104 hasRelatedWork W2221564667 @default.