Matches in SemOpenAlex for { <https://semopenalex.org/work/W2012247674> ?p ?o ?g. }
- W2012247674 endingPage "3184" @default.
- W2012247674 startingPage "3171" @default.
- W2012247674 abstract "The identification and characterization of binding sites of DNA-binding molecules, including transcription factors (TFs), is a critical problem at the interface of chemistry, biology and molecular medicine. The Cognate Site Identification (CSI) array is a high-throughput microarray platform for measuring comprehensive recognition profiles of DNA-binding molecules. This technique produces datasets that are useful not only for identifying binding sites of previously uncharacterized TFs but also for elucidating dependencies, both local and nonlocal, between the nucleotides at different positions of the recognition sites. We have developed a regression tree technique, CSI-Tree, for exploring the spectrum of binding sites of DNA-binding molecules. Our approach constructs regression trees utilizing the CSI data of unaligned sequences. The resulting model partitions the binding spectrum into homogeneous regions of position specific nucleotide effects. Each homogeneous partition is then summarized by a position weight matrix (PWM). Hence, the final outcome is a binding intensity rank-ordered collection of PWMs each of which spans a different region in the binding spectrum. Nodes of the regression tree depict the critical position/nucleotide combinations. We analyze the CSI data of the eukaryotic TF Nkx-2.5 and two engineered small molecule DNA ligands and obtain unique insights into their binding properties. The CSI tree for Nkx-2.5 reveals an interaction between two positions of the binding profile and elucidates how different nucleotide combinations at these two positions lead to different binding affinities. The CSI trees for the engineered DNA ligands exhibit a common preference for the dinucleotide AA in the first two positions, which is consistent with preference for a narrow and relatively flat minor groove. We carry out a reanalysis of these data with a mixture of PWMs approach. This approach is an advancement over the simple PWM model and accommodates position dependencies based on only sequence data. Our analysis indicates that the dependencies revealed by the CSI-Tree are challenging to discover without the actual binding intensities. Moreover, such a mixture model is highly sensitive to the number and length of the sequences analyzed. In contrast, CSI-Tree provides interpretable and concise summaries of the complete recognition profiles of DNA-binding molecules by utilizing binding affinities." @default.
- W2012247674 created "2016-06-24" @default.
- W2012247674 creator A5012816191 @default.
- W2012247674 creator A5027290261 @default.
- W2012247674 creator A5057070061 @default.
- W2012247674 creator A5066902283 @default.
- W2012247674 date "2008-04-13" @default.
- W2012247674 modified "2023-09-26" @default.
- W2012247674 title "CSI-Tree: a regression tree approach for modeling binding properties of DNA-binding molecules based on cognate site identification (CSI) data" @default.
- W2012247674 cites W1968453480 @default.
- W2012247674 cites W1977389736 @default.
- W2012247674 cites W1984202875 @default.
- W2012247674 cites W1994122295 @default.
- W2012247674 cites W2007071268 @default.
- W2012247674 cites W2010384381 @default.
- W2012247674 cites W2011962761 @default.
- W2012247674 cites W2013437454 @default.
- W2012247674 cites W2015622114 @default.
- W2012247674 cites W2017764099 @default.
- W2012247674 cites W2017986941 @default.
- W2012247674 cites W2035564383 @default.
- W2012247674 cites W2046226908 @default.
- W2012247674 cites W2047713072 @default.
- W2012247674 cites W2054624670 @default.
- W2012247674 cites W2056002855 @default.
- W2012247674 cites W2069368388 @default.
- W2012247674 cites W2080286449 @default.
- W2012247674 cites W2080498142 @default.
- W2012247674 cites W2083052219 @default.
- W2012247674 cites W2084620487 @default.
- W2012247674 cites W2087752336 @default.
- W2012247674 cites W2089417780 @default.
- W2012247674 cites W2089882687 @default.
- W2012247674 cites W2090010568 @default.
- W2012247674 cites W2092869314 @default.
- W2012247674 cites W2100668965 @default.
- W2012247674 cites W2103453943 @default.
- W2012247674 cites W2120865735 @default.
- W2012247674 cites W2135221645 @default.
- W2012247674 cites W2137342707 @default.
- W2012247674 cites W2140115103 @default.
- W2012247674 cites W2140885056 @default.
- W2012247674 cites W2146393977 @default.
- W2012247674 cites W2147946837 @default.
- W2012247674 cites W2151945801 @default.
- W2012247674 cites W2166277117 @default.
- W2012247674 cites W2166429927 @default.
- W2012247674 cites W2168157737 @default.
- W2012247674 cites W2171153226 @default.
- W2012247674 doi "https://doi.org/10.1093/nar/gkn057" @default.
- W2012247674 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2425502" @default.
- W2012247674 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/18411210" @default.
- W2012247674 hasPublicationYear "2008" @default.
- W2012247674 type Work @default.
- W2012247674 sameAs 2012247674 @default.
- W2012247674 citedByCount "13" @default.
- W2012247674 countsByYear W20122476742012 @default.
- W2012247674 countsByYear W20122476742013 @default.
- W2012247674 countsByYear W20122476742016 @default.
- W2012247674 countsByYear W20122476742017 @default.
- W2012247674 countsByYear W20122476742018 @default.
- W2012247674 crossrefType "journal-article" @default.
- W2012247674 hasAuthorship W2012247674A5012816191 @default.
- W2012247674 hasAuthorship W2012247674A5027290261 @default.
- W2012247674 hasAuthorship W2012247674A5057070061 @default.
- W2012247674 hasAuthorship W2012247674A5066902283 @default.
- W2012247674 hasBestOaLocation W20122476742 @default.
- W2012247674 hasConcept C101762097 @default.
- W2012247674 hasConcept C104317684 @default.
- W2012247674 hasConcept C107824862 @default.
- W2012247674 hasConcept C113174947 @default.
- W2012247674 hasConcept C134306372 @default.
- W2012247674 hasConcept C150194340 @default.
- W2012247674 hasConcept C161624437 @default.
- W2012247674 hasConcept C33923547 @default.
- W2012247674 hasConcept C3662595 @default.
- W2012247674 hasConcept C54355233 @default.
- W2012247674 hasConcept C552990157 @default.
- W2012247674 hasConcept C70721500 @default.
- W2012247674 hasConcept C86803240 @default.
- W2012247674 hasConceptScore W2012247674C101762097 @default.
- W2012247674 hasConceptScore W2012247674C104317684 @default.
- W2012247674 hasConceptScore W2012247674C107824862 @default.
- W2012247674 hasConceptScore W2012247674C113174947 @default.
- W2012247674 hasConceptScore W2012247674C134306372 @default.
- W2012247674 hasConceptScore W2012247674C150194340 @default.
- W2012247674 hasConceptScore W2012247674C161624437 @default.
- W2012247674 hasConceptScore W2012247674C33923547 @default.
- W2012247674 hasConceptScore W2012247674C3662595 @default.
- W2012247674 hasConceptScore W2012247674C54355233 @default.
- W2012247674 hasConceptScore W2012247674C552990157 @default.
- W2012247674 hasConceptScore W2012247674C70721500 @default.
- W2012247674 hasConceptScore W2012247674C86803240 @default.
- W2012247674 hasIssue "10" @default.
- W2012247674 hasLocation W20122476741 @default.
- W2012247674 hasLocation W20122476742 @default.
- W2012247674 hasLocation W20122476743 @default.
- W2012247674 hasLocation W20122476744 @default.