Matches in SemOpenAlex for { <https://semopenalex.org/work/W1483806449> ?p ?o ?g. }
- W1483806449 abstract "Gene expression of eucaryotes is regulated through transcription factors, which are molecules able to attach to the binding sites in the DNA sequence. These binding sites are small pieces of DNA usually found upstream from the gene they regulate. As the binding sites play an important role in the gene expression, it is of interest to find out their characteristics. In this paper, we look for dependencies and independencies between these binding sites using independent component analysis (ICA), non-negative matrix factorization (NMF), probabilistic latent semantic analysis (PLSA) and the method of frequent sets. The data used are human gene upstream regions and possible binding sites listed in a biological database. Also, results on the baker's yeast (S. Cerevisiae) upstream regions are briefly discussed for comparison. ICA, NMF and PLSA are latent variable methods that decompose the observed data into smaller components. Of these, ICA and NMF were originally aimed for continuous data. We show that these methods can be successfully used on discrete DNA data as well. PLSA and the method of frequent sets were created for discrete data sets. The above methods reveal partially overlapping sets of possible binding sites such that the binding sites within a set are dependent of each other. The methods of frequent sets and NMF give a good overview of the most common data structures, whereas using ICA and PLSA we find large sets that are surprisingly frequent. That is, sets of very frequently occurring possible binding sites can be found near hundreds or thousands of genes; also interesting but less frequent ones co-occur surprisingly often." @default.
- W1483806449 created "2016-06-24" @default.
- W1483806449 creator A5039954109 @default.
- W1483806449 creator A5083953371 @default.
- W1483806449 date "2005-03-31" @default.
- W1483806449 modified "2023-10-18" @default.
- W1483806449 title "Dependencies between Transcription Factor Binding Sites: Comparison between ICA, NMF, PLSA and Frequent Sets" @default.
- W1483806449 cites W1548802052 @default.
- W1483806449 cites W1902027874 @default.
- W1483806449 cites W1972221303 @default.
- W1483806449 cites W2005244073 @default.
- W1483806449 cites W2019502123 @default.
- W1483806449 cites W2032231928 @default.
- W1483806449 cites W2043317951 @default.
- W1483806449 cites W2051139824 @default.
- W1483806449 cites W2099741732 @default.
- W1483806449 cites W2107743791 @default.
- W1483806449 cites W2115772919 @default.
- W1483806449 cites W2125227861 @default.
- W1483806449 cites W2128091972 @default.
- W1483806449 cites W2134731454 @default.
- W1483806449 cites W2135029798 @default.
- W1483806449 cites W2142446963 @default.
- W1483806449 cites W2144710927 @default.
- W1483806449 cites W2155602043 @default.
- W1483806449 cites W2156026066 @default.
- W1483806449 cites W2166898290 @default.
- W1483806449 doi "https://doi.org/10.1109/icdm.2004.10086" @default.
- W1483806449 hasPublicationYear "2005" @default.
- W1483806449 type Work @default.
- W1483806449 sameAs 1483806449 @default.
- W1483806449 citedByCount "7" @default.
- W1483806449 countsByYear W14838064492012 @default.
- W1483806449 countsByYear W14838064492014 @default.
- W1483806449 crossrefType "proceedings-article" @default.
- W1483806449 hasAuthorship W1483806449A5039954109 @default.
- W1483806449 hasAuthorship W1483806449A5083953371 @default.
- W1483806449 hasConcept C101762097 @default.
- W1483806449 hasConcept C104317684 @default.
- W1483806449 hasConcept C112933361 @default.
- W1483806449 hasConcept C121332964 @default.
- W1483806449 hasConcept C124101348 @default.
- W1483806449 hasConcept C150194340 @default.
- W1483806449 hasConcept C152671427 @default.
- W1483806449 hasConcept C153180895 @default.
- W1483806449 hasConcept C154945302 @default.
- W1483806449 hasConcept C158693339 @default.
- W1483806449 hasConcept C3662595 @default.
- W1483806449 hasConcept C41008148 @default.
- W1483806449 hasConcept C42355184 @default.
- W1483806449 hasConcept C51167844 @default.
- W1483806449 hasConcept C51432778 @default.
- W1483806449 hasConcept C54355233 @default.
- W1483806449 hasConcept C62520636 @default.
- W1483806449 hasConcept C70721500 @default.
- W1483806449 hasConcept C86803240 @default.
- W1483806449 hasConceptScore W1483806449C101762097 @default.
- W1483806449 hasConceptScore W1483806449C104317684 @default.
- W1483806449 hasConceptScore W1483806449C112933361 @default.
- W1483806449 hasConceptScore W1483806449C121332964 @default.
- W1483806449 hasConceptScore W1483806449C124101348 @default.
- W1483806449 hasConceptScore W1483806449C150194340 @default.
- W1483806449 hasConceptScore W1483806449C152671427 @default.
- W1483806449 hasConceptScore W1483806449C153180895 @default.
- W1483806449 hasConceptScore W1483806449C154945302 @default.
- W1483806449 hasConceptScore W1483806449C158693339 @default.
- W1483806449 hasConceptScore W1483806449C3662595 @default.
- W1483806449 hasConceptScore W1483806449C41008148 @default.
- W1483806449 hasConceptScore W1483806449C42355184 @default.
- W1483806449 hasConceptScore W1483806449C51167844 @default.
- W1483806449 hasConceptScore W1483806449C51432778 @default.
- W1483806449 hasConceptScore W1483806449C54355233 @default.
- W1483806449 hasConceptScore W1483806449C62520636 @default.
- W1483806449 hasConceptScore W1483806449C70721500 @default.
- W1483806449 hasConceptScore W1483806449C86803240 @default.
- W1483806449 hasLocation W14838064491 @default.
- W1483806449 hasOpenAccess W1483806449 @default.
- W1483806449 hasPrimaryLocation W14838064491 @default.
- W1483806449 hasRelatedWork W1485123226 @default.
- W1483806449 hasRelatedWork W1513151891 @default.
- W1483806449 hasRelatedWork W184810141 @default.
- W1483806449 hasRelatedWork W1973863735 @default.
- W1483806449 hasRelatedWork W2024435696 @default.
- W1483806449 hasRelatedWork W2062872358 @default.
- W1483806449 hasRelatedWork W2077399038 @default.
- W1483806449 hasRelatedWork W2092294665 @default.
- W1483806449 hasRelatedWork W2100367856 @default.
- W1483806449 hasRelatedWork W2116109811 @default.
- W1483806449 hasRelatedWork W2143489962 @default.
- W1483806449 hasRelatedWork W2158776283 @default.
- W1483806449 hasRelatedWork W2414477801 @default.
- W1483806449 hasRelatedWork W2510808830 @default.
- W1483806449 hasRelatedWork W2953272377 @default.
- W1483806449 hasRelatedWork W2969877666 @default.
- W1483806449 hasRelatedWork W3100522014 @default.
- W1483806449 hasRelatedWork W65814815 @default.
- W1483806449 hasRelatedWork W2185687106 @default.
- W1483806449 hasRelatedWork W3099687987 @default.
- W1483806449 isParatext "false" @default.
- W1483806449 isRetracted "false" @default.