Matches in SemOpenAlex for { <https://semopenalex.org/work/W1544288077> ?p ?o ?g. }
Showing items 1 to 97 of
97
with 100 items per page.
- W1544288077 abstract "The identification of binding sites for regulatory proteins in the up-stream region of genes is an important ingredient towards the understanding of transcription regulation. In recent years, novel experimental techniques, as gene expression arrays, and the availability of entire genome sequences have opened the possibility for more detailed investigations in this domain. Traditionally, the reconstruction of the profile of a binding site and the localization of all its occurrences in a sequence are treated as separate problems. The first is tackled using a small group of sequences, known or suspected to contain the binding site, but with neither position or pattern known. One successful approach to such reconstruction problem is based on a probabilistic model of the sequence, represented as concatenation of background and motif stochastic words. Maximum likelihood or maximum a-posteriori estimates are obtained with EM or Gibbs-sampler algorithms [13, 14]. The second problem is approached considering one or multiple sequences of variable length; the pattern characterizing the motif is assumed known. Possible locations are identified on the base of scoring functions that highlight the similarity of the motif with the sequence portions. Cut off values for such similarity scores are hard to determine: ad hoc solutions or estimations on a training set are often adopted [17, 18]. Typically these techniques are used to scan one sequence of interest against a data-base of known binding sites. While there are historical and practical reasons to consider these two problems as separate, the current post-genomic era, where we are confronted with large abundance of sequence, calls for a different approach. Consider the problem, tackled in [18], of identifying all the the binding sites of the known regulatory proteins in the genome of E. Coli. While formally similar to blasting a small sequence of interest against a data-base of known regulatory proteins, there are substantial differences in these genome-wide search. On the one hand, as one scans through the genome for binding sites of LexA—to take one example—and finds a substantial number of them, it seems appropriate one should use the information in the identified locations to update the current pattern description. On the other hand, given that the output is not going to include a small number of sites, that can be further investigated, but a large collection of them, the assessment of significance cut-off should be based on proper probabilistic statements. To address these issues, one would need a probability model for the entire genome sequence, that can lead to evaluation of specific a-posteriori probabilities of appearance of a binding site in any given location, and whose parameters can be estimated on the base of data. At the same time, given the scale of the problem, the model should be suitable for rapid computation. In an attempt to address such need we introduce here the Vocabulon model. Section 2 gives a description of the probability model we employ; its differences from others in the literature; and its current implementation. We then present the results of multiple investigations on E. Coli sequence. Given that genome-wide information on the location of binding sites is not available, we used results of gene expression array experiments to corroborate our results, arguing in favor of a novel perspective in array analysis." @default.
- W1544288077 created "2016-06-24" @default.
- W1544288077 creator A5007847950 @default.
- W1544288077 creator A5037586820 @default.
- W1544288077 creator A5055645717 @default.
- W1544288077 creator A5070269135 @default.
- W1544288077 date "2003-10-01" @default.
- W1544288077 modified "2023-09-23" @default.
- W1544288077 title "A Vocabulon Study of E.Coli Regulatory Sites with Feedback to Expression Array Analysis" @default.
- W1544288077 cites W1575431606 @default.
- W1544288077 cites W1969698218 @default.
- W1544288077 cites W1978132986 @default.
- W1544288077 cites W1998300401 @default.
- W1544288077 cites W2005244073 @default.
- W1544288077 cites W2005900670 @default.
- W1544288077 cites W2048869777 @default.
- W1544288077 cites W2064879068 @default.
- W1544288077 cites W2069645361 @default.
- W1544288077 cites W2085575847 @default.
- W1544288077 cites W2096139934 @default.
- W1544288077 cites W2098037868 @default.
- W1544288077 cites W2105792118 @default.
- W1544288077 cites W2134153749 @default.
- W1544288077 cites W2140551944 @default.
- W1544288077 cites W2142182986 @default.
- W1544288077 cites W2142446963 @default.
- W1544288077 cites W2144710927 @default.
- W1544288077 cites W2159883877 @default.
- W1544288077 cites W2165414513 @default.
- W1544288077 hasPublicationYear "2003" @default.
- W1544288077 type Work @default.
- W1544288077 sameAs 1544288077 @default.
- W1544288077 citedByCount "0" @default.
- W1544288077 crossrefType "journal-article" @default.
- W1544288077 hasAuthorship W1544288077A5007847950 @default.
- W1544288077 hasAuthorship W1544288077A5037586820 @default.
- W1544288077 hasAuthorship W1544288077A5055645717 @default.
- W1544288077 hasAuthorship W1544288077A5070269135 @default.
- W1544288077 hasConcept C101762097 @default.
- W1544288077 hasConcept C104317684 @default.
- W1544288077 hasConcept C111472728 @default.
- W1544288077 hasConcept C138885662 @default.
- W1544288077 hasConcept C150194340 @default.
- W1544288077 hasConcept C154945302 @default.
- W1544288077 hasConcept C21592294 @default.
- W1544288077 hasConcept C2778112365 @default.
- W1544288077 hasConcept C3662595 @default.
- W1544288077 hasConcept C41008148 @default.
- W1544288077 hasConcept C49937458 @default.
- W1544288077 hasConcept C54355233 @default.
- W1544288077 hasConcept C70721500 @default.
- W1544288077 hasConcept C75553542 @default.
- W1544288077 hasConcept C86339819 @default.
- W1544288077 hasConcept C86803240 @default.
- W1544288077 hasConceptScore W1544288077C101762097 @default.
- W1544288077 hasConceptScore W1544288077C104317684 @default.
- W1544288077 hasConceptScore W1544288077C111472728 @default.
- W1544288077 hasConceptScore W1544288077C138885662 @default.
- W1544288077 hasConceptScore W1544288077C150194340 @default.
- W1544288077 hasConceptScore W1544288077C154945302 @default.
- W1544288077 hasConceptScore W1544288077C21592294 @default.
- W1544288077 hasConceptScore W1544288077C2778112365 @default.
- W1544288077 hasConceptScore W1544288077C3662595 @default.
- W1544288077 hasConceptScore W1544288077C41008148 @default.
- W1544288077 hasConceptScore W1544288077C49937458 @default.
- W1544288077 hasConceptScore W1544288077C54355233 @default.
- W1544288077 hasConceptScore W1544288077C70721500 @default.
- W1544288077 hasConceptScore W1544288077C75553542 @default.
- W1544288077 hasConceptScore W1544288077C86339819 @default.
- W1544288077 hasConceptScore W1544288077C86803240 @default.
- W1544288077 hasLocation W15442880771 @default.
- W1544288077 hasOpenAccess W1544288077 @default.
- W1544288077 hasPrimaryLocation W15442880771 @default.
- W1544288077 hasRelatedWork W123743014 @default.
- W1544288077 hasRelatedWork W1534697636 @default.
- W1544288077 hasRelatedWork W1569638820 @default.
- W1544288077 hasRelatedWork W1571964016 @default.
- W1544288077 hasRelatedWork W1617327686 @default.
- W1544288077 hasRelatedWork W1660760521 @default.
- W1544288077 hasRelatedWork W2025325736 @default.
- W1544288077 hasRelatedWork W2031282421 @default.
- W1544288077 hasRelatedWork W2114953535 @default.
- W1544288077 hasRelatedWork W2122666123 @default.
- W1544288077 hasRelatedWork W2124112735 @default.
- W1544288077 hasRelatedWork W2125950230 @default.
- W1544288077 hasRelatedWork W2141996884 @default.
- W1544288077 hasRelatedWork W2144375142 @default.
- W1544288077 hasRelatedWork W2146074524 @default.
- W1544288077 hasRelatedWork W2155836137 @default.
- W1544288077 hasRelatedWork W2168030851 @default.
- W1544288077 hasRelatedWork W24935676 @default.
- W1544288077 hasRelatedWork W3007196580 @default.
- W1544288077 hasRelatedWork W76317090 @default.
- W1544288077 isParatext "false" @default.
- W1544288077 isRetracted "false" @default.
- W1544288077 magId "1544288077" @default.
- W1544288077 workType "article" @default.