Matches in SemOpenAlex for { <https://semopenalex.org/work/W2069319306> ?p ?o ?g. }
- W2069319306 abstract "A standard procedure in many areas of bioinformatics is to use a single multiple sequence alignment (MSA) as the basis for various types of analysis. However, downstream results may be highly sensitive to the alignment used, and neglecting the uncertainty in the alignment can lead to significant bias in the resulting inference. In recent years, a number of approaches have been developed for probabilistic sampling of alignments, rather than simply generating a single optimum. However, this type of probabilistic information is currently not widely used in the context of downstream inference, since most existing algorithms are set up to make use of a single alignment. In this work we present a framework for representing a set of sampled alignments as a directed acyclic graph (DAG) whose nodes are alignment columns; each path through this DAG then represents a valid alignment. Since the probabilities of individual columns can be estimated from empirical frequencies, this approach enables sample-based estimation of posterior alignment probabilities. Moreover, due to conditional independencies between columns, the graph structure encodes a much larger set of alignments than the original set of sampled MSAs, such that the effective sample size is greatly increased. The alignment DAG provides a natural way to represent a distribution in the space of MSAs, and allows for existing algorithms to be efficiently scaled up to operate on large sets of alignments. As an example, we show how this can be used to compute marginal probabilities for tree topologies, averaging over a very large number of MSAs. This framework can also be used to generate a statistically meaningful summary alignment; example applications show that this summary alignment is consistently more accurate than the majority of the alignment samples, leading to improvements in downstream tree inference. Implementations of the methods described in this article are available at http://statalign.github.io/WeaveAlign ." @default.
- W2069319306 created "2016-06-24" @default.
- W2069319306 creator A5031360521 @default.
- W2069319306 creator A5031774538 @default.
- W2069319306 creator A5056579625 @default.
- W2069319306 creator A5079496228 @default.
- W2069319306 creator A5084872691 @default.
- W2069319306 creator A5085275630 @default.
- W2069319306 date "2015-04-01" @default.
- W2069319306 modified "2023-10-06" @default.
- W2069319306 title "Efficient representation of uncertainty in multiple sequence alignments using directed acyclic graphs" @default.
- W2069319306 cites W1519266993 @default.
- W2069319306 cites W1567621547 @default.
- W2069319306 cites W1963866954 @default.
- W2069319306 cites W1963957860 @default.
- W2069319306 cites W1965998064 @default.
- W2069319306 cites W1968256798 @default.
- W2069319306 cites W1969153299 @default.
- W2069319306 cites W1970858544 @default.
- W2069319306 cites W1972124064 @default.
- W2069319306 cites W1991669894 @default.
- W2069319306 cites W1994990680 @default.
- W2069319306 cites W1995318306 @default.
- W2069319306 cites W1995988830 @default.
- W2069319306 cites W1996516710 @default.
- W2069319306 cites W1997631042 @default.
- W2069319306 cites W2000086166 @default.
- W2069319306 cites W2002638840 @default.
- W2069319306 cites W2002805476 @default.
- W2069319306 cites W2007668764 @default.
- W2069319306 cites W2007939821 @default.
- W2069319306 cites W2011289056 @default.
- W2069319306 cites W2017777683 @default.
- W2069319306 cites W2021839440 @default.
- W2069319306 cites W2021883973 @default.
- W2069319306 cites W2022349790 @default.
- W2069319306 cites W2023151136 @default.
- W2069319306 cites W2024665422 @default.
- W2069319306 cites W2026185422 @default.
- W2069319306 cites W2027490384 @default.
- W2069319306 cites W2027893274 @default.
- W2069319306 cites W2044164898 @default.
- W2069319306 cites W2044250973 @default.
- W2069319306 cites W2045895660 @default.
- W2069319306 cites W2049283139 @default.
- W2069319306 cites W2051545676 @default.
- W2069319306 cites W2053238889 @default.
- W2069319306 cites W2053281575 @default.
- W2069319306 cites W2060425093 @default.
- W2069319306 cites W2061029680 @default.
- W2069319306 cites W2061149284 @default.
- W2069319306 cites W2065283382 @default.
- W2069319306 cites W2068448872 @default.
- W2069319306 cites W2071749306 @default.
- W2069319306 cites W2073234837 @default.
- W2069319306 cites W2073630766 @default.
- W2069319306 cites W2074231493 @default.
- W2069319306 cites W2077026347 @default.
- W2069319306 cites W2079093876 @default.
- W2069319306 cites W2080942850 @default.
- W2069319306 cites W2083201277 @default.
- W2069319306 cites W2090748603 @default.
- W2069319306 cites W2092672051 @default.
- W2069319306 cites W2092979861 @default.
- W2069319306 cites W2093057423 @default.
- W2069319306 cites W2095040593 @default.
- W2069319306 cites W2097059553 @default.
- W2069319306 cites W2097730814 @default.
- W2069319306 cites W2097767833 @default.
- W2069319306 cites W2100099557 @default.
- W2069319306 cites W2101072339 @default.
- W2069319306 cites W2102424972 @default.
- W2069319306 cites W2102941368 @default.
- W2069319306 cites W2103130355 @default.
- W2069319306 cites W2105862765 @default.
- W2069319306 cites W2107335609 @default.
- W2069319306 cites W2107511706 @default.
- W2069319306 cites W2109119775 @default.
- W2069319306 cites W2111755094 @default.
- W2069319306 cites W2115888213 @default.
- W2069319306 cites W2117477006 @default.
- W2069319306 cites W2118756701 @default.
- W2069319306 cites W2120866529 @default.
- W2069319306 cites W2121691652 @default.
- W2069319306 cites W2122035121 @default.
- W2069319306 cites W2127164334 @default.
- W2069319306 cites W2127556561 @default.
- W2069319306 cites W2127688908 @default.
- W2069319306 cites W2127860913 @default.
- W2069319306 cites W2128482297 @default.
- W2069319306 cites W2130909239 @default.
- W2069319306 cites W2132972885 @default.
- W2069319306 cites W2133437368 @default.
- W2069319306 cites W2134274257 @default.
- W2069319306 cites W2134595458 @default.
- W2069319306 cites W2136570298 @default.
- W2069319306 cites W2138369959 @default.
- W2069319306 cites W2139449083 @default.
- W2069319306 cites W2140259073 @default.
- W2069319306 cites W2140771555 @default.