Matches in SemOpenAlex for { <https://semopenalex.org/work/W4214891019> ?p ?o ?g. }
- W4214891019 abstract "Abstract Motivation Multiple sequence alignment (MSA) is a basic step in many bioinformatics pipelines. However, achieving highly accurate alignments on large datasets, especially those with sequence length heterogeneity, is a challenging task. UPP (Ultra-large multiple sequence alignment using Phylogeny-aware Profiles) is a method for MSA estimation that builds an ensemble of Hidden Markov Models (eHMM) to represent an estimated alignment on the full length sequences in the input, and then adds the remaining sequences into the alignment using selected HMMs in the ensemble. Although UPP provides good accuracy, it is computationally intensive on large datasets. Results We present UPP2, a direct improvement on UPP. The main advance is a fast technique for selecting HMMs in the ensemble that allows us to achieve the same accuracy as UPP but with greatly reduced runtime. We show UPP2 produces more accurate alignments compared to leading MSA methods on datasets exhibiting substantial sequence length heterogeneity, and is among the most accurate otherwise. Availability https://github.com/gillichu/sepp Contact warnow@illinois.edu" @default.
- W4214891019 created "2022-03-05" @default.
- W4214891019 creator A5006996313 @default.
- W4214891019 creator A5032280221 @default.
- W4214891019 creator A5032411541 @default.
- W4214891019 creator A5032942384 @default.
- W4214891019 creator A5069222630 @default.
- W4214891019 date "2022-03-01" @default.
- W4214891019 modified "2023-10-18" @default.
- W4214891019 title "UPP2: Fast and Accurate Alignment Estimation of Datasets with Fragmentary Sequences" @default.
- W4214891019 cites W1531430623 @default.
- W4214891019 cites W1646393397 @default.
- W4214891019 cites W2007668764 @default.
- W4214891019 cites W2033900229 @default.
- W4214891019 cites W2036792999 @default.
- W4214891019 cites W2056251063 @default.
- W4214891019 cites W2103130355 @default.
- W4214891019 cites W2107511706 @default.
- W4214891019 cites W2127322768 @default.
- W4214891019 cites W2132926880 @default.
- W4214891019 cites W2138122982 @default.
- W4214891019 cites W2141152740 @default.
- W4214891019 cites W2153800802 @default.
- W4214891019 cites W2154598603 @default.
- W4214891019 cites W2159638180 @default.
- W4214891019 cites W2160378127 @default.
- W4214891019 cites W2279493624 @default.
- W4214891019 cites W2904030683 @default.
- W4214891019 cites W2919831875 @default.
- W4214891019 cites W2991142785 @default.
- W4214891019 cites W3044581192 @default.
- W4214891019 cites W3109134151 @default.
- W4214891019 cites W3193928773 @default.
- W4214891019 cites W3204504530 @default.
- W4214891019 cites W4245668478 @default.
- W4214891019 cites W4280524234 @default.
- W4214891019 doi "https://doi.org/10.1101/2022.02.26.482099" @default.
- W4214891019 hasPublicationYear "2022" @default.
- W4214891019 type Work @default.
- W4214891019 citedByCount "2" @default.
- W4214891019 countsByYear W42148910192022 @default.
- W4214891019 crossrefType "posted-content" @default.
- W4214891019 hasAuthorship W4214891019A5006996313 @default.
- W4214891019 hasAuthorship W4214891019A5032280221 @default.
- W4214891019 hasAuthorship W4214891019A5032411541 @default.
- W4214891019 hasAuthorship W4214891019A5032942384 @default.
- W4214891019 hasAuthorship W4214891019A5069222630 @default.
- W4214891019 hasConcept C104317684 @default.
- W4214891019 hasConcept C11413529 @default.
- W4214891019 hasConcept C124101348 @default.
- W4214891019 hasConcept C153180895 @default.
- W4214891019 hasConcept C154945302 @default.
- W4214891019 hasConcept C162324750 @default.
- W4214891019 hasConcept C167625842 @default.
- W4214891019 hasConcept C180384323 @default.
- W4214891019 hasConcept C187736073 @default.
- W4214891019 hasConcept C199360897 @default.
- W4214891019 hasConcept C23224414 @default.
- W4214891019 hasConcept C2778112365 @default.
- W4214891019 hasConcept C2780451532 @default.
- W4214891019 hasConcept C41008148 @default.
- W4214891019 hasConcept C43521106 @default.
- W4214891019 hasConcept C45484198 @default.
- W4214891019 hasConcept C54355233 @default.
- W4214891019 hasConcept C55493867 @default.
- W4214891019 hasConcept C86803240 @default.
- W4214891019 hasConcept C88031987 @default.
- W4214891019 hasConceptScore W4214891019C104317684 @default.
- W4214891019 hasConceptScore W4214891019C11413529 @default.
- W4214891019 hasConceptScore W4214891019C124101348 @default.
- W4214891019 hasConceptScore W4214891019C153180895 @default.
- W4214891019 hasConceptScore W4214891019C154945302 @default.
- W4214891019 hasConceptScore W4214891019C162324750 @default.
- W4214891019 hasConceptScore W4214891019C167625842 @default.
- W4214891019 hasConceptScore W4214891019C180384323 @default.
- W4214891019 hasConceptScore W4214891019C187736073 @default.
- W4214891019 hasConceptScore W4214891019C199360897 @default.
- W4214891019 hasConceptScore W4214891019C23224414 @default.
- W4214891019 hasConceptScore W4214891019C2778112365 @default.
- W4214891019 hasConceptScore W4214891019C2780451532 @default.
- W4214891019 hasConceptScore W4214891019C41008148 @default.
- W4214891019 hasConceptScore W4214891019C43521106 @default.
- W4214891019 hasConceptScore W4214891019C45484198 @default.
- W4214891019 hasConceptScore W4214891019C54355233 @default.
- W4214891019 hasConceptScore W4214891019C55493867 @default.
- W4214891019 hasConceptScore W4214891019C86803240 @default.
- W4214891019 hasConceptScore W4214891019C88031987 @default.
- W4214891019 hasLocation W42148910191 @default.
- W4214891019 hasOpenAccess W4214891019 @default.
- W4214891019 hasPrimaryLocation W42148910191 @default.
- W4214891019 hasRelatedWork W2021852343 @default.
- W4214891019 hasRelatedWork W2035698531 @default.
- W4214891019 hasRelatedWork W2041804354 @default.
- W4214891019 hasRelatedWork W2091678889 @default.
- W4214891019 hasRelatedWork W2129361631 @default.
- W4214891019 hasRelatedWork W2145293832 @default.
- W4214891019 hasRelatedWork W2160791843 @default.
- W4214891019 hasRelatedWork W4214891019 @default.
- W4214891019 hasRelatedWork W4280524234 @default.
- W4214891019 hasRelatedWork W88386512 @default.