Matches in SemOpenAlex for { <https://semopenalex.org/work/W4377019635> ?p ?o ?g. }
- W4377019635 abstract "Multiple sequence alignment is widely used for sequence analysis, such as identifying important sites and phylogenetic analysis. Traditional methods, such as progressive alignment, are time-consuming. To address this issue, we introduce StarTree, a novel method that quickly constructs a guide tree by combining sequence clustering and hierarchical clustering. Furthermore, we develop a new heuristic algorithm for detecting similar regions using the FM-index and apply k-banded dynamic programming to the profile alignment. We also introduce a win-win alignment algorithm that applies the central star strategy within the clusters to speed up the alignment process, then uses the progressive strategy to align the central-aligned profiles, guaranteeing the final alignment's accuracy. We present WMSA 2 based on these improvements and compare its speed and accuracy with those of other popular methods. The results show that the guide tree made by the StarTree clustering method can lead to better accuracy than that of PartTree while consuming less time and memory than the UPGMA and mBed methods on datasets with thousands of sequences. During the alignment of simulated datasets, WMSA 2 consumes less time and memory while ranking at the top in Q and TC scores. WMSA 2 also remains better in time and memory efficiency on the real datasets and ranks at the top in average sum-of-pairs score. For the alignment of 1 million SARS-CoV-2 genomes, the win-win mode of WMSA 2 significantly decreased the running time compared with the former version. The source code and data are available at https://github.com/malabz/WMSA2." @default.
- W4377019635 created "2023-05-19" @default.
- W4377019635 creator A5013881064 @default.
- W4377019635 creator A5017426085 @default.
- W4377019635 creator A5046817187 @default.
- W4377019635 creator A5069921868 @default.
- W4377019635 creator A5075995313 @default.
- W4377019635 creator A5088804993 @default.
- W4377019635 date "2023-05-17" @default.
- W4377019635 modified "2023-10-14" @default.
- W4377019635 title "WMSA 2: a multiple DNA/RNA sequence alignment tool implemented with accurate progressive mode and a fast win-win mode combining the center star and progressive strategies" @default.
- W4377019635 cites W1519266993 @default.
- W4377019635 cites W1969153299 @default.
- W4377019635 cites W2031611770 @default.
- W4377019635 cites W2050178723 @default.
- W4377019635 cites W2056251063 @default.
- W4377019635 cites W2066959416 @default.
- W4377019635 cites W2080728354 @default.
- W4377019635 cites W2100673218 @default.
- W4377019635 cites W2115394533 @default.
- W4377019635 cites W2119270998 @default.
- W4377019635 cites W2127322768 @default.
- W4377019635 cites W2127520112 @default.
- W4377019635 cites W2127774996 @default.
- W4377019635 cites W2144362290 @default.
- W4377019635 cites W2606325896 @default.
- W4377019635 cites W2623372292 @default.
- W4377019635 cites W2982461819 @default.
- W4377019635 cites W3003217347 @default.
- W4377019635 cites W3109134151 @default.
- W4377019635 cites W3204504530 @default.
- W4377019635 cites W4200611247 @default.
- W4377019635 cites W4289313383 @default.
- W4377019635 cites W4298126241 @default.
- W4377019635 doi "https://doi.org/10.1093/bib/bbad190" @default.
- W4377019635 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/37200156" @default.
- W4377019635 hasPublicationYear "2023" @default.
- W4377019635 type Work @default.
- W4377019635 citedByCount "0" @default.
- W4377019635 crossrefType "journal-article" @default.
- W4377019635 hasAuthorship W4377019635A5013881064 @default.
- W4377019635 hasAuthorship W4377019635A5017426085 @default.
- W4377019635 hasAuthorship W4377019635A5046817187 @default.
- W4377019635 hasAuthorship W4377019635A5069921868 @default.
- W4377019635 hasAuthorship W4377019635A5075995313 @default.
- W4377019635 hasAuthorship W4377019635A5088804993 @default.
- W4377019635 hasConcept C104317684 @default.
- W4377019635 hasConcept C111919701 @default.
- W4377019635 hasConcept C113174947 @default.
- W4377019635 hasConcept C11413529 @default.
- W4377019635 hasConcept C124101348 @default.
- W4377019635 hasConcept C134306372 @default.
- W4377019635 hasConcept C154945302 @default.
- W4377019635 hasConcept C167625842 @default.
- W4377019635 hasConcept C173801870 @default.
- W4377019635 hasConcept C180384323 @default.
- W4377019635 hasConcept C185592680 @default.
- W4377019635 hasConcept C189430467 @default.
- W4377019635 hasConcept C2778112365 @default.
- W4377019635 hasConcept C2780897414 @default.
- W4377019635 hasConcept C33923547 @default.
- W4377019635 hasConcept C41008148 @default.
- W4377019635 hasConcept C45484198 @default.
- W4377019635 hasConcept C48677424 @default.
- W4377019635 hasConcept C54355233 @default.
- W4377019635 hasConcept C55493867 @default.
- W4377019635 hasConcept C73555534 @default.
- W4377019635 hasConcept C86803240 @default.
- W4377019635 hasConcept C88031987 @default.
- W4377019635 hasConcept C92835128 @default.
- W4377019635 hasConcept C98045186 @default.
- W4377019635 hasConceptScore W4377019635C104317684 @default.
- W4377019635 hasConceptScore W4377019635C111919701 @default.
- W4377019635 hasConceptScore W4377019635C113174947 @default.
- W4377019635 hasConceptScore W4377019635C11413529 @default.
- W4377019635 hasConceptScore W4377019635C124101348 @default.
- W4377019635 hasConceptScore W4377019635C134306372 @default.
- W4377019635 hasConceptScore W4377019635C154945302 @default.
- W4377019635 hasConceptScore W4377019635C167625842 @default.
- W4377019635 hasConceptScore W4377019635C173801870 @default.
- W4377019635 hasConceptScore W4377019635C180384323 @default.
- W4377019635 hasConceptScore W4377019635C185592680 @default.
- W4377019635 hasConceptScore W4377019635C189430467 @default.
- W4377019635 hasConceptScore W4377019635C2778112365 @default.
- W4377019635 hasConceptScore W4377019635C2780897414 @default.
- W4377019635 hasConceptScore W4377019635C33923547 @default.
- W4377019635 hasConceptScore W4377019635C41008148 @default.
- W4377019635 hasConceptScore W4377019635C45484198 @default.
- W4377019635 hasConceptScore W4377019635C48677424 @default.
- W4377019635 hasConceptScore W4377019635C54355233 @default.
- W4377019635 hasConceptScore W4377019635C55493867 @default.
- W4377019635 hasConceptScore W4377019635C73555534 @default.
- W4377019635 hasConceptScore W4377019635C86803240 @default.
- W4377019635 hasConceptScore W4377019635C88031987 @default.
- W4377019635 hasConceptScore W4377019635C92835128 @default.
- W4377019635 hasConceptScore W4377019635C98045186 @default.
- W4377019635 hasFunder F4320321001 @default.
- W4377019635 hasFunder F4320329861 @default.
- W4377019635 hasIssue "4" @default.
- W4377019635 hasLocation W43770196351 @default.