Matches in SemOpenAlex for { <https://semopenalex.org/work/W4220791336> ?p ?o ?g. }
- W4220791336 abstract "Abstract Background Biological sequence clustering is a complicated data clustering problem owing to the high computation costs incurred for pairwise sequence distance calculations through sequence alignments, as well as difficulties in determining parameters for deriving robust clusters. While current approaches are successful in reducing the number of sequence alignments performed, the generated clusters are based on a single sequence identity threshold applied to every cluster. Poor choices of this identity threshold would thus lead to low quality clusters. There is however little support provided to users in selecting thresholds that are well matched with the input sequences. Results We present a novel sequence clustering approach called ALFATClust that exploits rapid pairwise alignment-free sequence distance calculations and community detection in graph for clusters generation. Instead of a single threshold applied to every generated cluster, ALFATClust is capable of dynamically determining the cut-off threshold for each individual cluster by considering both cluster separation and intra-cluster sequence similarity. Benchmarking analysis shows that ALFATClust generally outperforms existing approaches by simultaneously maintaining cluster robustness and substantial cluster separation for the benchmark datasets. The software also provides an evaluation report for verifying the quality of the non-singleton clusters obtained. Conclusions ALFATClust is able to generate sequence clusters having high intra-cluster sequence similarity and substantial separation between clusters without having users to decide precise similarity cut-off thresholds." @default.
- W4220791336 created "2022-04-03" @default.
- W4220791336 creator A5048547355 @default.
- W4220791336 creator A5072970858 @default.
- W4220791336 date "2022-03-30" @default.
- W4220791336 modified "2023-10-14" @default.
- W4220791336 title "Clustering biological sequences with dynamic sequence similarity threshold" @default.
- W4220791336 cites W1657207870 @default.
- W4220791336 cites W1971421925 @default.
- W4220791336 cites W1987971958 @default.
- W4220791336 cites W2011301426 @default.
- W4220791336 cites W2022686119 @default.
- W4220791336 cites W2032345031 @default.
- W4220791336 cites W2033947830 @default.
- W4220791336 cites W2045691329 @default.
- W4220791336 cites W2054475422 @default.
- W4220791336 cites W2067964678 @default.
- W4220791336 cites W2108211735 @default.
- W4220791336 cites W2118597789 @default.
- W4220791336 cites W2118688136 @default.
- W4220791336 cites W2124351063 @default.
- W4220791336 cites W2131681506 @default.
- W4220791336 cites W2133828198 @default.
- W4220791336 cites W2134852597 @default.
- W4220791336 cites W2145853890 @default.
- W4220791336 cites W2150593711 @default.
- W4220791336 cites W2151370125 @default.
- W4220791336 cites W2156125289 @default.
- W4220791336 cites W2158703410 @default.
- W4220791336 cites W2160877678 @default.
- W4220791336 cites W2164078810 @default.
- W4220791336 cites W2170747616 @default.
- W4220791336 cites W2513506562 @default.
- W4220791336 cites W2565824505 @default.
- W4220791336 cites W2573834278 @default.
- W4220791336 cites W2893222042 @default.
- W4220791336 cites W2897401786 @default.
- W4220791336 cites W2909601828 @default.
- W4220791336 cites W2950150251 @default.
- W4220791336 cites W2950589160 @default.
- W4220791336 cites W2950954328 @default.
- W4220791336 cites W2953008890 @default.
- W4220791336 cites W2982531601 @default.
- W4220791336 cites W2992400060 @default.
- W4220791336 cites W3106188259 @default.
- W4220791336 cites W4213009331 @default.
- W4220791336 cites W2904651995 @default.
- W4220791336 doi "https://doi.org/10.1186/s12859-022-04643-9" @default.
- W4220791336 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/35354426" @default.
- W4220791336 hasPublicationYear "2022" @default.
- W4220791336 type Work @default.
- W4220791336 citedByCount "0" @default.
- W4220791336 crossrefType "journal-article" @default.
- W4220791336 hasAuthorship W4220791336A5048547355 @default.
- W4220791336 hasAuthorship W4220791336A5072970858 @default.
- W4220791336 hasBestOaLocation W42207913361 @default.
- W4220791336 hasConcept C103278499 @default.
- W4220791336 hasConcept C104317684 @default.
- W4220791336 hasConcept C11413529 @default.
- W4220791336 hasConcept C115961682 @default.
- W4220791336 hasConcept C124101348 @default.
- W4220791336 hasConcept C153180895 @default.
- W4220791336 hasConcept C154945302 @default.
- W4220791336 hasConcept C164866538 @default.
- W4220791336 hasConcept C184898388 @default.
- W4220791336 hasConcept C199360897 @default.
- W4220791336 hasConcept C2778112365 @default.
- W4220791336 hasConcept C41008148 @default.
- W4220791336 hasConcept C54355233 @default.
- W4220791336 hasConcept C55493867 @default.
- W4220791336 hasConcept C63479239 @default.
- W4220791336 hasConcept C73555534 @default.
- W4220791336 hasConcept C86803240 @default.
- W4220791336 hasConceptScore W4220791336C103278499 @default.
- W4220791336 hasConceptScore W4220791336C104317684 @default.
- W4220791336 hasConceptScore W4220791336C11413529 @default.
- W4220791336 hasConceptScore W4220791336C115961682 @default.
- W4220791336 hasConceptScore W4220791336C124101348 @default.
- W4220791336 hasConceptScore W4220791336C153180895 @default.
- W4220791336 hasConceptScore W4220791336C154945302 @default.
- W4220791336 hasConceptScore W4220791336C164866538 @default.
- W4220791336 hasConceptScore W4220791336C184898388 @default.
- W4220791336 hasConceptScore W4220791336C199360897 @default.
- W4220791336 hasConceptScore W4220791336C2778112365 @default.
- W4220791336 hasConceptScore W4220791336C41008148 @default.
- W4220791336 hasConceptScore W4220791336C54355233 @default.
- W4220791336 hasConceptScore W4220791336C55493867 @default.
- W4220791336 hasConceptScore W4220791336C63479239 @default.
- W4220791336 hasConceptScore W4220791336C73555534 @default.
- W4220791336 hasConceptScore W4220791336C86803240 @default.
- W4220791336 hasIssue "1" @default.
- W4220791336 hasLocation W42207913361 @default.
- W4220791336 hasLocation W42207913362 @default.
- W4220791336 hasLocation W42207913363 @default.
- W4220791336 hasOpenAccess W4220791336 @default.
- W4220791336 hasPrimaryLocation W42207913361 @default.
- W4220791336 hasRelatedWork W1457719682 @default.
- W4220791336 hasRelatedWork W2148556617 @default.
- W4220791336 hasRelatedWork W2153839362 @default.
- W4220791336 hasRelatedWork W2295731408 @default.