Matches in SemOpenAlex for { <https://semopenalex.org/work/W3047731862> ?p ?o ?g. }
- W3047731862 endingPage "161" @default.
- W3047731862 startingPage "155" @default.
- W3047731862 abstract "Abstract Motivation Rapid developments in sequencing technologies have boosted generating high volumes of sequence data. To archive and analyze those data, one primary step is sequence comparison. Alignment-free sequence comparison based on k-mer frequencies offers a computationally efficient solution, yet in practice, the k-mer frequency vectors for large k of practical interest lead to excessive memory and storage consumption. Results We report CRAFT, a general genomic/metagenomic search engine to learn compact representations of sequences and perform fast comparison between DNA sequences. Specifically, given genome or high throughput sequencing data as input, CRAFT maps the data into a much smaller embedding space and locates the best matching genome in the archived massive sequence repositories. With 102−104-fold reduction of storage space, CRAFT performs fast query for gigabytes of data within seconds or minutes, achieving comparable performance as six state-of-the-art alignment-free measures. Availability and implementation CRAFT offers a user-friendly graphical user interface with one-click installation on Windows and Linux operating systems, freely available at https://github.com/jiaxingbai/CRAFT. Supplementary information Supplementary data are available at Bioinformatics online." @default.
- W3047731862 created "2020-08-13" @default.
- W3047731862 creator A5022255550 @default.
- W3047731862 creator A5025406911 @default.
- W3047731862 creator A5078451282 @default.
- W3047731862 creator A5084931396 @default.
- W3047731862 date "2020-08-07" @default.
- W3047731862 modified "2023-10-11" @default.
- W3047731862 title "CRAFT: Compact genome Representation toward large-scale Alignment-Free daTabase" @default.
- W3047731862 cites W1999597013 @default.
- W3047731862 cites W1999674546 @default.
- W3047731862 cites W2006075770 @default.
- W3047731862 cites W2055043387 @default.
- W3047731862 cites W2060425093 @default.
- W3047731862 cites W2074231493 @default.
- W3047731862 cites W2086291326 @default.
- W3047731862 cites W2087064593 @default.
- W3047731862 cites W2107903949 @default.
- W3047731862 cites W2112874908 @default.
- W3047731862 cites W2116790427 @default.
- W3047731862 cites W2116988150 @default.
- W3047731862 cites W2133530448 @default.
- W3047731862 cites W2144820135 @default.
- W3047731862 cites W2151409320 @default.
- W3047731862 cites W2159954944 @default.
- W3047731862 cites W2341364667 @default.
- W3047731862 cites W2468915207 @default.
- W3047731862 cites W2611554670 @default.
- W3047731862 cites W2761430568 @default.
- W3047731862 cites W2950150251 @default.
- W3047731862 cites W2951737218 @default.
- W3047731862 cites W2953328352 @default.
- W3047731862 cites W2962807110 @default.
- W3047731862 cites W2963607348 @default.
- W3047731862 cites W4230932396 @default.
- W3047731862 doi "https://doi.org/10.1093/bioinformatics/btaa699" @default.
- W3047731862 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/32766810" @default.
- W3047731862 hasPublicationYear "2020" @default.
- W3047731862 type Work @default.
- W3047731862 sameAs 3047731862 @default.
- W3047731862 citedByCount "3" @default.
- W3047731862 countsByYear W30477318622021 @default.
- W3047731862 countsByYear W30477318622022 @default.
- W3047731862 countsByYear W30477318622023 @default.
- W3047731862 crossrefType "journal-article" @default.
- W3047731862 hasAuthorship W3047731862A5022255550 @default.
- W3047731862 hasAuthorship W3047731862A5025406911 @default.
- W3047731862 hasAuthorship W3047731862A5078451282 @default.
- W3047731862 hasAuthorship W3047731862A5084931396 @default.
- W3047731862 hasBestOaLocation W30477318622 @default.
- W3047731862 hasConcept C104317684 @default.
- W3047731862 hasConcept C111919701 @default.
- W3047731862 hasConcept C124101348 @default.
- W3047731862 hasConcept C166957645 @default.
- W3047731862 hasConcept C167625842 @default.
- W3047731862 hasConcept C180384323 @default.
- W3047731862 hasConcept C2778112365 @default.
- W3047731862 hasConcept C2779732396 @default.
- W3047731862 hasConcept C37789001 @default.
- W3047731862 hasConcept C41008148 @default.
- W3047731862 hasConcept C45484198 @default.
- W3047731862 hasConcept C51679486 @default.
- W3047731862 hasConcept C54355233 @default.
- W3047731862 hasConcept C552990157 @default.
- W3047731862 hasConcept C55493867 @default.
- W3047731862 hasConcept C77088390 @default.
- W3047731862 hasConcept C86803240 @default.
- W3047731862 hasConcept C95457728 @default.
- W3047731862 hasConceptScore W3047731862C104317684 @default.
- W3047731862 hasConceptScore W3047731862C111919701 @default.
- W3047731862 hasConceptScore W3047731862C124101348 @default.
- W3047731862 hasConceptScore W3047731862C166957645 @default.
- W3047731862 hasConceptScore W3047731862C167625842 @default.
- W3047731862 hasConceptScore W3047731862C180384323 @default.
- W3047731862 hasConceptScore W3047731862C2778112365 @default.
- W3047731862 hasConceptScore W3047731862C2779732396 @default.
- W3047731862 hasConceptScore W3047731862C37789001 @default.
- W3047731862 hasConceptScore W3047731862C41008148 @default.
- W3047731862 hasConceptScore W3047731862C45484198 @default.
- W3047731862 hasConceptScore W3047731862C51679486 @default.
- W3047731862 hasConceptScore W3047731862C54355233 @default.
- W3047731862 hasConceptScore W3047731862C552990157 @default.
- W3047731862 hasConceptScore W3047731862C55493867 @default.
- W3047731862 hasConceptScore W3047731862C77088390 @default.
- W3047731862 hasConceptScore W3047731862C86803240 @default.
- W3047731862 hasConceptScore W3047731862C95457728 @default.
- W3047731862 hasFunder F4320306076 @default.
- W3047731862 hasFunder F4320321001 @default.
- W3047731862 hasFunder F4320321878 @default.
- W3047731862 hasFunder F4320332161 @default.
- W3047731862 hasFunder F4320335777 @default.
- W3047731862 hasIssue "2" @default.
- W3047731862 hasLocation W30477318621 @default.
- W3047731862 hasLocation W30477318622 @default.
- W3047731862 hasLocation W30477318623 @default.
- W3047731862 hasLocation W30477318624 @default.
- W3047731862 hasOpenAccess W3047731862 @default.
- W3047731862 hasPrimaryLocation W30477318621 @default.