Matches in SemOpenAlex for { <https://semopenalex.org/work/W4386714266> ?p ?o ?g. }
- W4386714266 abstract "Metagenomic data compression is increasingly important as metagenomic projects face larger data volumes per sample and growing numbers of samples. Reference-based compression is a promising method for obtaining a high compression ratio. However, existing microbial reference genome databases are not suitable for direct use as compression references because of their large size and redundancy, and different metagenomic cohorts often have different microbial compositions. We present a novel pipeline that generates simplified and tailored reference genomes for large metagenomic cohorts, enabling reference-based compression of metagenomic data. We constructed customized reference genomes, ranging from 2.4 to 3.9 GB, for 29 real metagenomic datasets and evaluated their compression performance. Reference-based compression achieved a compression ratio of over 20 for human whole-genome data and up to 33.8 for all samples, a remarkable 4.5-fold improvement over standard Gzip compression. Our method provides new insights into reference-based metagenomic data compression and has broad application potential for faster and cheaper data transfer, storage, and analysis." @default.
- W4386714266 created "2023-09-14" @default.
- W4386714266 creator A5028830378 @default.
- W4386714266 creator A5032412994 @default.
- W4386714266 creator A5049006053 @default.
- W4386714266 creator A5062951919 @default.
- W4386714266 creator A5068441306 @default.
- W4386714266 date "2023-09-12" @default.
- W4386714266 modified "2023-10-15" @default.
- W4386714266 title "A pipeline for constructing reference genomes for large cohort-specific metagenome compression" @default.
- W4386714266 cites W1931590386 @default.
- W4386714266 cites W1989348205 @default.
- W4386714266 cites W2001261225 @default.
- W4386714266 cites W2004690636 @default.
- W4386714266 cites W2020255088 @default.
- W4386714266 cites W2042947822 @default.
- W4386714266 cites W2075716829 @default.
- W4386714266 cites W2092880969 @default.
- W4386714266 cites W2104422209 @default.
- W4386714266 cites W2106399600 @default.
- W4386714266 cites W2111044311 @default.
- W4386714266 cites W2113552312 @default.
- W4386714266 cites W2159084616 @default.
- W4386714266 cites W2166588423 @default.
- W4386714266 cites W2170551349 @default.
- W4386714266 cites W2170727800 @default.
- W4386714266 cites W2171057071 @default.
- W4386714266 cites W2173732482 @default.
- W4386714266 cites W2230655163 @default.
- W4386714266 cites W2377027462 @default.
- W4386714266 cites W2460850757 @default.
- W4386714266 cites W2530896378 @default.
- W4386714266 cites W2559993150 @default.
- W4386714266 cites W2597471102 @default.
- W4386714266 cites W2763540102 @default.
- W4386714266 cites W2766239397 @default.
- W4386714266 cites W2789843538 @default.
- W4386714266 cites W2790743416 @default.
- W4386714266 cites W2794368263 @default.
- W4386714266 cites W2794407684 @default.
- W4386714266 cites W2905694505 @default.
- W4386714266 cites W2912990896 @default.
- W4386714266 cites W2916976079 @default.
- W4386714266 cites W2922289309 @default.
- W4386714266 cites W2949113050 @default.
- W4386714266 cites W2956864158 @default.
- W4386714266 cites W2973821866 @default.
- W4386714266 cites W3025757981 @default.
- W4386714266 cites W3042305844 @default.
- W4386714266 cites W3106764389 @default.
- W4386714266 cites W3118724893 @default.
- W4386714266 cites W3128751250 @default.
- W4386714266 cites W3132214945 @default.
- W4386714266 cites W3156729733 @default.
- W4386714266 cites W3159504416 @default.
- W4386714266 cites W3166011525 @default.
- W4386714266 cites W3169629261 @default.
- W4386714266 cites W3198563773 @default.
- W4386714266 cites W3202985852 @default.
- W4386714266 cites W4200235252 @default.
- W4386714266 cites W4220699460 @default.
- W4386714266 cites W4220737584 @default.
- W4386714266 cites W4220787108 @default.
- W4386714266 cites W4221009274 @default.
- W4386714266 cites W4317242243 @default.
- W4386714266 doi "https://doi.org/10.1101/2023.09.12.557346" @default.
- W4386714266 hasPublicationYear "2023" @default.
- W4386714266 type Work @default.
- W4386714266 citedByCount "0" @default.
- W4386714266 crossrefType "posted-content" @default.
- W4386714266 hasAuthorship W4386714266A5028830378 @default.
- W4386714266 hasAuthorship W4386714266A5032412994 @default.
- W4386714266 hasAuthorship W4386714266A5049006053 @default.
- W4386714266 hasAuthorship W4386714266A5062951919 @default.
- W4386714266 hasAuthorship W4386714266A5068441306 @default.
- W4386714266 hasBestOaLocation W43867142661 @default.
- W4386714266 hasConcept C104317684 @default.
- W4386714266 hasConcept C111919701 @default.
- W4386714266 hasConcept C124101348 @default.
- W4386714266 hasConcept C127413603 @default.
- W4386714266 hasConcept C141231307 @default.
- W4386714266 hasConcept C15151743 @default.
- W4386714266 hasConcept C152124472 @default.
- W4386714266 hasConcept C154945302 @default.
- W4386714266 hasConcept C159985019 @default.
- W4386714266 hasConcept C171146098 @default.
- W4386714266 hasConcept C180016635 @default.
- W4386714266 hasConcept C192562407 @default.
- W4386714266 hasConcept C192953774 @default.
- W4386714266 hasConcept C199360897 @default.
- W4386714266 hasConcept C25797200 @default.
- W4386714266 hasConcept C41008148 @default.
- W4386714266 hasConcept C43521106 @default.
- W4386714266 hasConcept C511840579 @default.
- W4386714266 hasConcept C54355233 @default.
- W4386714266 hasConcept C7545210 @default.
- W4386714266 hasConcept C77088390 @default.
- W4386714266 hasConcept C78548338 @default.
- W4386714266 hasConcept C86803240 @default.
- W4386714266 hasConceptScore W4386714266C104317684 @default.