Matches in SemOpenAlex for { <https://semopenalex.org/work/W4206903088> ?p ?o ?g. }
- W4206903088 abstract "Background: Because of the rapid generation of data, the study of compression algorithms to reduce storage and transmission costs is important to bioinformaticians. Much of the focus has been on sequence data, including both genomes and protein amino acid sequences stored in FASTA files. Current standard practice is to use an ordinary lossless compressor such as gzip on a sequential list of atomic coordinates, but this approach expends bits on saving an arbitrary ordering of atoms, and it also prevents reordering the atoms for compressibility. The standard MMTF and BCIF file formats extend this approach with custom encoding of the coordinates. However, the brand-new Foldcomp tool introduces a new paradigm of compressing local angles, to great effect. In this article, we explore a different paradigm, showing for the first time that image-based compression using global angles can also significantly improve compression ratios. To this end, we implement a prototype compressor ‘PIC’, specialized for point clouds of atom coordinates contained in PDB and mmCIF files. PIC maps the 3D data to a 2D 8-bit greyscale image and leverages the well-developed PNG image compressor to minimize the size of the resulting image, forming the compressed file. Results: PIC outperforms gzip in terms of compression ratio on proteins over 20,000 atoms in size, with savings over gzip of up to 37.4% on the proteins compressed. In addition, PIC’s compression ratio increases with protein size. Conclusion: Image-centric compression as demonstrated by our prototype PIC provides a potential means of constructing 3D structure-aware protein compression software, though future work would be necessary to make this practical." @default.
- W4206903088 created "2022-01-26" @default.
- W4206903088 creator A5011101868 @default.
- W4206903088 creator A5015501574 @default.
- W4206903088 date "2022-01-22" @default.
- W4206903088 modified "2023-09-30" @default.
- W4206903088 title "Image-centric compression of protein structures improves space savings" @default.
- W4206903088 cites W1532202009 @default.
- W4206903088 cites W1777016212 @default.
- W4206903088 cites W2069066547 @default.
- W4206903088 cites W2111044311 @default.
- W4206903088 cites W2128162768 @default.
- W4206903088 cites W2131106408 @default.
- W4206903088 cites W2131298020 @default.
- W4206903088 cites W2158678815 @default.
- W4206903088 cites W2159084616 @default.
- W4206903088 cites W2159906372 @default.
- W4206903088 cites W2187463565 @default.
- W4206903088 cites W2284413383 @default.
- W4206903088 cites W2622731166 @default.
- W4206903088 cites W2883628759 @default.
- W4206903088 cites W2922407406 @default.
- W4206903088 cites W2950192635 @default.
- W4206903088 cites W2951893579 @default.
- W4206903088 cites W2954770217 @default.
- W4206903088 cites W3093323969 @default.
- W4206903088 cites W3211795435 @default.
- W4206903088 cites W4311211228 @default.
- W4206903088 doi "https://doi.org/10.1101/2022.01.20.477098" @default.
- W4206903088 hasPublicationYear "2022" @default.
- W4206903088 type Work @default.
- W4206903088 citedByCount "2" @default.
- W4206903088 countsByYear W42069030882022 @default.
- W4206903088 countsByYear W42069030882023 @default.
- W4206903088 crossrefType "posted-content" @default.
- W4206903088 hasAuthorship W4206903088A5011101868 @default.
- W4206903088 hasAuthorship W4206903088A5015501574 @default.
- W4206903088 hasBestOaLocation W42069030881 @default.
- W4206903088 hasConcept C111919701 @default.
- W4206903088 hasConcept C113775141 @default.
- W4206903088 hasConcept C11413529 @default.
- W4206903088 hasConcept C115961682 @default.
- W4206903088 hasConcept C121684516 @default.
- W4206903088 hasConcept C125411270 @default.
- W4206903088 hasConcept C127413603 @default.
- W4206903088 hasConcept C131097465 @default.
- W4206903088 hasConcept C13481523 @default.
- W4206903088 hasConcept C154945302 @default.
- W4206903088 hasConcept C159985019 @default.
- W4206903088 hasConcept C171146098 @default.
- W4206903088 hasConcept C180016635 @default.
- W4206903088 hasConcept C192562407 @default.
- W4206903088 hasConcept C25797200 @default.
- W4206903088 hasConcept C2776029614 @default.
- W4206903088 hasConcept C31972630 @default.
- W4206903088 hasConcept C41008148 @default.
- W4206903088 hasConcept C459310 @default.
- W4206903088 hasConcept C46900642 @default.
- W4206903088 hasConcept C511840579 @default.
- W4206903088 hasConcept C65377053 @default.
- W4206903088 hasConcept C77088390 @default.
- W4206903088 hasConcept C78519656 @default.
- W4206903088 hasConcept C78548338 @default.
- W4206903088 hasConcept C80444323 @default.
- W4206903088 hasConcept C81081738 @default.
- W4206903088 hasConcept C9417928 @default.
- W4206903088 hasConcept C97250363 @default.
- W4206903088 hasConceptScore W4206903088C111919701 @default.
- W4206903088 hasConceptScore W4206903088C113775141 @default.
- W4206903088 hasConceptScore W4206903088C11413529 @default.
- W4206903088 hasConceptScore W4206903088C115961682 @default.
- W4206903088 hasConceptScore W4206903088C121684516 @default.
- W4206903088 hasConceptScore W4206903088C125411270 @default.
- W4206903088 hasConceptScore W4206903088C127413603 @default.
- W4206903088 hasConceptScore W4206903088C131097465 @default.
- W4206903088 hasConceptScore W4206903088C13481523 @default.
- W4206903088 hasConceptScore W4206903088C154945302 @default.
- W4206903088 hasConceptScore W4206903088C159985019 @default.
- W4206903088 hasConceptScore W4206903088C171146098 @default.
- W4206903088 hasConceptScore W4206903088C180016635 @default.
- W4206903088 hasConceptScore W4206903088C192562407 @default.
- W4206903088 hasConceptScore W4206903088C25797200 @default.
- W4206903088 hasConceptScore W4206903088C2776029614 @default.
- W4206903088 hasConceptScore W4206903088C31972630 @default.
- W4206903088 hasConceptScore W4206903088C41008148 @default.
- W4206903088 hasConceptScore W4206903088C459310 @default.
- W4206903088 hasConceptScore W4206903088C46900642 @default.
- W4206903088 hasConceptScore W4206903088C511840579 @default.
- W4206903088 hasConceptScore W4206903088C65377053 @default.
- W4206903088 hasConceptScore W4206903088C77088390 @default.
- W4206903088 hasConceptScore W4206903088C78519656 @default.
- W4206903088 hasConceptScore W4206903088C78548338 @default.
- W4206903088 hasConceptScore W4206903088C80444323 @default.
- W4206903088 hasConceptScore W4206903088C81081738 @default.
- W4206903088 hasConceptScore W4206903088C9417928 @default.
- W4206903088 hasConceptScore W4206903088C97250363 @default.
- W4206903088 hasLocation W42069030881 @default.
- W4206903088 hasOpenAccess W4206903088 @default.
- W4206903088 hasPrimaryLocation W42069030881 @default.
- W4206903088 hasRelatedWork W2008404305 @default.
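
The abstract above describes mapping 3D atom coordinates to a 2D 8-bit greyscale image and letting PNG compress it. The following is a minimal sketch of that general idea only, not the authors' PIC implementation: it does not use global angles or any atom reordering. The function names, the fixed-point scale of 1000 (three decimals, as in PDB coordinate fields), the offset, and the layout of nine pixels per atom are all assumptions made for illustration. It uses only the Python standard library and writes a bare-bones PNG by hand.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def _chunk(tag: bytes, data: bytes) -> bytes:
    """Build one PNG chunk: length, type, data, CRC over type + data."""
    crc = zlib.crc32(tag + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + tag + data + struct.pack(">I", crc)


def coords_to_png(coords, scale=1000, offset=1_000_000):
    """Encode (x, y, z) tuples as an 8-bit greyscale PNG (hypothetical layout).

    Each coordinate becomes a non-negative fixed-point integer stored in
    3 bytes, so one atom fills one image row of 9 grey pixels.
    """
    if not coords:
        raise ValueError("need at least one atom")
    rows = []
    for xyz in coords:
        row = bytearray()
        for value in xyz:
            fixed = round(value * scale) + offset
            if not 0 <= fixed < (1 << 24):
                raise ValueError(f"coordinate {value} out of range")
            row += fixed.to_bytes(3, "big")
        rows.append(bytes(row))
    width, height = 9, len(rows)
    # IHDR: bit depth 8, colour type 0 (greyscale), no interlacing.
    ihdr = struct.pack(">IIBBBBB", width, height, 8, 0, 0, 0, 0)
    # Every scanline is prefixed with filter type 0 (no filtering).
    raw = b"".join(b"\x00" + row for row in rows)
    return (PNG_SIGNATURE
            + _chunk(b"IHDR", ihdr)
            + _chunk(b"IDAT", zlib.compress(raw, 9))
            + _chunk(b"IEND", b""))


def png_to_coords(png, scale=1000, offset=1_000_000):
    """Decode an image written by coords_to_png back into coordinates."""
    if not png.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    pos, idat, width, height = len(PNG_SIGNATURE), b"", 0, 0
    while pos < len(png):
        (length,) = struct.unpack(">I", png[pos:pos + 4])
        tag = png[pos + 4:pos + 8]
        data = png[pos + 8:pos + 8 + length]
        pos += 12 + length  # length + type + data + CRC
        if tag == b"IHDR":
            width, height = struct.unpack(">II", data[:8])
        elif tag == b"IDAT":
            idat += data
    raw = zlib.decompress(idat)
    stride = width + 1  # one filter byte per scanline
    coords = []
    for r in range(height):
        row = raw[r * stride + 1:(r + 1) * stride]
        coords.append(tuple(
            (int.from_bytes(row[i:i + 3], "big") - offset) / scale
            for i in (0, 3, 6)
        ))
    return coords
```

In this sketch the round trip is lossless at three decimal places. It makes no claim about compression ratio; the results quoted in the abstract come from the authors' own encoding.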