Matches in SemOpenAlex for { <https://semopenalex.org/work/W2983201557> ?p ?o ?g. }
Showing items 1 to 72 of
72
with 100 items per page.
- W2983201557 abstract "Distributed storage in the cloud needs to offer both low latency and high bandwidth access to data and efficient use of storage capacity in order to keep up with emerging big data workloads. Deduplication has been successfully used to help with the latter requirement but it is often at odds with low latency data access. Deduplication ratios can be significantly increased if the storage nodes are aware of the file format and the ways clients interact with it – but implementing different file-type specific parsing on FPGAs for multiple tenants can be unfeasible due to area constraints. We show the benefits of making the storage system aware of the application through the example of Parquet files, a columnar format used in machine learning and big data frameworks to store and transfer datasets. We achieve high deduplication ratios by using a companion software library that allows Parquet files to be stored in a divided way. This makes deduplication more efficient and enables clients to access individual columns or meta-data fields selectively. At the same time, the storage nodes remain general purpose and can store and deduplicate arbitrary data. This work paves the way for in-storage processing for Parquet files and other columnar formats because the different columns can be accessed in a streaming fashion and their processing requires no specialized logic on the FPGA." @default.
- W2983201557 created "2019-11-22" @default.
- W2983201557 creator A5030342672 @default.
- W2983201557 creator A5082765592 @default.
- W2983201557 date "2019-09-01" @default.
- W2983201557 modified "2023-09-23" @default.
- W2983201557 title "Storing Parquet Tile by Tile: Application-Aware Storage with Deduplication" @default.
- W2983201557 cites W1969455062 @default.
- W2983201557 cites W2004286258 @default.
- W2983201557 cites W2034591426 @default.
- W2983201557 cites W2090815861 @default.
- W2983201557 cites W2208424819 @default.
- W2983201557 cites W2572583791 @default.
- W2983201557 cites W2900463239 @default.
- W2983201557 cites W2948847334 @default.
- W2983201557 doi "https://doi.org/10.1109/fpl.2019.00073" @default.
- W2983201557 hasPublicationYear "2019" @default.
- W2983201557 type Work @default.
- W2983201557 sameAs 2983201557 @default.
- W2983201557 citedByCount "0" @default.
- W2983201557 crossrefType "proceedings-article" @default.
- W2983201557 hasAuthorship W2983201557A5030342672 @default.
- W2983201557 hasAuthorship W2983201557A5082765592 @default.
- W2983201557 hasConcept C111919701 @default.
- W2983201557 hasConcept C194739806 @default.
- W2983201557 hasConcept C24885549 @default.
- W2983201557 hasConcept C2777059624 @default.
- W2983201557 hasConcept C2777904410 @default.
- W2983201557 hasConcept C32587265 @default.
- W2983201557 hasConcept C41008148 @default.
- W2983201557 hasConcept C67646966 @default.
- W2983201557 hasConcept C77088390 @default.
- W2983201557 hasConcept C79974875 @default.
- W2983201557 hasConcept C97250363 @default.
- W2983201557 hasConceptScore W2983201557C111919701 @default.
- W2983201557 hasConceptScore W2983201557C194739806 @default.
- W2983201557 hasConceptScore W2983201557C24885549 @default.
- W2983201557 hasConceptScore W2983201557C2777059624 @default.
- W2983201557 hasConceptScore W2983201557C2777904410 @default.
- W2983201557 hasConceptScore W2983201557C32587265 @default.
- W2983201557 hasConceptScore W2983201557C41008148 @default.
- W2983201557 hasConceptScore W2983201557C67646966 @default.
- W2983201557 hasConceptScore W2983201557C77088390 @default.
- W2983201557 hasConceptScore W2983201557C79974875 @default.
- W2983201557 hasConceptScore W2983201557C97250363 @default.
- W2983201557 hasLocation W29832015571 @default.
- W2983201557 hasOpenAccess W2983201557 @default.
- W2983201557 hasPrimaryLocation W29832015571 @default.
- W2983201557 hasRelatedWork W1498020397 @default.
- W2983201557 hasRelatedWork W200233886 @default.
- W2983201557 hasRelatedWork W2043384072 @default.
- W2983201557 hasRelatedWork W2137092019 @default.
- W2983201557 hasRelatedWork W2380944442 @default.
- W2983201557 hasRelatedWork W2472668849 @default.
- W2983201557 hasRelatedWork W2484731461 @default.
- W2983201557 hasRelatedWork W2501241433 @default.
- W2983201557 hasRelatedWork W2512339778 @default.
- W2983201557 hasRelatedWork W2611518728 @default.
- W2983201557 hasRelatedWork W2762023785 @default.
- W2983201557 hasRelatedWork W2765816362 @default.
- W2983201557 hasRelatedWork W2810370040 @default.
- W2983201557 hasRelatedWork W2998669235 @default.
- W2983201557 hasRelatedWork W3102408035 @default.
- W2983201557 hasRelatedWork W2220309835 @default.
- W2983201557 hasRelatedWork W2300775738 @default.
- W2983201557 hasRelatedWork W2737422314 @default.
- W2983201557 hasRelatedWork W275157621 @default.
- W2983201557 hasRelatedWork W2933785475 @default.
- W2983201557 isParatext "false" @default.
- W2983201557 isRetracted "false" @default.
- W2983201557 magId "2983201557" @default.
- W2983201557 workType "article" @default.