Matches in SemOpenAlex for { <https://semopenalex.org/work/W4382724591> ?p ?o ?g. }
Showing items 1 to 96 of
96
with 100 items per page.
- W4382724591 endingPage "2579" @default.
- W4382724591 startingPage "2568" @default.
- W4382724591 abstract "In data deduplication systems, chunking has a significant impact on the deduplication ratio and throughput. Existing <italic xmlns:mml=http://www.w3.org/1998/Math/MathML xmlns:xlink=http://www.w3.org/1999/xlink>Content-Defined Chunking</i> (CDC) approaches exploit a sliding window to calculate rolling hashes of the input data stream byte-by-byte, and then determine chunk cut-points if the rolling hash satisfies a given cut-condition. Since previous CDC approaches are extremely costly, it often significantly degrades the throughput of data deduplication systems. In this paper, we argue that calculating and checking the rolling hashes byte-by-byte is unnecessary. To reduce the CPU overhead of CDC, we propose a <italic xmlns:mml=http://www.w3.org/1998/Math/MathML xmlns:xlink=http://www.w3.org/1999/xlink>jump-based chunking</i> (JC) approach. The key idea is to introduce a jump-condition, and the sliding window can jump over a specific length of the input data stream if the rolling hashes satisfy the jump-condition. Moreover, we also explore the impact of the cut-condition and the jump-condition on the chunk size. Our theoretic studies demonstrate the effectiveness and efficiency of JC, without compromising the deduplication ratio. Experimental results show that JC improves the throughput of chunking by about 2× on average compared with the state-of-the-art CDC approaches while still guaranteeing high deduplication ratio." @default.
- W4382724591 created "2023-07-01" @default.
- W4382724591 creator A5005730481 @default.
- W4382724591 creator A5007630760 @default.
- W4382724591 creator A5022262922 @default.
- W4382724591 creator A5022398389 @default.
- W4382724591 creator A5051170863 @default.
- W4382724591 creator A5092370649 @default.
- W4382724591 date "2023-09-01" @default.
- W4382724591 modified "2023-10-15" @default.
- W4382724591 title "Accelerating Content-Defined Chunking for Data Deduplication Based on Speculative Jump" @default.
- W4382724591 cites W1521996498 @default.
- W4382724591 cites W1553098517 @default.
- W4382724591 cites W1614703486 @default.
- W4382724591 cites W1639305476 @default.
- W4382724591 cites W1969335064 @default.
- W4382724591 cites W1976024527 @default.
- W4382724591 cites W2110322986 @default.
- W4382724591 cites W2475932436 @default.
- W4382724591 cites W2481696877 @default.
- W4382724591 cites W2916086000 @default.
- W4382724591 cites W2935106878 @default.
- W4382724591 cites W2979451760 @default.
- W4382724591 cites W2986445348 @default.
- W4382724591 cites W3006159724 @default.
- W4382724591 cites W3014193728 @default.
- W4382724591 cites W3195673285 @default.
- W4382724591 cites W3209955552 @default.
- W4382724591 cites W4205095516 @default.
- W4382724591 cites W4288070990 @default.
- W4382724591 cites W3006437988 @default.
- W4382724591 doi "https://doi.org/10.1109/tpds.2023.3290770" @default.
- W4382724591 hasPublicationYear "2023" @default.
- W4382724591 type Work @default.
- W4382724591 citedByCount "0" @default.
- W4382724591 crossrefType "journal-article" @default.
- W4382724591 hasAuthorship W4382724591A5005730481 @default.
- W4382724591 hasAuthorship W4382724591A5007630760 @default.
- W4382724591 hasAuthorship W4382724591A5022262922 @default.
- W4382724591 hasAuthorship W4382724591A5022398389 @default.
- W4382724591 hasAuthorship W4382724591A5051170863 @default.
- W4382724591 hasAuthorship W4382724591A5092370649 @default.
- W4382724591 hasBestOaLocation W43827245911 @default.
- W4382724591 hasConcept C102392041 @default.
- W4382724591 hasConcept C111919701 @default.
- W4382724591 hasConcept C11413529 @default.
- W4382724591 hasConcept C124101348 @default.
- W4382724591 hasConcept C154945302 @default.
- W4382724591 hasConcept C157764524 @default.
- W4382724591 hasConcept C173608175 @default.
- W4382724591 hasConcept C199360897 @default.
- W4382724591 hasConcept C203357204 @default.
- W4382724591 hasConcept C2778751112 @default.
- W4382724591 hasConcept C32587265 @default.
- W4382724591 hasConcept C41008148 @default.
- W4382724591 hasConcept C43364308 @default.
- W4382724591 hasConcept C555944384 @default.
- W4382724591 hasConcept C77088390 @default.
- W4382724591 hasConcept C99138194 @default.
- W4382724591 hasConceptScore W4382724591C102392041 @default.
- W4382724591 hasConceptScore W4382724591C111919701 @default.
- W4382724591 hasConceptScore W4382724591C11413529 @default.
- W4382724591 hasConceptScore W4382724591C124101348 @default.
- W4382724591 hasConceptScore W4382724591C154945302 @default.
- W4382724591 hasConceptScore W4382724591C157764524 @default.
- W4382724591 hasConceptScore W4382724591C173608175 @default.
- W4382724591 hasConceptScore W4382724591C199360897 @default.
- W4382724591 hasConceptScore W4382724591C203357204 @default.
- W4382724591 hasConceptScore W4382724591C2778751112 @default.
- W4382724591 hasConceptScore W4382724591C32587265 @default.
- W4382724591 hasConceptScore W4382724591C41008148 @default.
- W4382724591 hasConceptScore W4382724591C43364308 @default.
- W4382724591 hasConceptScore W4382724591C555944384 @default.
- W4382724591 hasConceptScore W4382724591C77088390 @default.
- W4382724591 hasConceptScore W4382724591C99138194 @default.
- W4382724591 hasFunder F4320321001 @default.
- W4382724591 hasIssue "9" @default.
- W4382724591 hasLocation W43827245911 @default.
- W4382724591 hasOpenAccess W4382724591 @default.
- W4382724591 hasPrimaryLocation W43827245911 @default.
- W4382724591 hasRelatedWork W1521996498 @default.
- W4382724591 hasRelatedWork W2070099235 @default.
- W4382724591 hasRelatedWork W2351279544 @default.
- W4382724591 hasRelatedWork W2356209611 @default.
- W4382724591 hasRelatedWork W2587323806 @default.
- W4382724591 hasRelatedWork W3014193728 @default.
- W4382724591 hasRelatedWork W3094771490 @default.
- W4382724591 hasRelatedWork W3159666886 @default.
- W4382724591 hasRelatedWork W4312753418 @default.
- W4382724591 hasRelatedWork W4382724591 @default.
- W4382724591 hasVolume "34" @default.
- W4382724591 isParatext "false" @default.
- W4382724591 isRetracted "false" @default.
- W4382724591 workType "article" @default.