Matches in SemOpenAlex for { <https://semopenalex.org/work/W2539230224> ?p ?o ?g. }
Showing items 1 to 92 of
92
with 100 items per page.
- W2539230224 endingPage "29" @default.
- W2539230224 startingPage "16" @default.
- W2539230224 abstract "Non-volatile devices, such as SSDs, will be an integral part of the deepening storage hierarchy on large-scale HPC systems. These devices can be on the compute nodes as part of a distributed burst buffer service or they can be external. Wherever they are located in the hierarchy, one critical design issue is the SSD endurance under the write-heavy workloads, such as the checkpoint I/O for scientific applications. For these environments, it is widely assumed that checkpoint operations can occur once every 60 min and for each checkpoint step as much as half of the system memory can be written out. Unfortunately, for large-scale HPC applications, the burst buffer SSDs can be worn out much more quickly given the extensive amount of data written at every checkpoint step. One possible solution is to control the amount of data written by reducing the checkpoint frequency. However, a direct effect caused by reduced checkpoint frequency is the increased vulnerability window of system failures and therefore potentially wasted computation time, especially for large-scale compute jobs. In this paper, we propose a new checkpoint placement optimization model which collaboratively utilizes both the burst buffer and the parallel file system to store the checkpoints, with design goals of maximizing computation efficiency while guaranteeing the SSD endurance requirements. Moreover, we present an adaptive algorithm which can dynamically adjust the checkpoint placement based on the system’s dynamic runtime characteristics and continuously optimize the burst buffer utilization. The evaluation results show that by using our adaptive checkpoint placement algorithm we can guarantee the burst buffer endurance with at most 5% performance degradation per application and less than 3% for the entire system." @default.
- W2539230224 created "2016-10-28" @default.
- W2539230224 creator A5014538652 @default.
- W2539230224 creator A5057326516 @default.
- W2539230224 creator A5068437302 @default.
- W2539230224 creator A5075702553 @default.
- W2539230224 date "2017-02-01" @default.
- W2539230224 modified "2023-10-16" @default.
- W2539230224 title "Optimizing checkpoint data placement with guaranteed burst buffer endurance in large-scale hierarchical storage systems" @default.
- W2539230224 cites W1974534679 @default.
- W2539230224 cites W1977792483 @default.
- W2539230224 cites W1981509927 @default.
- W2539230224 cites W1984564341 @default.
- W2539230224 cites W1987003998 @default.
- W2539230224 cites W1989299066 @default.
- W2539230224 cites W1990945477 @default.
- W2539230224 cites W1991269738 @default.
- W2539230224 cites W2004885674 @default.
- W2539230224 cites W2006425633 @default.
- W2539230224 cites W2028052812 @default.
- W2539230224 cites W2031080073 @default.
- W2539230224 cites W2031479196 @default.
- W2539230224 cites W2033656974 @default.
- W2539230224 cites W2039631162 @default.
- W2539230224 cites W2056966287 @default.
- W2539230224 cites W2063793192 @default.
- W2539230224 cites W2064388050 @default.
- W2539230224 cites W2094723572 @default.
- W2539230224 cites W2097861191 @default.
- W2539230224 cites W2107200720 @default.
- W2539230224 cites W2123871098 @default.
- W2539230224 cites W2133046454 @default.
- W2539230224 cites W2138900633 @default.
- W2539230224 cites W2166143798 @default.
- W2539230224 cites W4245507143 @default.
- W2539230224 doi "https://doi.org/10.1016/j.jpdc.2016.10.002" @default.
- W2539230224 hasPublicationYear "2017" @default.
- W2539230224 type Work @default.
- W2539230224 sameAs 2539230224 @default.
- W2539230224 citedByCount "18" @default.
- W2539230224 countsByYear W25392302242017 @default.
- W2539230224 countsByYear W25392302242018 @default.
- W2539230224 countsByYear W25392302242019 @default.
- W2539230224 countsByYear W25392302242020 @default.
- W2539230224 countsByYear W25392302242021 @default.
- W2539230224 countsByYear W25392302242022 @default.
- W2539230224 crossrefType "journal-article" @default.
- W2539230224 hasAuthorship W2539230224A5014538652 @default.
- W2539230224 hasAuthorship W2539230224A5057326516 @default.
- W2539230224 hasAuthorship W2539230224A5068437302 @default.
- W2539230224 hasAuthorship W2539230224A5075702553 @default.
- W2539230224 hasBestOaLocation W25392302241 @default.
- W2539230224 hasConcept C11413529 @default.
- W2539230224 hasConcept C115537543 @default.
- W2539230224 hasConcept C120314980 @default.
- W2539230224 hasConcept C149635348 @default.
- W2539230224 hasConcept C173608175 @default.
- W2539230224 hasConcept C2778100165 @default.
- W2539230224 hasConcept C41008148 @default.
- W2539230224 hasConcept C45374587 @default.
- W2539230224 hasConceptScore W2539230224C11413529 @default.
- W2539230224 hasConceptScore W2539230224C115537543 @default.
- W2539230224 hasConceptScore W2539230224C120314980 @default.
- W2539230224 hasConceptScore W2539230224C149635348 @default.
- W2539230224 hasConceptScore W2539230224C173608175 @default.
- W2539230224 hasConceptScore W2539230224C2778100165 @default.
- W2539230224 hasConceptScore W2539230224C41008148 @default.
- W2539230224 hasConceptScore W2539230224C45374587 @default.
- W2539230224 hasFunder F4320306076 @default.
- W2539230224 hasFunder F4320332359 @default.
- W2539230224 hasLocation W25392302241 @default.
- W2539230224 hasLocation W25392302242 @default.
- W2539230224 hasLocation W25392302243 @default.
- W2539230224 hasOpenAccess W2539230224 @default.
- W2539230224 hasPrimaryLocation W25392302241 @default.
- W2539230224 hasRelatedWork W1509211761 @default.
- W2539230224 hasRelatedWork W1558545464 @default.
- W2539230224 hasRelatedWork W1984303163 @default.
- W2539230224 hasRelatedWork W2117014006 @default.
- W2539230224 hasRelatedWork W2358725432 @default.
- W2539230224 hasRelatedWork W2372170743 @default.
- W2539230224 hasRelatedWork W3037767301 @default.
- W2539230224 hasRelatedWork W314212532 @default.
- W2539230224 hasRelatedWork W4232453487 @default.
- W2539230224 hasRelatedWork W4233815414 @default.
- W2539230224 hasVolume "100" @default.
- W2539230224 isParatext "false" @default.
- W2539230224 isRetracted "false" @default.
- W2539230224 magId "2539230224" @default.
- W2539230224 workType "article" @default.