Matches in SemOpenAlex for { <https://semopenalex.org/work/W2951201167> ?p ?o ?g. }
Showing items 1 to 91 of
91
with 100 items per page.
- W2951201167 abstract "Fault tolerance is one of the major design goals for HPC. The emergence of non-volatile memories (NVM) provides a solution to build fault tolerant HPC. Data in NVM-based main memory are not lost when the system crashes because of the non-volatility nature of NVM. However, because of volatile caches, data must be logged and explicitly flushed from caches into NVM to ensure consistence and correctness before crashes, which can cause large runtime overhead. In this paper, we introduce an algorithm-based method to establish crash consistence in NVM for HPC applications. We slightly extend application data structures or sparsely flush cache blocks, which introduce ignorable runtime overhead. Such extension or cache flushing allows us to use algorithm knowledge to textit{reason} data consistence or correct inconsistent data when the application crashes. We demonstrate the effectiveness of our method for three algorithms, including an iterative solver, dense matrix multiplication, and Monte-Carlo simulation. Based on comprehensive performance evaluation on a variety of test environments, we demonstrate that our approach has very small runtime overhead (at most 8.2% and less than 3% in most cases), much smaller than that of traditional checkpoint, while having the same or less recomputation cost." @default.
- W2951201167 created "2019-06-27" @default.
- W2951201167 creator A5029827598 @default.
- W2951201167 creator A5056036725 @default.
- W2951201167 creator A5062590263 @default.
- W2951201167 creator A5071200777 @default.
- W2951201167 creator A5074531400 @default.
- W2951201167 date "2017-05-16" @default.
- W2951201167 modified "2023-10-12" @default.
- W2951201167 title "Algorithm-Directed Crash Consistence in Non-Volatile Memory for HPC" @default.
- W2951201167 cites W1865162546 @default.
- W2951201167 cites W1976573677 @default.
- W2951201167 cites W1980321276 @default.
- W2951201167 cites W1981432246 @default.
- W2951201167 cites W1990832096 @default.
- W2951201167 cites W2005887179 @default.
- W2951201167 cites W2041135949 @default.
- W2951201167 cites W2090204040 @default.
- W2951201167 cites W2104129492 @default.
- W2951201167 cites W2107594045 @default.
- W2951201167 cites W2113637091 @default.
- W2951201167 cites W2130076536 @default.
- W2951201167 cites W2133287150 @default.
- W2951201167 cites W2134633067 @default.
- W2951201167 cites W2150662965 @default.
- W2951201167 cites W2155694951 @default.
- W2951201167 cites W2165022815 @default.
- W2951201167 cites W2171724053 @default.
- W2951201167 cites W2296772319 @default.
- W2951201167 cites W2337228275 @default.
- W2951201167 cites W2411778045 @default.
- W2951201167 cites W2515380288 @default.
- W2951201167 cites W2565225244 @default.
- W2951201167 cites W2565270815 @default.
- W2951201167 cites W28511425 @default.
- W2951201167 doi "https://doi.org/10.48550/arxiv.1705.05541" @default.
- W2951201167 hasPublicationYear "2017" @default.
- W2951201167 type Work @default.
- W2951201167 sameAs 2951201167 @default.
- W2951201167 citedByCount "1" @default.
- W2951201167 countsByYear W29512011672017 @default.
- W2951201167 crossrefType "posted-content" @default.
- W2951201167 hasAuthorship W2951201167A5029827598 @default.
- W2951201167 hasAuthorship W2951201167A5056036725 @default.
- W2951201167 hasAuthorship W2951201167A5062590263 @default.
- W2951201167 hasAuthorship W2951201167A5071200777 @default.
- W2951201167 hasAuthorship W2951201167A5074531400 @default.
- W2951201167 hasBestOaLocation W29512011671 @default.
- W2951201167 hasConcept C111919701 @default.
- W2951201167 hasConcept C11413529 @default.
- W2951201167 hasConcept C115537543 @default.
- W2951201167 hasConcept C120314980 @default.
- W2951201167 hasConcept C149635348 @default.
- W2951201167 hasConcept C173608175 @default.
- W2951201167 hasConcept C177950962 @default.
- W2951201167 hasConcept C183469790 @default.
- W2951201167 hasConcept C2779960059 @default.
- W2951201167 hasConcept C41008148 @default.
- W2951201167 hasConcept C55439883 @default.
- W2951201167 hasConcept C63540848 @default.
- W2951201167 hasConcept C9390403 @default.
- W2951201167 hasConceptScore W2951201167C111919701 @default.
- W2951201167 hasConceptScore W2951201167C11413529 @default.
- W2951201167 hasConceptScore W2951201167C115537543 @default.
- W2951201167 hasConceptScore W2951201167C120314980 @default.
- W2951201167 hasConceptScore W2951201167C149635348 @default.
- W2951201167 hasConceptScore W2951201167C173608175 @default.
- W2951201167 hasConceptScore W2951201167C177950962 @default.
- W2951201167 hasConceptScore W2951201167C183469790 @default.
- W2951201167 hasConceptScore W2951201167C2779960059 @default.
- W2951201167 hasConceptScore W2951201167C41008148 @default.
- W2951201167 hasConceptScore W2951201167C55439883 @default.
- W2951201167 hasConceptScore W2951201167C63540848 @default.
- W2951201167 hasConceptScore W2951201167C9390403 @default.
- W2951201167 hasLocation W29512011671 @default.
- W2951201167 hasOpenAccess W2951201167 @default.
- W2951201167 hasPrimaryLocation W29512011671 @default.
- W2951201167 hasRelatedWork W1521356350 @default.
- W2951201167 hasRelatedWork W1571368810 @default.
- W2951201167 hasRelatedWork W1616582327 @default.
- W2951201167 hasRelatedWork W1784146144 @default.
- W2951201167 hasRelatedWork W1796231360 @default.
- W2951201167 hasRelatedWork W2285867394 @default.
- W2951201167 hasRelatedWork W2350731024 @default.
- W2951201167 hasRelatedWork W2379400621 @default.
- W2951201167 hasRelatedWork W2611544471 @default.
- W2951201167 hasRelatedWork W4242263690 @default.
- W2951201167 isParatext "false" @default.
- W2951201167 isRetracted "false" @default.
- W2951201167 magId "2951201167" @default.
- W2951201167 workType "article" @default.