Matches in SemOpenAlex for { <https://semopenalex.org/work/W4386943870> ?p ?o ?g. }
Showing items 1 to 97 of
97
with 100 items per page.
- W4386943870 abstract "This paper studies checkpointing strategies for parallel applications subject to failures. The optimal strategy to minimize total execution time, or makespan, is well known when failure IATs obey an Exponential distribution, but it is unknown for non-memoryless failure distributions. We explain why the latter fact is misunderstood in recent literature. We propose a general strategy that maximizes the expected efficiency until the next failure, and we show that this strategy achieves an asymptotically optimal makespan, thereby establishing the first optimality result for arbitrary failure distributions. Through extensive simulations, we show that the new strategy is always at least as good as the Young/Daly strategy for various failure distributions. For distributions with a high infant mortality (such as LogNormal with shape parameter k = 2.51 or Weibull with shape parameter 0.5), the execution time is divided by a factor 1.9 on average, and up to a factor 4.2 for recently deployed platforms." @default.
- W4386943870 created "2023-09-23" @default.
- W4386943870 creator A5001838181 @default.
- W4386943870 creator A5008190319 @default.
- W4386943870 creator A5037778045 @default.
- W4386943870 creator A5086100728 @default.
- W4386943870 date "2023-09-22" @default.
- W4386943870 modified "2023-10-01" @default.
- W4386943870 title "Checkpointing strategies to tolerate non-memoryless failures on HPC platforms" @default.
- W4386943870 cites W1558516248 @default.
- W4386943870 cites W1981432246 @default.
- W4386943870 cites W2033656974 @default.
- W4386943870 cites W2039631162 @default.
- W4386943870 cites W2042643186 @default.
- W4386943870 cites W2056966287 @default.
- W4386943870 cites W2062313287 @default.
- W4386943870 cites W2063924830 @default.
- W4386943870 cites W2064388050 @default.
- W4386943870 cites W2077911375 @default.
- W4386943870 cites W2081235423 @default.
- W4386943870 cites W2081605270 @default.
- W4386943870 cites W2098259808 @default.
- W4386943870 cites W2127433432 @default.
- W4386943870 cites W2131053137 @default.
- W4386943870 cites W2133046454 @default.
- W4386943870 cites W2147470852 @default.
- W4386943870 cites W2152975002 @default.
- W4386943870 cites W2266293027 @default.
- W4386943870 cites W2477392784 @default.
- W4386943870 cites W2491102031 @default.
- W4386943870 cites W2567534547 @default.
- W4386943870 cites W2792837593 @default.
- W4386943870 cites W2800215724 @default.
- W4386943870 cites W2908721660 @default.
- W4386943870 cites W3173475201 @default.
- W4386943870 cites W4233783938 @default.
- W4386943870 doi "https://doi.org/10.1145/3624560" @default.
- W4386943870 hasPublicationYear "2023" @default.
- W4386943870 type Work @default.
- W4386943870 citedByCount "0" @default.
- W4386943870 crossrefType "journal-article" @default.
- W4386943870 hasAuthorship W4386943870A5001838181 @default.
- W4386943870 hasAuthorship W4386943870A5008190319 @default.
- W4386943870 hasAuthorship W4386943870A5037778045 @default.
- W4386943870 hasAuthorship W4386943870A5086100728 @default.
- W4386943870 hasBestOaLocation W43869438701 @default.
- W4386943870 hasConcept C105795698 @default.
- W4386943870 hasConcept C110121322 @default.
- W4386943870 hasConcept C126255220 @default.
- W4386943870 hasConcept C134306372 @default.
- W4386943870 hasConcept C151376022 @default.
- W4386943870 hasConcept C151620405 @default.
- W4386943870 hasConcept C173291955 @default.
- W4386943870 hasConcept C181789720 @default.
- W4386943870 hasConcept C199360897 @default.
- W4386943870 hasConcept C206729178 @default.
- W4386943870 hasConcept C2781039887 @default.
- W4386943870 hasConcept C31258907 @default.
- W4386943870 hasConcept C33923547 @default.
- W4386943870 hasConcept C41008148 @default.
- W4386943870 hasConcept C55350006 @default.
- W4386943870 hasConcept C55416958 @default.
- W4386943870 hasConcept C74172769 @default.
- W4386943870 hasConceptScore W4386943870C105795698 @default.
- W4386943870 hasConceptScore W4386943870C110121322 @default.
- W4386943870 hasConceptScore W4386943870C126255220 @default.
- W4386943870 hasConceptScore W4386943870C134306372 @default.
- W4386943870 hasConceptScore W4386943870C151376022 @default.
- W4386943870 hasConceptScore W4386943870C151620405 @default.
- W4386943870 hasConceptScore W4386943870C173291955 @default.
- W4386943870 hasConceptScore W4386943870C181789720 @default.
- W4386943870 hasConceptScore W4386943870C199360897 @default.
- W4386943870 hasConceptScore W4386943870C206729178 @default.
- W4386943870 hasConceptScore W4386943870C2781039887 @default.
- W4386943870 hasConceptScore W4386943870C31258907 @default.
- W4386943870 hasConceptScore W4386943870C33923547 @default.
- W4386943870 hasConceptScore W4386943870C41008148 @default.
- W4386943870 hasConceptScore W4386943870C55350006 @default.
- W4386943870 hasConceptScore W4386943870C55416958 @default.
- W4386943870 hasConceptScore W4386943870C74172769 @default.
- W4386943870 hasLocation W43869438701 @default.
- W4386943870 hasLocation W43869438702 @default.
- W4386943870 hasOpenAccess W4386943870 @default.
- W4386943870 hasPrimaryLocation W43869438701 @default.
- W4386943870 hasRelatedWork W10392919 @default.
- W4386943870 hasRelatedWork W1602419965 @default.
- W4386943870 hasRelatedWork W1981223513 @default.
- W4386943870 hasRelatedWork W2061339814 @default.
- W4386943870 hasRelatedWork W2067732275 @default.
- W4386943870 hasRelatedWork W4212841943 @default.
- W4386943870 hasRelatedWork W4312779177 @default.
- W4386943870 hasRelatedWork W4320161920 @default.
- W4386943870 hasRelatedWork W4386943870 @default.
- W4386943870 hasRelatedWork W1902375684 @default.
- W4386943870 isParatext "false" @default.
- W4386943870 isRetracted "false" @default.
- W4386943870 workType "article" @default.