Matches in SemOpenAlex for { <https://semopenalex.org/work/W4317928031> ?p ?o ?g. }
- W4317928031 abstract "Scientific applications have long embraced the MPI as the environment of choice to execute on large distributed systems. The User-Level Failure Mitigation (ULFM) specification extends the MPI standard to address resilience and enable MPI applications to restore their communication capability after a failure. This work builds upon the wide body of experience gained in the field to eliminate a gap between current practice and the ideal, more asynchronous, recovery model in which the fault tolerance activities of multiple components can be carried out simultaneously and overlap. This work proposes to: (1) provide the required consistency in fault reporting to applications (i.e., enable an application to assess the success of a computational phase without incurring an unacceptable performance hit); (2) bring forward the building blocks that permit the effective scoping of fault recovery in an application, so that independent components in an application can recover without interfering with each other, and separate groups of processes in the application can recover independently or in unison; and (3) overlap recovery activities necessary to restore the consistency of the system (e.g., eviction of faulty processes from the communication group) with application recovery activities (e.g., dataset restoration from checkpoints)." @default.
- W4317928031 created "2023-01-25" @default.
- W4317928031 creator A5023554829 @default.
- W4317928031 creator A5027550136 @default.
- W4317928031 creator A5032748496 @default.
- W4317928031 creator A5033868370 @default.
- W4317928031 creator A5038062469 @default.
- W4317928031 creator A5054590774 @default.
- W4317928031 creator A5059605240 @default.
- W4317928031 creator A5069846379 @default.
- W4317928031 date "2022-11-01" @default.
- W4317928031 modified "2023-10-17" @default.
- W4317928031 title "Towards Precision-Aware Fault Tolerance Approaches for Mixed-Precision Applications" @default.
- W4317928031 cites W1553890549 @default.
- W4317928031 cites W1986905947 @default.
- W4317928031 cites W2105524676 @default.
- W4317928031 cites W2268994915 @default.
- W4317928031 cites W2318507312 @default.
- W4317928031 cites W2767260595 @default.
- W4317928031 cites W2951941091 @default.
- W4317928031 cites W2979340153 @default.
- W4317928031 cites W3043303806 @default.
- W4317928031 cites W3101271453 @default.
- W4317928031 cites W3187552919 @default.
- W4317928031 doi "https://doi.org/10.1109/ftxs56515.2022.00010" @default.
- W4317928031 hasPublicationYear "2022" @default.
- W4317928031 type Work @default.
- W4317928031 citedByCount "0" @default.
- W4317928031 crossrefType "proceedings-article" @default.
- W4317928031 hasAuthorship W4317928031A5023554829 @default.
- W4317928031 hasAuthorship W4317928031A5027550136 @default.
- W4317928031 hasAuthorship W4317928031A5032748496 @default.
- W4317928031 hasAuthorship W4317928031A5033868370 @default.
- W4317928031 hasAuthorship W4317928031A5038062469 @default.
- W4317928031 hasAuthorship W4317928031A5054590774 @default.
- W4317928031 hasAuthorship W4317928031A5059605240 @default.
- W4317928031 hasAuthorship W4317928031A5069846379 @default.
- W4317928031 hasConcept C120314980 @default.
- W4317928031 hasConcept C121332964 @default.
- W4317928031 hasConcept C127413603 @default.
- W4317928031 hasConcept C149635348 @default.
- W4317928031 hasConcept C151319957 @default.
- W4317928031 hasConcept C154945302 @default.
- W4317928031 hasConcept C200601418 @default.
- W4317928031 hasConcept C202444582 @default.
- W4317928031 hasConcept C24890656 @default.
- W4317928031 hasConcept C2776436953 @default.
- W4317928031 hasConcept C2779585090 @default.
- W4317928031 hasConcept C2780304638 @default.
- W4317928031 hasConcept C31258907 @default.
- W4317928031 hasConcept C33923547 @default.
- W4317928031 hasConcept C41008148 @default.
- W4317928031 hasConcept C50712370 @default.
- W4317928031 hasConcept C63540848 @default.
- W4317928031 hasConcept C9652623 @default.
- W4317928031 hasConcept C97355855 @default.
- W4317928031 hasConceptScore W4317928031C120314980 @default.
- W4317928031 hasConceptScore W4317928031C121332964 @default.
- W4317928031 hasConceptScore W4317928031C127413603 @default.
- W4317928031 hasConceptScore W4317928031C149635348 @default.
- W4317928031 hasConceptScore W4317928031C151319957 @default.
- W4317928031 hasConceptScore W4317928031C154945302 @default.
- W4317928031 hasConceptScore W4317928031C200601418 @default.
- W4317928031 hasConceptScore W4317928031C202444582 @default.
- W4317928031 hasConceptScore W4317928031C24890656 @default.
- W4317928031 hasConceptScore W4317928031C2776436953 @default.
- W4317928031 hasConceptScore W4317928031C2779585090 @default.
- W4317928031 hasConceptScore W4317928031C2780304638 @default.
- W4317928031 hasConceptScore W4317928031C31258907 @default.
- W4317928031 hasConceptScore W4317928031C33923547 @default.
- W4317928031 hasConceptScore W4317928031C41008148 @default.
- W4317928031 hasConceptScore W4317928031C50712370 @default.
- W4317928031 hasConceptScore W4317928031C63540848 @default.
- W4317928031 hasConceptScore W4317928031C9652623 @default.
- W4317928031 hasConceptScore W4317928031C97355855 @default.
- W4317928031 hasLocation W43179280311 @default.
- W4317928031 hasOpenAccess W4317928031 @default.
- W4317928031 hasPrimaryLocation W43179280311 @default.
- W4317928031 hasRelatedWork W130300090 @default.
- W4317928031 hasRelatedWork W2001315747 @default.
- W4317928031 hasRelatedWork W2018353224 @default.
- W4317928031 hasRelatedWork W2106754492 @default.
- W4317928031 hasRelatedWork W2120774006 @default.
- W4317928031 hasRelatedWork W2416188964 @default.
- W4317928031 hasRelatedWork W3102160464 @default.
- W4317928031 hasRelatedWork W4251380107 @default.
- W4317928031 hasRelatedWork W4312045749 @default.
- W4317928031 hasRelatedWork W4317928114 @default.
- W4317928031 isParatext "false" @default.
- W4317928031 isRetracted "false" @default.
- W4317928031 workType "article" @default.