Matches in SemOpenAlex for { <https://semopenalex.org/work/W4313003346> ?p ?o ?g. }
Showing items 1 to 51 of
51
with 100 items per page.
- W4313003346 abstract "Silent data corruption poses a significant risk to the integrity of data in storage systems. Although error correction codes (ECC) can recover the majority of such errors, a non-negligible portion of them escape ECC, referred as uncorrectable errors (UEs). Despite being rare in nature, increasing scale of storage systems and fast-growing I/O rates decreased the mean time between UEs from months to hours. Yet, unlike disk failures, UEs are hard to predict with high precision, making it difficult to adopt proactive measures. In this paper, we introduce a probabilistic approach to deploy UE mitigation strategies that can capture significant portion of UE while keeping the system overhead at a tolerable range. To achieve this, we first estimate the probability of I/O operations to be exposed to UEs and find a minimum subset of disks for which employing UE avoidance strategies can lead to significant decrease in UE exposure. We demonstrate through extensive simulations that when the proposed probabilistic model is used to implement write verification strategy to detect and recover from UEs, more than 50% of all write-triggered UEs can be avoided with 1% read overhead, and more than 70% of UEs can be mitigated with less than 3.5% read overhead. We further measure the impact of incurred read overhead on write performance in production Lustre and GPFS file systems and validate our findings that more than half of UEs can be avoided while degrading write I/O throughout by less than 0.9%." @default.
- W4313003346 created "2023-01-05" @default.
- W4313003346 creator A5006686573 @default.
- W4313003346 creator A5008559299 @default.
- W4313003346 creator A5019077099 @default.
- W4313003346 creator A5029780043 @default.
- W4313003346 date "2022-09-01" @default.
- W4313003346 modified "2023-09-24" @default.
- W4313003346 title "Be SMART, Save I/O: A Probabilistic Approach to Avoid Uncorrectable Errors in Storage Systems" @default.
- W4313003346 doi "https://doi.org/10.1109/cluster51413.2022.00038" @default.
- W4313003346 hasPublicationYear "2022" @default.
- W4313003346 type Work @default.
- W4313003346 citedByCount "0" @default.
- W4313003346 crossrefType "proceedings-article" @default.
- W4313003346 hasAuthorship W4313003346A5006686573 @default.
- W4313003346 hasAuthorship W4313003346A5008559299 @default.
- W4313003346 hasAuthorship W4313003346A5019077099 @default.
- W4313003346 hasAuthorship W4313003346A5029780043 @default.
- W4313003346 hasConcept C111919701 @default.
- W4313003346 hasConcept C120314980 @default.
- W4313003346 hasConcept C149635348 @default.
- W4313003346 hasConcept C154945302 @default.
- W4313003346 hasConcept C2779960059 @default.
- W4313003346 hasConcept C41008148 @default.
- W4313003346 hasConcept C49937458 @default.
- W4313003346 hasConcept C79403827 @default.
- W4313003346 hasConceptScore W4313003346C111919701 @default.
- W4313003346 hasConceptScore W4313003346C120314980 @default.
- W4313003346 hasConceptScore W4313003346C149635348 @default.
- W4313003346 hasConceptScore W4313003346C154945302 @default.
- W4313003346 hasConceptScore W4313003346C2779960059 @default.
- W4313003346 hasConceptScore W4313003346C41008148 @default.
- W4313003346 hasConceptScore W4313003346C49937458 @default.
- W4313003346 hasConceptScore W4313003346C79403827 @default.
- W4313003346 hasFunder F4320306076 @default.
- W4313003346 hasLocation W43130033461 @default.
- W4313003346 hasOpenAccess W4313003346 @default.
- W4313003346 hasPrimaryLocation W43130033461 @default.
- W4313003346 hasRelatedWork W1530562558 @default.
- W4313003346 hasRelatedWork W1557295419 @default.
- W4313003346 hasRelatedWork W1569520790 @default.
- W4313003346 hasRelatedWork W2092071486 @default.
- W4313003346 hasRelatedWork W2131630752 @default.
- W4313003346 hasRelatedWork W2139833378 @default.
- W4313003346 hasRelatedWork W2391167130 @default.
- W4313003346 hasRelatedWork W4283067488 @default.
- W4313003346 hasRelatedWork W94000989 @default.
- W4313003346 hasRelatedWork W2460246254 @default.
- W4313003346 isParatext "false" @default.
- W4313003346 isRetracted "false" @default.
- W4313003346 workType "article" @default.