Matches in SemOpenAlex for { <https://semopenalex.org/work/W2290138186> ?p ?o ?g. }
Showing items 1 to 98 of
98
with 100 items per page.
- W2290138186 abstract "Application demands and grand challenges in numerical simulation require for both highly capable computing platforms and efficient numerical solution schemes. Power constraints and further miniaturization of modern and future hardware give way for multi- and manycore processors with increasing fine-grained parallelism and deeply nested hierarchical memory systems -- as already exemplified by recent graphics processing units. Accordingly, numerical schemes need to be adapted and re-engineered in order to deliver scalable solutions across diverse processor configurations. Portability of parallel software solutions across emerging hardware platforms is another challenge. This work investigates multi-coloring and re-ordering schemes for block Gaus-Seidel methods and, in particular, for incomplete LU factorizations with and without fill-ins. We consider two matrix re-ordering schemes that deliver flexible and efficient parallel preconditioners. The general idea is to generate block decompositions of the system matrix such that the diagonal blocks are diagonal itself. In such a way, parallelism can be exploited on the block-level in a scalable manner. Our goal is to provide widely applicable, out-of-the-box preconditioners that can be used in the context of finite element solvers. We propose a new method for anticipating the fill-in pattern of ILU($p$) schemes which we call the power($q$)-pattern method . This method is based on an incomplete factorization of the system matrix $A$ subject to a predetermined pattern given by the matrix power $|A|^(p+1)$ and its associated multi-coloring permutation $. We prove that the obtained sparsity pattern is a superset of our modified ILU($p$) factorization applied to pi A pi^(-1). As a result, this modified ILU($p$) applied to multi-colored system matrix has no fill-ins in its diagonal blocks. This leads to an inherently parallel execution of triangular ILU($p$) sweeps. In addition, we describe the integration of the preconditioners into the HiFlow$^3$ open-source finite element package that provides a portable software solution across diverse hardware platforms. On this basis, we conduct performance analysis across a variety of test problems on multi-core CPUs and GPUs that proves efficiency, scalability and flexibility of our approach. Our preconditioners achieve a solver acceleration by a factor of up to 1.5, 8 and 85 for three different test problems. The GPU versions of the preconditioned solver are by a factor of up to 4 faster than an OpenMP parallel version on eight cores." @default.
- W2290138186 created "2016-06-24" @default.
- W2290138186 creator A5000665959 @default.
- W2290138186 creator A5058920681 @default.
- W2290138186 creator A5079475129 @default.
- W2290138186 date "2011-07-01" @default.
- W2290138186 modified "2023-09-26" @default.
- W2290138186 title "Enhanced Parallel ILU(p)-based Preconditioners for Multi-core CPUs and GPUs -- The Power(q)-pattern Method" @default.
- W2290138186 cites W1506342804 @default.
- W2290138186 cites W1524015806 @default.
- W2290138186 cites W1575701986 @default.
- W2290138186 cites W171162551 @default.
- W2290138186 cites W1760551737 @default.
- W2290138186 cites W1994805693 @default.
- W2290138186 cites W2039789965 @default.
- W2290138186 cites W2106032147 @default.
- W2290138186 cites W2128853364 @default.
- W2290138186 cites W2156588317 @default.
- W2290138186 cites W764232590 @default.
- W2290138186 doi "https://doi.org/10.11588/emclpp.2011.08.11690" @default.
- W2290138186 hasPublicationYear "2011" @default.
- W2290138186 type Work @default.
- W2290138186 sameAs 2290138186 @default.
- W2290138186 citedByCount "6" @default.
- W2290138186 countsByYear W22901381862012 @default.
- W2290138186 countsByYear W22901381862014 @default.
- W2290138186 crossrefType "journal-article" @default.
- W2290138186 hasAuthorship W2290138186A5000665959 @default.
- W2290138186 hasAuthorship W2290138186A5058920681 @default.
- W2290138186 hasAuthorship W2290138186A5079475129 @default.
- W2290138186 hasConcept C121332964 @default.
- W2290138186 hasConcept C121684516 @default.
- W2290138186 hasConcept C151730666 @default.
- W2290138186 hasConcept C163716315 @default.
- W2290138186 hasConcept C173608175 @default.
- W2290138186 hasConcept C199360897 @default.
- W2290138186 hasConcept C21442007 @default.
- W2290138186 hasConcept C2524010 @default.
- W2290138186 hasConcept C2777210771 @default.
- W2290138186 hasConcept C2778119891 @default.
- W2290138186 hasConcept C2779343474 @default.
- W2290138186 hasConcept C33923547 @default.
- W2290138186 hasConcept C41008148 @default.
- W2290138186 hasConcept C459310 @default.
- W2290138186 hasConcept C48044578 @default.
- W2290138186 hasConcept C56372850 @default.
- W2290138186 hasConcept C62520636 @default.
- W2290138186 hasConcept C63000827 @default.
- W2290138186 hasConcept C77088390 @default.
- W2290138186 hasConcept C86803240 @default.
- W2290138186 hasConceptScore W2290138186C121332964 @default.
- W2290138186 hasConceptScore W2290138186C121684516 @default.
- W2290138186 hasConceptScore W2290138186C151730666 @default.
- W2290138186 hasConceptScore W2290138186C163716315 @default.
- W2290138186 hasConceptScore W2290138186C173608175 @default.
- W2290138186 hasConceptScore W2290138186C199360897 @default.
- W2290138186 hasConceptScore W2290138186C21442007 @default.
- W2290138186 hasConceptScore W2290138186C2524010 @default.
- W2290138186 hasConceptScore W2290138186C2777210771 @default.
- W2290138186 hasConceptScore W2290138186C2778119891 @default.
- W2290138186 hasConceptScore W2290138186C2779343474 @default.
- W2290138186 hasConceptScore W2290138186C33923547 @default.
- W2290138186 hasConceptScore W2290138186C41008148 @default.
- W2290138186 hasConceptScore W2290138186C459310 @default.
- W2290138186 hasConceptScore W2290138186C48044578 @default.
- W2290138186 hasConceptScore W2290138186C56372850 @default.
- W2290138186 hasConceptScore W2290138186C62520636 @default.
- W2290138186 hasConceptScore W2290138186C63000827 @default.
- W2290138186 hasConceptScore W2290138186C77088390 @default.
- W2290138186 hasConceptScore W2290138186C86803240 @default.
- W2290138186 hasIssue "08" @default.
- W2290138186 hasLocation W22901381861 @default.
- W2290138186 hasOpenAccess W2290138186 @default.
- W2290138186 hasPrimaryLocation W22901381861 @default.
- W2290138186 hasRelatedWork W1110572413 @default.
- W2290138186 hasRelatedWork W1490374280 @default.
- W2290138186 hasRelatedWork W1506342804 @default.
- W2290138186 hasRelatedWork W171162551 @default.
- W2290138186 hasRelatedWork W1965034778 @default.
- W2290138186 hasRelatedWork W1985263109 @default.
- W2290138186 hasRelatedWork W1991324025 @default.
- W2290138186 hasRelatedWork W2025829250 @default.
- W2290138186 hasRelatedWork W2035080386 @default.
- W2290138186 hasRelatedWork W2099093654 @default.
- W2290138186 hasRelatedWork W2128853364 @default.
- W2290138186 hasRelatedWork W2132450860 @default.
- W2290138186 hasRelatedWork W2509377822 @default.
- W2290138186 hasRelatedWork W2550177803 @default.
- W2290138186 hasRelatedWork W2884218692 @default.
- W2290138186 hasRelatedWork W2965979058 @default.
- W2290138186 hasRelatedWork W2972050710 @default.
- W2290138186 hasRelatedWork W3005868431 @default.
- W2290138186 hasRelatedWork W3119799675 @default.
- W2290138186 hasRelatedWork W1848026038 @default.
- W2290138186 isParatext "false" @default.
- W2290138186 isRetracted "false" @default.
- W2290138186 magId "2290138186" @default.
- W2290138186 workType "article" @default.