Matches in SemOpenAlex for { <https://semopenalex.org/work/W3172479815> ?p ?o ?g. }
Showing items 1 to 65 of
65
with 100 items per page.
- W3172479815 abstract "Computational scientists are eager to utilize computing resources to execute their applications to advance their understanding of various complex phenomena. This eagerness drives the rapid technological development in high performance computing (HPC). Modern HPC systems exhibit rapid growth in the number of cores per computing node and the number of computing nodes per system. As such, modern HPC systems offer additional levels of hardware parallelism at the core, node, and system levels. Each level requires and employs techniques for appropriate scheduling of the computational work at the respective level. These scheduling techniques work separately without coordination, and each technique is designed to achieve specific performance targets. Currently, the absence of coordination between schedulers at different levels is an open research problem. In many cases, independent scheduling decisions degrade applications’ performance and signify inefficient resources’ usage of contemporary HPC systems. To solve this problem, we formulate the following research question: How can the multilevel parallelism of a modern HPC system be exploited through scheduling to improve the performance of computationally-intensive applications and to enhance the utilization of HPC resources? Understanding the relation between the different scheduling levels is crucial for solving the aforementioned research question. However, it is challenged by (1) the absence of methods, models, and tools that allow examining and analyzing the interaction and the mutual impact of these scheduling levels and (2) the different nature and performance targets of each of these scheduling levels. This doctoral dissertation addresses these challenges in the context of two specific scheduling classes: queuing-based job scheduling at the batch-level and dynamic loop self-scheduling (DLS) at the application-level. We propose and evaluate a multilevel scheduling (MLS) prototype that solves the problem by bridging the schedulers at these scheduling levels. The MLS prototype aims to decrease applications’ execution time and increase system utilization. It employs two novel scheduling approaches that have been introduced by this doctoral dissertation: (1) the distributed chunk-calculation approach (DCA) and (2) the resourceful coordination approach (RCA) to achieve performance targets. At the application-level, DCA addresses the scalability challenges associated with existing DLS implementation approaches while maintaining a global scheduling overview that is important to achieve global optimal scheduling decisions. We apply DCA to several DLS techniques, and we show how it benefits applications’ execution time (the first goal of the MLS prototype). At the batch-level, RCA enables application schedulers to share their allocated but idle computing resources with other applications through a batch system. The significance of RCA is that it leverages and combines the advantages of node sharing and dynamic resource and job management. It offers an efficient resource sharing (of idle resources only) and avoids shrink and expansion operations on the application side. RCA allows batch systems to reassign computing resources once they become free (the second goal of the MLS prototype). By employing DCA and RCA, the MLS prototype answers the research question and shows a creative and useful way of exploiting the multilevel parallelism of modern HPC systems through scheduling. This doctoral dissertation advances the state-of-the-art by demonstrating the usefulness and the performance potential of coordinated scheduling decisions at different levels. We also designed and implemented a set of methods and tools, which we make available for the community to analyze the mutual impact of decisions at different levels of scheduling." @default.
- W3172479815 created "2021-06-22" @default.
- W3172479815 creator A5066704316 @default.
- W3172479815 date "2021-01-01" @default.
- W3172479815 modified "2023-09-23" @default.
- W3172479815 title "Multilevel Scheduling of Computations on Parallel Large-scale Systems" @default.
- W3172479815 hasPublicationYear "2021" @default.
- W3172479815 type Work @default.
- W3172479815 sameAs 3172479815 @default.
- W3172479815 citedByCount "0" @default.
- W3172479815 crossrefType "dissertation" @default.
- W3172479815 hasAuthorship W3172479815A5066704316 @default.
- W3172479815 hasConcept C107568181 @default.
- W3172479815 hasConcept C119948110 @default.
- W3172479815 hasConcept C120314980 @default.
- W3172479815 hasConcept C126255220 @default.
- W3172479815 hasConcept C173608175 @default.
- W3172479815 hasConcept C175893541 @default.
- W3172479815 hasConcept C206729178 @default.
- W3172479815 hasConcept C31258907 @default.
- W3172479815 hasConcept C31689143 @default.
- W3172479815 hasConcept C33923547 @default.
- W3172479815 hasConcept C41008148 @default.
- W3172479815 hasConcept C5119721 @default.
- W3172479815 hasConcept C83283714 @default.
- W3172479815 hasConceptScore W3172479815C107568181 @default.
- W3172479815 hasConceptScore W3172479815C119948110 @default.
- W3172479815 hasConceptScore W3172479815C120314980 @default.
- W3172479815 hasConceptScore W3172479815C126255220 @default.
- W3172479815 hasConceptScore W3172479815C173608175 @default.
- W3172479815 hasConceptScore W3172479815C175893541 @default.
- W3172479815 hasConceptScore W3172479815C206729178 @default.
- W3172479815 hasConceptScore W3172479815C31258907 @default.
- W3172479815 hasConceptScore W3172479815C31689143 @default.
- W3172479815 hasConceptScore W3172479815C33923547 @default.
- W3172479815 hasConceptScore W3172479815C41008148 @default.
- W3172479815 hasConceptScore W3172479815C5119721 @default.
- W3172479815 hasConceptScore W3172479815C83283714 @default.
- W3172479815 hasLocation W31724798151 @default.
- W3172479815 hasOpenAccess W3172479815 @default.
- W3172479815 hasPrimaryLocation W31724798151 @default.
- W3172479815 hasRelatedWork W100566344 @default.
- W3172479815 hasRelatedWork W1525770743 @default.
- W3172479815 hasRelatedWork W181259032 @default.
- W3172479815 hasRelatedWork W1990399073 @default.
- W3172479815 hasRelatedWork W2080589812 @default.
- W3172479815 hasRelatedWork W2117790631 @default.
- W3172479815 hasRelatedWork W2156809218 @default.
- W3172479815 hasRelatedWork W2281193709 @default.
- W3172479815 hasRelatedWork W2309619976 @default.
- W3172479815 hasRelatedWork W2373101223 @default.
- W3172479815 hasRelatedWork W2614944019 @default.
- W3172479815 hasRelatedWork W2744745229 @default.
- W3172479815 hasRelatedWork W2885087577 @default.
- W3172479815 hasRelatedWork W2893236605 @default.
- W3172479815 hasRelatedWork W2910264096 @default.
- W3172479815 hasRelatedWork W3158759366 @default.
- W3172479815 hasRelatedWork W3169025515 @default.
- W3172479815 hasRelatedWork W3205961405 @default.
- W3172479815 hasRelatedWork W39062071 @default.
- W3172479815 hasRelatedWork W1558764633 @default.
- W3172479815 isParatext "false" @default.
- W3172479815 isRetracted "false" @default.
- W3172479815 magId "3172479815" @default.
- W3172479815 workType "dissertation" @default.