Matches in SemOpenAlex for { <https://semopenalex.org/work/W2524803075> ?p ?o ?g. }
Showing items 1 to 69 of
69
with 100 items per page.
- W2524803075 abstract "Resource management and job scheduling is a crucial task on large-scale computing systems. Despite years of research on resource management and scheduling, it has not kept pace with modern changes and technology trends. The study of this thesis is motivated by emerging issues observed in current production supercomputers, caused by reasons such as human behaviors, application characteristics, and increasing system complexity. Specifically, users tend to provide inaccurate parameters for their jobs which are dependent by the scheduler; system owners have diverse goals which are always conflicting with each other. Also, workload characteristics on production supercomputers keep changing unpredictably, making it hard to achieve a sustainable scheduling performance since the scheduling policies are largely dependent on workload characteristics. Further, increasing hardware complexity causes system issues and leads to new demands. For example, issues such as node fragmentation, failure interruption, power consumption, and I/O overhead have become common in large-scale systems. Existing resource management systems lack the support for these issues and demands. In this study, we present an integrated resource management and scheduling framework, aiming at addressing emerging issues and challenges in resource management for large-scale production supercomputers. We have designed a set of new schemes, including job parameter prediction, adaptive metric-aware job scheduling, cost-aware job scheduling, and multi-domain job coscheduling. We have implemented these approaches in the production resource manager Cobalt, and evaluated them with real job traces from production supercomputers such as the Blue Gene/P system at Argonne National Laboratory. Experimental results show our schemes can effectively improve job scheduling regarding both user satisfaction and system utilization." @default.
- W2524803075 created "2016-10-07" @default.
- W2524803075 creator A5050175109 @default.
- W2524803075 creator A5060400468 @default.
- W2524803075 date "2012-01-01" @default.
- W2524803075 modified "2023-09-23" @default.
- W2524803075 title "An integrated resource management and scheduling framework for production supercomputers" @default.
- W2524803075 hasPublicationYear "2012" @default.
- W2524803075 type Work @default.
- W2524803075 sameAs 2524803075 @default.
- W2524803075 citedByCount "0" @default.
- W2524803075 crossrefType "journal-article" @default.
- W2524803075 hasAuthorship W2524803075A5050175109 @default.
- W2524803075 hasAuthorship W2524803075A5060400468 @default.
- W2524803075 hasConcept C107568181 @default.
- W2524803075 hasConcept C111873713 @default.
- W2524803075 hasConcept C111919701 @default.
- W2524803075 hasConcept C120314980 @default.
- W2524803075 hasConcept C127413603 @default.
- W2524803075 hasConcept C176165272 @default.
- W2524803075 hasConcept C202372285 @default.
- W2524803075 hasConcept C206729178 @default.
- W2524803075 hasConcept C21547014 @default.
- W2524803075 hasConcept C2778476105 @default.
- W2524803075 hasConcept C41008148 @default.
- W2524803075 hasConcept C56739046 @default.
- W2524803075 hasConcept C68387754 @default.
- W2524803075 hasConcept C79974875 @default.
- W2524803075 hasConceptScore W2524803075C107568181 @default.
- W2524803075 hasConceptScore W2524803075C111873713 @default.
- W2524803075 hasConceptScore W2524803075C111919701 @default.
- W2524803075 hasConceptScore W2524803075C120314980 @default.
- W2524803075 hasConceptScore W2524803075C127413603 @default.
- W2524803075 hasConceptScore W2524803075C176165272 @default.
- W2524803075 hasConceptScore W2524803075C202372285 @default.
- W2524803075 hasConceptScore W2524803075C206729178 @default.
- W2524803075 hasConceptScore W2524803075C21547014 @default.
- W2524803075 hasConceptScore W2524803075C2778476105 @default.
- W2524803075 hasConceptScore W2524803075C41008148 @default.
- W2524803075 hasConceptScore W2524803075C56739046 @default.
- W2524803075 hasConceptScore W2524803075C68387754 @default.
- W2524803075 hasConceptScore W2524803075C79974875 @default.
- W2524803075 hasLocation W25248030751 @default.
- W2524803075 hasOpenAccess W2524803075 @default.
- W2524803075 hasPrimaryLocation W25248030751 @default.
- W2524803075 hasRelatedWork W12709835 @default.
- W2524803075 hasRelatedWork W2000506235 @default.
- W2524803075 hasRelatedWork W2045492779 @default.
- W2524803075 hasRelatedWork W2112272914 @default.
- W2524803075 hasRelatedWork W2410249735 @default.
- W2524803075 hasRelatedWork W2562225561 @default.
- W2524803075 hasRelatedWork W2606946523 @default.
- W2524803075 hasRelatedWork W2746083998 @default.
- W2524803075 hasRelatedWork W2770662851 @default.
- W2524803075 hasRelatedWork W2782964495 @default.
- W2524803075 hasRelatedWork W2792423399 @default.
- W2524803075 hasRelatedWork W2885445628 @default.
- W2524803075 hasRelatedWork W2906718912 @default.
- W2524803075 hasRelatedWork W2974142568 @default.
- W2524803075 hasRelatedWork W2991391160 @default.
- W2524803075 hasRelatedWork W3171320739 @default.
- W2524803075 hasRelatedWork W3183258681 @default.
- W2524803075 hasRelatedWork W3185253049 @default.
- W2524803075 hasRelatedWork W769686211 @default.
- W2524803075 hasRelatedWork W2339664244 @default.
- W2524803075 isParatext "false" @default.
- W2524803075 isRetracted "false" @default.
- W2524803075 magId "2524803075" @default.
- W2524803075 workType "article" @default.