Matches in SemOpenAlex for { <https://semopenalex.org/work/W4214540558> ?p ?o ?g. }
- W4214540558 abstract "Multi-stage user-facing applications on GPUs are widely-used nowa- days, and are often implemented to be microservices. Prior re- search works are not applicable to ensuring QoS of GPU-based microservices due to the different communication patterns and shared resource contentions. We propose Astraea to manage GPU microservices considering the above factors. In Astraea, a microser- vice deployment policy is used to maximize the supported peak service load while ensuring the required QoS. To adaptively switch the communication methods between microservices according to different deployments, we propose an auto-scaling GPU communi- cation framework. The framework automatically scales based on the currently used hardware topology and microservice location, and adopts global memory-based techniques to reduce intra-GPU communication. Astraea increases the supported peak load by up to 82.3% while achieving the desired 99%-ile latency target compared with state-of-the-art solutions." @default.
- W4214540558 created "2022-03-02" @default.
- W4214540558 creator A5003939279 @default.
- W4214540558 creator A5018118687 @default.
- W4214540558 creator A5037258424 @default.
- W4214540558 creator A5037447327 @default.
- W4214540558 creator A5039318240 @default.
- W4214540558 creator A5049824718 @default.
- W4214540558 creator A5090297023 @default.
- W4214540558 date "2022-02-22" @default.
- W4214540558 modified "2023-10-13" @default.
- W4214540558 title "Astraea: towards QoS-aware and resource-efficient multi-stage GPU services" @default.
- W4214540558 cites W1895577753 @default.
- W4214540558 cites W1905882502 @default.
- W4214540558 cites W1982063824 @default.
- W4214540558 cites W2015518316 @default.
- W4214540558 cites W2062482040 @default.
- W4214540558 cites W2080592089 @default.
- W4214540558 cites W2118297542 @default.
- W4214540558 cites W2170616854 @default.
- W4214540558 cites W2216946510 @default.
- W4214540558 cites W2293634267 @default.
- W4214540558 cites W2323909431 @default.
- W4214540558 cites W2503339013 @default.
- W4214540558 cites W2581065617 @default.
- W4214540558 cites W2604514113 @default.
- W4214540558 cites W2612225380 @default.
- W4214540558 cites W2612387305 @default.
- W4214540558 cites W2616747538 @default.
- W4214540558 cites W2798291715 @default.
- W4214540558 cites W2798748382 @default.
- W4214540558 cites W2887500939 @default.
- W4214540558 cites W2899134946 @default.
- W4214540558 cites W2903838325 @default.
- W4214540558 cites W2916954860 @default.
- W4214540558 cites W2928897890 @default.
- W4214540558 cites W2930500175 @default.
- W4214540558 cites W2931122162 @default.
- W4214540558 cites W2948000013 @default.
- W4214540558 cites W2949380140 @default.
- W4214540558 cites W2952562115 @default.
- W4214540558 cites W2953038929 @default.
- W4214540558 cites W2955831855 @default.
- W4214540558 cites W2963341956 @default.
- W4214540558 cites W2963612019 @default.
- W4214540558 cites W2964161387 @default.
- W4214540558 cites W2965503873 @default.
- W4214540558 cites W2966792645 @default.
- W4214540558 cites W2970419734 @default.
- W4214540558 cites W2982157693 @default.
- W4214540558 cites W2982506996 @default.
- W4214540558 cites W2982576723 @default.
- W4214540558 cites W3014810041 @default.
- W4214540558 cites W3016842236 @default.
- W4214540558 cites W3017148238 @default.
- W4214540558 cites W3043433718 @default.
- W4214540558 cites W3099214871 @default.
- W4214540558 cites W3156127671 @default.
- W4214540558 cites W3208777667 @default.
- W4214540558 cites W3214539954 @default.
- W4214540558 cites W4200542591 @default.
- W4214540558 doi "https://doi.org/10.1145/3503222.3507721" @default.
- W4214540558 hasPublicationYear "2022" @default.
- W4214540558 type Work @default.
- W4214540558 citedByCount "7" @default.
- W4214540558 countsByYear W42145405582022 @default.
- W4214540558 countsByYear W42145405582023 @default.
- W4214540558 crossrefType "proceedings-article" @default.
- W4214540558 hasAuthorship W4214540558A5003939279 @default.
- W4214540558 hasAuthorship W4214540558A5018118687 @default.
- W4214540558 hasAuthorship W4214540558A5037258424 @default.
- W4214540558 hasAuthorship W4214540558A5037447327 @default.
- W4214540558 hasAuthorship W4214540558A5039318240 @default.
- W4214540558 hasAuthorship W4214540558A5049824718 @default.
- W4214540558 hasAuthorship W4214540558A5090297023 @default.
- W4214540558 hasConcept C105339364 @default.
- W4214540558 hasConcept C111919701 @default.
- W4214540558 hasConcept C118524514 @default.
- W4214540558 hasConcept C120314980 @default.
- W4214540558 hasConcept C173608175 @default.
- W4214540558 hasConcept C206345919 @default.
- W4214540558 hasConcept C2778119891 @default.
- W4214540558 hasConcept C2778505942 @default.
- W4214540558 hasConcept C31258907 @default.
- W4214540558 hasConcept C41008148 @default.
- W4214540558 hasConcept C5119721 @default.
- W4214540558 hasConcept C76155785 @default.
- W4214540558 hasConcept C79974875 @default.
- W4214540558 hasConcept C82876162 @default.
- W4214540558 hasConceptScore W4214540558C105339364 @default.
- W4214540558 hasConceptScore W4214540558C111919701 @default.
- W4214540558 hasConceptScore W4214540558C118524514 @default.
- W4214540558 hasConceptScore W4214540558C120314980 @default.
- W4214540558 hasConceptScore W4214540558C173608175 @default.
- W4214540558 hasConceptScore W4214540558C206345919 @default.
- W4214540558 hasConceptScore W4214540558C2778119891 @default.
- W4214540558 hasConceptScore W4214540558C2778505942 @default.
- W4214540558 hasConceptScore W4214540558C31258907 @default.
- W4214540558 hasConceptScore W4214540558C41008148 @default.
- W4214540558 hasConceptScore W4214540558C5119721 @default.