Matches in SemOpenAlex for { <https://semopenalex.org/work/W2934853022> ?p ?o ?g. }
Showing items 1 to 96 of
96
with 100 items per page.
- W2934853022 abstract "Recurrent neural networks (RNNs) have gained significant attention due to their effectiveness in modeling sequential data, such as text and voice signal. However, due to the complex data dependencies and limited parallelism, current inference libraries for RNNs on GPUs produce either high latency or poor scalability, leading to inefficient resource utilization. Consequently, companies like Microsoft and Facebook use CPUs to serve RNN models. This work demonstrates the root causes of the unsatisfactory performance of existing implementations for RNN inference on GPUs from several aspects, including poor data reuse, low on-chip resource utilization, and high synchronization overhead. We systematically address these issues and develop a GPU-based RNN inference library, called GRNN, that provides low latency, high throughput, and efficient resource utilization. GRNN minimizes global memory accesses and synchronization overhead, as well as balancing on-chip resource usage through novel data reorganization, thread mapping, and performance modeling techniques. Evaluated on extensive benchmarking and real-world applications, we show that GRNN outperforms the state-of-the-art CPU inference library by up to 17.5X and state-of-the-art GPU inference libraries by up to 9X in terms of latency reduction." @default.
- W2934853022 created "2019-04-11" @default.
- W2934853022 creator A5012018288 @default.
- W2934853022 creator A5015762394 @default.
- W2934853022 creator A5018958225 @default.
- W2934853022 creator A5040302174 @default.
- W2934853022 creator A5059368105 @default.
- W2934853022 date "2019-03-25" @default.
- W2934853022 modified "2023-10-13" @default.
- W2934853022 title "GRNN" @default.
- W2934853022 cites W1902237438 @default.
- W2934853022 cites W1968391520 @default.
- W2934853022 cites W2143612262 @default.
- W2934853022 cites W2155893237 @default.
- W2934853022 cites W2157331557 @default.
- W2934853022 cites W2323909431 @default.
- W2934853022 cites W2470673105 @default.
- W2934853022 cites W2513383847 @default.
- W2934853022 cites W2604787577 @default.
- W2934853022 cites W2657126969 @default.
- W2934853022 cites W2766166018 @default.
- W2934853022 cites W2794670651 @default.
- W2934853022 cites W2798291715 @default.
- W2934853022 cites W2953212265 @default.
- W2934853022 doi "https://doi.org/10.1145/3302424.3303949" @default.
- W2934853022 hasPublicationYear "2019" @default.
- W2934853022 type Work @default.
- W2934853022 sameAs 2934853022 @default.
- W2934853022 citedByCount "31" @default.
- W2934853022 countsByYear W29348530222018 @default.
- W2934853022 countsByYear W29348530222019 @default.
- W2934853022 countsByYear W29348530222020 @default.
- W2934853022 countsByYear W29348530222021 @default.
- W2934853022 countsByYear W29348530222022 @default.
- W2934853022 countsByYear W29348530222023 @default.
- W2934853022 crossrefType "proceedings-article" @default.
- W2934853022 hasAuthorship W2934853022A5012018288 @default.
- W2934853022 hasAuthorship W2934853022A5015762394 @default.
- W2934853022 hasAuthorship W2934853022A5018958225 @default.
- W2934853022 hasAuthorship W2934853022A5040302174 @default.
- W2934853022 hasAuthorship W2934853022A5059368105 @default.
- W2934853022 hasBestOaLocation W29348530221 @default.
- W2934853022 hasConcept C113775141 @default.
- W2934853022 hasConcept C118524514 @default.
- W2934853022 hasConcept C120314980 @default.
- W2934853022 hasConcept C147168706 @default.
- W2934853022 hasConcept C154945302 @default.
- W2934853022 hasConcept C173608175 @default.
- W2934853022 hasConcept C2776214188 @default.
- W2934853022 hasConcept C41008148 @default.
- W2934853022 hasConcept C48044578 @default.
- W2934853022 hasConcept C50644808 @default.
- W2934853022 hasConcept C76155785 @default.
- W2934853022 hasConcept C77088390 @default.
- W2934853022 hasConcept C82876162 @default.
- W2934853022 hasConceptScore W2934853022C113775141 @default.
- W2934853022 hasConceptScore W2934853022C118524514 @default.
- W2934853022 hasConceptScore W2934853022C120314980 @default.
- W2934853022 hasConceptScore W2934853022C147168706 @default.
- W2934853022 hasConceptScore W2934853022C154945302 @default.
- W2934853022 hasConceptScore W2934853022C173608175 @default.
- W2934853022 hasConceptScore W2934853022C2776214188 @default.
- W2934853022 hasConceptScore W2934853022C41008148 @default.
- W2934853022 hasConceptScore W2934853022C48044578 @default.
- W2934853022 hasConceptScore W2934853022C50644808 @default.
- W2934853022 hasConceptScore W2934853022C76155785 @default.
- W2934853022 hasConceptScore W2934853022C77088390 @default.
- W2934853022 hasConceptScore W2934853022C82876162 @default.
- W2934853022 hasFunder F4320306076 @default.
- W2934853022 hasLocation W29348530221 @default.
- W2934853022 hasOpenAccess W2934853022 @default.
- W2934853022 hasPrimaryLocation W29348530221 @default.
- W2934853022 hasRelatedWork W1667652561 @default.
- W2934853022 hasRelatedWork W2342173569 @default.
- W2934853022 hasRelatedWork W2585720638 @default.
- W2934853022 hasRelatedWork W2588448445 @default.
- W2934853022 hasRelatedWork W2606722458 @default.
- W2934853022 hasRelatedWork W2623333128 @default.
- W2934853022 hasRelatedWork W2657126969 @default.
- W2934853022 hasRelatedWork W2796440709 @default.
- W2934853022 hasRelatedWork W2798291715 @default.
- W2934853022 hasRelatedWork W2886885214 @default.
- W2934853022 hasRelatedWork W2888193973 @default.
- W2934853022 hasRelatedWork W2971843695 @default.
- W2934853022 hasRelatedWork W3030835235 @default.
- W2934853022 hasRelatedWork W3037896803 @default.
- W2934853022 hasRelatedWork W3041406277 @default.
- W2934853022 hasRelatedWork W3080950018 @default.
- W2934853022 hasRelatedWork W3170887803 @default.
- W2934853022 hasRelatedWork W3176073989 @default.
- W2934853022 hasRelatedWork W3176869348 @default.
- W2934853022 hasRelatedWork W2969539885 @default.
- W2934853022 isParatext "false" @default.
- W2934853022 isRetracted "false" @default.
- W2934853022 magId "2934853022" @default.
- W2934853022 workType "article" @default.