Matches in SemOpenAlex for { <https://semopenalex.org/work/W4291952092> ?p ?o ?g. }
- W4291952092 endingPage "3785" @default.
- W4291952092 startingPage "3755" @default.
- W4291952092 abstract "<abstract> <p>The transformer model has recently been a milestone in artificial intelligence. The algorithm has enhanced the performance of tasks such as Machine Translation and Computer Vision to a level previously unattainable. However, the transformer model has a strong performance but also requires a high amount of memory overhead and enormous computing power. This significantly hinders the deployment of an energy-efficient transformer system. Due to the high parallelism, low latency, and low power consumption of field-programmable gate arrays (FPGAs) and application specific integrated circuits (ASICs), they demonstrate higher energy efficiency than Graphics Processing Units (GPUs) and Central Processing Units (CPUs). Therefore, FPGA and ASIC are widely used to accelerate deep learning algorithms. Several papers have addressed the issue of deploying the Transformer on dedicated hardware for acceleration, but there is a lack of comprehensive studies in this area. Therefore, we summarize the transformer model compression algorithm based on the hardware accelerator and its implementation to provide a comprehensive overview of this research domain. This paper first introduces the transformer model framework and computation process. Secondly, a discussion of hardware-friendly compression algorithms based on self-attention and Transformer is provided, along with a review of a state-of-the-art hardware accelerator framework. Finally, we considered some promising topics in transformer hardware acceleration, such as a high-level design framework and selecting the optimum device using reinforcement learning.</p> </abstract>" @default.
- W4291952092 created "2022-08-16" @default.
- W4291952092 creator A5005722645 @default.
- W4291952092 creator A5053772930 @default.
- W4291952092 creator A5056576567 @default.
- W4291952092 creator A5077327850 @default.
- W4291952092 creator A5079552709 @default.
- W4291952092 date "2022-01-01" @default.
- W4291952092 modified "2023-09-30" @default.
- W4291952092 title "Hardware-friendly compression and hardware acceleration for transformer: A survey" @default.
- W4291952092 cites W1575701986 @default.
- W4291952092 cites W2009654791 @default.
- W4291952092 cites W2012833704 @default.
- W4291952092 cites W2064675550 @default.
- W4291952092 cites W2091843288 @default.
- W4291952092 cites W2162064258 @default.
- W4291952092 cites W2197984537 @default.
- W4291952092 cites W2300242332 @default.
- W4291952092 cites W2413794162 @default.
- W4291952092 cites W2783538964 @default.
- W4291952092 cites W2860338957 @default.
- W4291952092 cites W2910396952 @default.
- W4291952092 cites W2911884654 @default.
- W4291952092 cites W2914968962 @default.
- W4291952092 cites W2915106038 @default.
- W4291952092 cites W2917450576 @default.
- W4291952092 cites W2943389092 @default.
- W4291952092 cites W2962820060 @default.
- W4291952092 cites W2963367920 @default.
- W4291952092 cites W2963396654 @default.
- W4291952092 cites W2964121960 @default.
- W4291952092 cites W2998342322 @default.
- W4291952092 cites W3012561096 @default.
- W4291952092 cites W3017024317 @default.
- W4291952092 cites W3023255099 @default.
- W4291952092 cites W3033108890 @default.
- W4291952092 cites W3047848469 @default.
- W4291952092 cites W3104151879 @default.
- W4291952092 cites W3104263050 @default.
- W4291952092 cites W3162542754 @default.
- W4291952092 cites W3169769781 @default.
- W4291952092 cites W3170825342 @default.
- W4291952092 cites W3176468986 @default.
- W4291952092 cites W3177265267 @default.
- W4291952092 cites W3189877953 @default.
- W4291952092 cites W3194017222 @default.
- W4291952092 cites W3199934250 @default.
- W4291952092 cites W3203309275 @default.
- W4291952092 cites W3204801262 @default.
- W4291952092 cites W3206837665 @default.
- W4291952092 cites W4280515676 @default.
- W4291952092 cites W4285261368 @default.
- W4291952092 cites W954001337 @default.
- W4291952092 doi "https://doi.org/10.3934/era.2022192" @default.
- W4291952092 hasPublicationYear "2022" @default.
- W4291952092 type Work @default.
- W4291952092 citedByCount "0" @default.
- W4291952092 crossrefType "journal-article" @default.
- W4291952092 hasAuthorship W4291952092A5005722645 @default.
- W4291952092 hasAuthorship W4291952092A5053772930 @default.
- W4291952092 hasAuthorship W4291952092A5056576567 @default.
- W4291952092 hasAuthorship W4291952092A5077327850 @default.
- W4291952092 hasAuthorship W4291952092A5079552709 @default.
- W4291952092 hasBestOaLocation W42919520921 @default.
- W4291952092 hasConcept C118524514 @default.
- W4291952092 hasConcept C119599485 @default.
- W4291952092 hasConcept C127413603 @default.
- W4291952092 hasConcept C13164978 @default.
- W4291952092 hasConcept C149635348 @default.
- W4291952092 hasConcept C165801399 @default.
- W4291952092 hasConcept C2742236 @default.
- W4291952092 hasConcept C41008148 @default.
- W4291952092 hasConcept C42935608 @default.
- W4291952092 hasConcept C66322947 @default.
- W4291952092 hasConcept C77390884 @default.
- W4291952092 hasConcept C9390403 @default.
- W4291952092 hasConceptScore W4291952092C118524514 @default.
- W4291952092 hasConceptScore W4291952092C119599485 @default.
- W4291952092 hasConceptScore W4291952092C127413603 @default.
- W4291952092 hasConceptScore W4291952092C13164978 @default.
- W4291952092 hasConceptScore W4291952092C149635348 @default.
- W4291952092 hasConceptScore W4291952092C165801399 @default.
- W4291952092 hasConceptScore W4291952092C2742236 @default.
- W4291952092 hasConceptScore W4291952092C41008148 @default.
- W4291952092 hasConceptScore W4291952092C42935608 @default.
- W4291952092 hasConceptScore W4291952092C66322947 @default.
- W4291952092 hasConceptScore W4291952092C77390884 @default.
- W4291952092 hasConceptScore W4291952092C9390403 @default.
- W4291952092 hasIssue "10" @default.
- W4291952092 hasLocation W42919520921 @default.
- W4291952092 hasLocation W42919520922 @default.
- W4291952092 hasOpenAccess W4291952092 @default.
- W4291952092 hasPrimaryLocation W42919520921 @default.
- W4291952092 hasRelatedWork W1732210391 @default.
- W4291952092 hasRelatedWork W2062495483 @default.
- W4291952092 hasRelatedWork W2066442567 @default.
- W4291952092 hasRelatedWork W2100470915 @default.
- W4291952092 hasRelatedWork W3010492628 @default.