Matches in SemOpenAlex for { <https://semopenalex.org/work/W3153642904> ?p ?o ?g. }
- W3153642904 endingPage "207" @default.
- W3153642904 startingPage "196" @default.
- W3153642904 abstract "Recently, pre-trained transformer-based architectures have proven to be very efficient at language modeling and understanding, given that they are trained on a large enough corpus. Applications in language generation for Arabic are still lagging in comparison to other NLP advances primarily due to the lack of advanced Arabic language generation models. In this paper, we develop the first advanced Arabic language generation model, AraGPT2, trained from scratch on a large Arabic corpus of internet text and news articles. Our largest model, AraGPT2-mega, has 1.46 billion parameters, which makes it the largest Arabic language model available. The mega model was evaluated and showed success on different tasks including synthetic news generation, and zero-shot question answering. For text generation, our best model achieves a perplexity of 29.8 on held-out Wikipedia articles. A study conducted with human evaluators showed the significant success of AraGPT2-mega in generating news articles that are difficult to distinguish from articles written by humans. We thus develop and release an automatic discriminator model with a 98% percent accuracy in detecting model-generated text. The models are also publicly available, hoping to encourage new research directions and applications for Arabic NLP." @default.
- W3153642904 created "2021-04-26" @default.
- W3153642904 creator A5050198833 @default.
- W3153642904 creator A5050749439 @default.
- W3153642904 creator A5088085383 @default.
- W3153642904 date "2020-12-31" @default.
- W3153642904 modified "2023-10-01" @default.
- W3153642904 title "AraGPT2: Pre-Trained Transformer for Arabic Language Generation" @default.
- W3153642904 cites W1990501283 @default.
- W3153642904 cites W2797328513 @default.
- W3153642904 cites W2951080837 @default.
- W3153642904 cites W2963026768 @default.
- W3153642904 cites W2963341956 @default.
- W3153642904 cites W2963403868 @default.
- W3153642904 cites W2963532001 @default.
- W3153642904 cites W2964121744 @default.
- W3153642904 cites W2965373594 @default.
- W3153642904 cites W2969958763 @default.
- W3153642904 cites W2970814728 @default.
- W3153642904 cites W2970960342 @default.
- W3153642904 cites W2971008823 @default.
- W3153642904 cites W2971016465 @default.
- W3153642904 cites W2973049837 @default.
- W3153642904 cites W2975901202 @default.
- W3153642904 cites W2983040767 @default.
- W3153642904 cites W2995435108 @default.
- W3153642904 cites W3045462440 @default.
- W3153642904 cites W3088592174 @default.
- W3153642904 cites W3098302716 @default.
- W3153642904 cites W3098824823 @default.
- W3153642904 cites W3101860695 @default.
- W3153642904 cites W3114326827 @default.
- W3153642904 cites W3118440692 @default.
- W3153642904 cites W3118942176 @default.
- W3153642904 cites W3119349118 @default.
- W3153642904 cites W3128029819 @default.
- W3153642904 cites W630532510 @default.
- W3153642904 cites W3092558113 @default.
- W3153642904 hasPublicationYear "2020" @default.
- W3153642904 type Work @default.
- W3153642904 sameAs 3153642904 @default.
- W3153642904 citedByCount "4" @default.
- W3153642904 countsByYear W31536429042021 @default.
- W3153642904 crossrefType "journal-article" @default.
- W3153642904 hasAuthorship W3153642904A5050198833 @default.
- W3153642904 hasAuthorship W3153642904A5050749439 @default.
- W3153642904 hasAuthorship W3153642904A5088085383 @default.
- W3153642904 hasConcept C100279451 @default.
- W3153642904 hasConcept C119599485 @default.
- W3153642904 hasConcept C127413603 @default.
- W3153642904 hasConcept C137293760 @default.
- W3153642904 hasConcept C138885662 @default.
- W3153642904 hasConcept C154945302 @default.
- W3153642904 hasConcept C165801399 @default.
- W3153642904 hasConcept C204321447 @default.
- W3153642904 hasConcept C2985684807 @default.
- W3153642904 hasConcept C41008148 @default.
- W3153642904 hasConcept C41895202 @default.
- W3153642904 hasConcept C66322947 @default.
- W3153642904 hasConcept C96455323 @default.
- W3153642904 hasConceptScore W3153642904C100279451 @default.
- W3153642904 hasConceptScore W3153642904C119599485 @default.
- W3153642904 hasConceptScore W3153642904C127413603 @default.
- W3153642904 hasConceptScore W3153642904C137293760 @default.
- W3153642904 hasConceptScore W3153642904C138885662 @default.
- W3153642904 hasConceptScore W3153642904C154945302 @default.
- W3153642904 hasConceptScore W3153642904C165801399 @default.
- W3153642904 hasConceptScore W3153642904C204321447 @default.
- W3153642904 hasConceptScore W3153642904C2985684807 @default.
- W3153642904 hasConceptScore W3153642904C41008148 @default.
- W3153642904 hasConceptScore W3153642904C41895202 @default.
- W3153642904 hasConceptScore W3153642904C66322947 @default.
- W3153642904 hasConceptScore W3153642904C96455323 @default.
- W3153642904 hasLocation W31536429041 @default.
- W3153642904 hasOpenAccess W3153642904 @default.
- W3153642904 hasPrimaryLocation W31536429041 @default.
- W3153642904 hasRelatedWork W1982389596 @default.
- W3153642904 hasRelatedWork W2114206703 @default.
- W3153642904 hasRelatedWork W2303320811 @default.
- W3153642904 hasRelatedWork W2767633796 @default.
- W3153642904 hasRelatedWork W2771552299 @default.
- W3153642904 hasRelatedWork W2921848514 @default.
- W3153642904 hasRelatedWork W2968098022 @default.
- W3153642904 hasRelatedWork W2971016465 @default.
- W3153642904 hasRelatedWork W2992442068 @default.
- W3153642904 hasRelatedWork W3023295910 @default.
- W3153642904 hasRelatedWork W3080820686 @default.
- W3153642904 hasRelatedWork W3088262928 @default.
- W3153642904 hasRelatedWork W3095706972 @default.
- W3153642904 hasRelatedWork W3097571385 @default.
- W3153642904 hasRelatedWork W3118580427 @default.
- W3153642904 hasRelatedWork W3119546299 @default.
- W3153642904 hasRelatedWork W3192589309 @default.
- W3153642904 hasRelatedWork W3203510119 @default.
- W3153642904 hasRelatedWork W3203784836 @default.
- W3153642904 hasRelatedWork W842243003 @default.
- W3153642904 isParatext "false" @default.
- W3153642904 isRetracted "false" @default.