Matches in SemOpenAlex for { <https://semopenalex.org/work/W3210561089> ?p ?o ?g. }
Showing items 1 to 76 of
76
with 100 items per page.
- W3210561089 abstract "This paper introduces the hmBlogs corpus for Persian, as a low resource language. This corpus has been prepared based on a collection of nearly 20 million blog posts over a period of about 15 years from a space of Persian blogs and includes more than 6.8 billion tokens. It can be claimed that this corpus is currently the largest Persian corpus that has been prepared independently for the Persian language. This corpus is presented in both raw and preprocessed forms, and based on the preprocessed corpus some word embedding models are produced. By the provided models, the hmBlogs is compared with some of the most important corpora available in Persian, and the results show the superiority of the hmBlogs corpus over the others. These evaluations also present the importance and effects of corpora, evaluation datasets, model production methods, different hyperparameters and even the evaluation methods. In addition to evaluating the corpus and its produced language models, this research also presents a semantic analogy dataset." @default.
- W3210561089 created "2021-11-08" @default.
- W3210561089 creator A5023914076 @default.
- W3210561089 creator A5055741193 @default.
- W3210561089 date "2021-11-03" @default.
- W3210561089 modified "2023-09-27" @default.
- W3210561089 title "HmBlogs: A big general Persian corpus." @default.
- W3210561089 cites W143115998 @default.
- W3210561089 cites W168564468 @default.
- W3210561089 cites W2026306693 @default.
- W3210561089 cites W2214534013 @default.
- W3210561089 cites W2251735256 @default.
- W3210561089 cites W2493916176 @default.
- W3210561089 cites W2753628379 @default.
- W3210561089 cites W2806934493 @default.
- W3210561089 cites W2835395670 @default.
- W3210561089 cites W2892839738 @default.
- W3210561089 cites W2950577311 @default.
- W3210561089 hasPublicationYear "2021" @default.
- W3210561089 type Work @default.
- W3210561089 sameAs 3210561089 @default.
- W3210561089 citedByCount "0" @default.
- W3210561089 crossrefType "posted-content" @default.
- W3210561089 hasAuthorship W3210561089A5023914076 @default.
- W3210561089 hasAuthorship W3210561089A5055741193 @default.
- W3210561089 hasConcept C137293760 @default.
- W3210561089 hasConcept C138885662 @default.
- W3210561089 hasConcept C154945302 @default.
- W3210561089 hasConcept C204321447 @default.
- W3210561089 hasConcept C2474386 @default.
- W3210561089 hasConcept C2776527531 @default.
- W3210561089 hasConcept C2777462759 @default.
- W3210561089 hasConcept C41008148 @default.
- W3210561089 hasConcept C41608201 @default.
- W3210561089 hasConcept C41895202 @default.
- W3210561089 hasConcept C532629269 @default.
- W3210561089 hasConcept C90805587 @default.
- W3210561089 hasConceptScore W3210561089C137293760 @default.
- W3210561089 hasConceptScore W3210561089C138885662 @default.
- W3210561089 hasConceptScore W3210561089C154945302 @default.
- W3210561089 hasConceptScore W3210561089C204321447 @default.
- W3210561089 hasConceptScore W3210561089C2474386 @default.
- W3210561089 hasConceptScore W3210561089C2776527531 @default.
- W3210561089 hasConceptScore W3210561089C2777462759 @default.
- W3210561089 hasConceptScore W3210561089C41008148 @default.
- W3210561089 hasConceptScore W3210561089C41608201 @default.
- W3210561089 hasConceptScore W3210561089C41895202 @default.
- W3210561089 hasConceptScore W3210561089C532629269 @default.
- W3210561089 hasConceptScore W3210561089C90805587 @default.
- W3210561089 hasLocation W32105610891 @default.
- W3210561089 hasOpenAccess W3210561089 @default.
- W3210561089 hasPrimaryLocation W32105610891 @default.
- W3210561089 hasRelatedWork W2134006216 @default.
- W3210561089 hasRelatedWork W2188549603 @default.
- W3210561089 hasRelatedWork W2240950607 @default.
- W3210561089 hasRelatedWork W2337341759 @default.
- W3210561089 hasRelatedWork W2389312407 @default.
- W3210561089 hasRelatedWork W2621038649 @default.
- W3210561089 hasRelatedWork W2774789119 @default.
- W3210561089 hasRelatedWork W2782590789 @default.
- W3210561089 hasRelatedWork W2793045447 @default.
- W3210561089 hasRelatedWork W2911453347 @default.
- W3210561089 hasRelatedWork W2933702408 @default.
- W3210561089 hasRelatedWork W2945325836 @default.
- W3210561089 hasRelatedWork W2962045490 @default.
- W3210561089 hasRelatedWork W2970345257 @default.
- W3210561089 hasRelatedWork W3043207155 @default.
- W3210561089 hasRelatedWork W344888238 @default.
- W3210561089 hasRelatedWork W962234857 @default.
- W3210561089 hasRelatedWork W2151355184 @default.
- W3210561089 hasRelatedWork W2241529223 @default.
- W3210561089 hasRelatedWork W2416674666 @default.
- W3210561089 isParatext "false" @default.
- W3210561089 isRetracted "false" @default.
- W3210561089 magId "3210561089" @default.
- W3210561089 workType "article" @default.