Matches in SemOpenAlex for { <https://semopenalex.org/work/W4280497323> ?p ?o ?g. }
Showing items 1 to 99 of
99
with 100 items per page.
- W4280497323 abstract "The article describes the results of the research, the purpose of which was to evaluate the influence of linguistic preprocessing on the interpretability of topic models for literary texts. The study was carried out as part of a large project aimed to obtain topic models of Russian short stories written in the first three decades of the 20th century and divided into three successive historical periods: 1) the period of the beginning of the century before the First World War (1900-1913), 2) the time of acute social cataclysms, wars and revolutions (World War I, the February and October revolutions, and the Civil War) (1914-1922), and 3) the early Soviet period (1923-1930). The material of the study was 3 samples of different sizes for each period, containing 100, 500 and 1000 short stories each. Preprocessing included lemmatization using spaCy library and four POS-filtering options: 1) nouns only, 2) nouns and verbs, 3) nouns, adjectives, adverbs, verbs, and 4) no filtering. Using the latent Dirichlet allocation (LDA), 36 topic models were built (9 models for each preprocessing option). The research showed that in case of literary texts topic models built without any POS filters are the most interpretable. The study made it possible to obtain information about topic diversity of Russian short stories, to assess their expert interpretability, and to offer some recommendations for optimizing topic modeling, which are to be used in the development of artificial intelligence systems that process large volumes of literary texts." @default.
- W4280497323 created "2022-05-22" @default.
- W4280497323 creator A5013254483 @default.
- W4280497323 creator A5019851673 @default.
- W4280497323 creator A5034229112 @default.
- W4280497323 creator A5062001654 @default.
- W4280497323 creator A5063640251 @default.
- W4280497323 creator A5078241249 @default.
- W4280497323 creator A5081250994 @default.
- W4280497323 creator A5082403541 @default.
- W4280497323 date "2022-04-27" @default.
- W4280497323 modified "2023-09-30" @default.
- W4280497323 title "Topic Modeling of Literary Texts Using LDA: on the Influence of Linguistic Preprocessing on Model Interpretability" @default.
- W4280497323 cites W1986937683 @default.
- W4280497323 cites W1988361130 @default.
- W4280497323 cites W2038043464 @default.
- W4280497323 cites W2174706414 @default.
- W4280497323 cites W2341256577 @default.
- W4280497323 cites W2778273717 @default.
- W4280497323 cites W2807488797 @default.
- W4280497323 cites W2921304755 @default.
- W4280497323 cites W2963104156 @default.
- W4280497323 cites W2967499952 @default.
- W4280497323 cites W3043222912 @default.
- W4280497323 cites W3092122204 @default.
- W4280497323 cites W3165515127 @default.
- W4280497323 cites W3190710204 @default.
- W4280497323 cites W3202421880 @default.
- W4280497323 cites W3206449250 @default.
- W4280497323 cites W4210595697 @default.
- W4280497323 doi "https://doi.org/10.23919/fruct54823.2022.9770887" @default.
- W4280497323 hasPublicationYear "2022" @default.
- W4280497323 type Work @default.
- W4280497323 citedByCount "1" @default.
- W4280497323 countsByYear W42804973232023 @default.
- W4280497323 crossrefType "proceedings-article" @default.
- W4280497323 hasAuthorship W4280497323A5013254483 @default.
- W4280497323 hasAuthorship W4280497323A5019851673 @default.
- W4280497323 hasAuthorship W4280497323A5034229112 @default.
- W4280497323 hasAuthorship W4280497323A5062001654 @default.
- W4280497323 hasAuthorship W4280497323A5063640251 @default.
- W4280497323 hasAuthorship W4280497323A5078241249 @default.
- W4280497323 hasAuthorship W4280497323A5081250994 @default.
- W4280497323 hasAuthorship W4280497323A5082403541 @default.
- W4280497323 hasConcept C107038049 @default.
- W4280497323 hasConcept C121934690 @default.
- W4280497323 hasConcept C138885662 @default.
- W4280497323 hasConcept C142362112 @default.
- W4280497323 hasConcept C154945302 @default.
- W4280497323 hasConcept C161831844 @default.
- W4280497323 hasConcept C171686336 @default.
- W4280497323 hasConcept C18903297 @default.
- W4280497323 hasConcept C204321447 @default.
- W4280497323 hasConcept C2777759810 @default.
- W4280497323 hasConcept C2781067378 @default.
- W4280497323 hasConcept C2781291010 @default.
- W4280497323 hasConcept C34736171 @default.
- W4280497323 hasConcept C41008148 @default.
- W4280497323 hasConcept C41895202 @default.
- W4280497323 hasConcept C46757340 @default.
- W4280497323 hasConcept C500882744 @default.
- W4280497323 hasConcept C86803240 @default.
- W4280497323 hasConcept C95457728 @default.
- W4280497323 hasConceptScore W4280497323C107038049 @default.
- W4280497323 hasConceptScore W4280497323C121934690 @default.
- W4280497323 hasConceptScore W4280497323C138885662 @default.
- W4280497323 hasConceptScore W4280497323C142362112 @default.
- W4280497323 hasConceptScore W4280497323C154945302 @default.
- W4280497323 hasConceptScore W4280497323C161831844 @default.
- W4280497323 hasConceptScore W4280497323C171686336 @default.
- W4280497323 hasConceptScore W4280497323C18903297 @default.
- W4280497323 hasConceptScore W4280497323C204321447 @default.
- W4280497323 hasConceptScore W4280497323C2777759810 @default.
- W4280497323 hasConceptScore W4280497323C2781067378 @default.
- W4280497323 hasConceptScore W4280497323C2781291010 @default.
- W4280497323 hasConceptScore W4280497323C34736171 @default.
- W4280497323 hasConceptScore W4280497323C41008148 @default.
- W4280497323 hasConceptScore W4280497323C41895202 @default.
- W4280497323 hasConceptScore W4280497323C46757340 @default.
- W4280497323 hasConceptScore W4280497323C500882744 @default.
- W4280497323 hasConceptScore W4280497323C86803240 @default.
- W4280497323 hasConceptScore W4280497323C95457728 @default.
- W4280497323 hasFunder F4320324261 @default.
- W4280497323 hasLocation W42804973231 @default.
- W4280497323 hasOpenAccess W4280497323 @default.
- W4280497323 hasPrimaryLocation W42804973231 @default.
- W4280497323 hasRelatedWork W142374489 @default.
- W4280497323 hasRelatedWork W1585034923 @default.
- W4280497323 hasRelatedWork W1601638723 @default.
- W4280497323 hasRelatedWork W2084064745 @default.
- W4280497323 hasRelatedWork W2285811659 @default.
- W4280497323 hasRelatedWork W2397390177 @default.
- W4280497323 hasRelatedWork W2510713142 @default.
- W4280497323 hasRelatedWork W4280497323 @default.
- W4280497323 hasRelatedWork W4361864099 @default.
- W4280497323 hasRelatedWork W2610669109 @default.
- W4280497323 isParatext "false" @default.
- W4280497323 isRetracted "false" @default.
- W4280497323 workType "article" @default.