Matches in SemOpenAlex for { <https://semopenalex.org/work/W3018521878> ?p ?o ?g. }
- W3018521878 abstract "Most topic models are constructed under the assumption that documents follow a multinomial distribution. The Poisson distribution is an alternative distribution to describe the probability of count data. For topic modelling, the Poisson distribution describes the number of occurrences of a word in documents of fixed length. The Poisson distribution has been successfully applied in text classification, but its application to topic modelling is not well documented, specifically in the context of a generative probabilistic model. Furthermore, the few Poisson topic models in literature are admixture models, making the assumption that a document is generated from a mixture of topics. In this study, we focus on short text. Many studies have shown that the simpler assumption of a mixture model fits short text better. With mixture models, as opposed to admixture models, the generative assumption is that a document is generated from a single topic. One topic model, which makes this one-topic-per-document assumption, is the Dirichlet-multinomial mixture model. The main contributions of this work are a new Gamma-Poisson mixture model, as well as a collapsed Gibbs sampler for the model. The benefit of the collapsed Gibbs sampler derivation is that the model is able to automatically select the number of topics contained in the corpus. The results show that the Gamma-Poisson mixture model performs better than the Dirichlet-multinomial mixture model at selecting the number of topics in labelled corpora. Furthermore, the Gamma-Poisson mixture produces better topic coherence scores than the Dirichlet-multinomial mixture model, thus making it a viable option for the challenging task of topic modelling of short text." @default.
- W3018521878 created "2020-05-01" @default.
- W3018521878 creator A5017856354 @default.
- W3018521878 creator A5059063529 @default.
- W3018521878 creator A5086834621 @default.
- W3018521878 date "2020-04-23" @default.
- W3018521878 modified "2023-09-27" @default.
- W3018521878 title "A Gamma-Poisson Mixture Topic Model for Short Text" @default.
- W3018521878 cites W1714665356 @default.
- W3018521878 cites W1880262756 @default.
- W3018521878 cites W1889660637 @default.
- W3018521878 cites W1986966428 @default.
- W3018521878 cites W2004192095 @default.
- W3018521878 cites W2020999234 @default.
- W3018521878 cites W2031489346 @default.
- W3018521878 cites W2056797132 @default.
- W3018521878 cites W2061922307 @default.
- W3018521878 cites W2063904635 @default.
- W3018521878 cites W2064772995 @default.
- W3018521878 cites W2076219102 @default.
- W3018521878 cites W2097089247 @default.
- W3018521878 cites W2128990575 @default.
- W3018521878 cites W2130339025 @default.
- W3018521878 cites W2131917031 @default.
- W3018521878 cites W2135631383 @default.
- W3018521878 cites W2145677303 @default.
- W3018521878 cites W2151703435 @default.
- W3018521878 cites W2158266063 @default.
- W3018521878 cites W216263296 @default.
- W3018521878 cites W2168332560 @default.
- W3018521878 cites W2170610543 @default.
- W3018521878 cites W2170678468 @default.
- W3018521878 cites W2171836785 @default.
- W3018521878 cites W2178725228 @default.
- W3018521878 cites W2222893162 @default.
- W3018521878 cites W2340381866 @default.
- W3018521878 cites W2516537890 @default.
- W3018521878 cites W2517407755 @default.
- W3018521878 cites W2572380545 @default.
- W3018521878 cites W2745475103 @default.
- W3018521878 cites W2767665719 @default.
- W3018521878 cites W2903606428 @default.
- W3018521878 cites W2969961168 @default.
- W3018521878 cites W3021053579 @default.
- W3018521878 hasPublicationYear "2020" @default.
- W3018521878 type Work @default.
- W3018521878 sameAs 3018521878 @default.
- W3018521878 citedByCount "0" @default.
- W3018521878 crossrefType "posted-content" @default.
- W3018521878 hasAuthorship W3018521878A5017856354 @default.
- W3018521878 hasAuthorship W3018521878A5059063529 @default.
- W3018521878 hasAuthorship W3018521878A5086834621 @default.
- W3018521878 hasConcept C100906024 @default.
- W3018521878 hasConcept C105795698 @default.
- W3018521878 hasConcept C107673813 @default.
- W3018521878 hasConcept C134306372 @default.
- W3018521878 hasConcept C149717495 @default.
- W3018521878 hasConcept C154945302 @default.
- W3018521878 hasConcept C158424031 @default.
- W3018521878 hasConcept C167966045 @default.
- W3018521878 hasConcept C169214877 @default.
- W3018521878 hasConcept C171686336 @default.
- W3018521878 hasConcept C182310444 @default.
- W3018521878 hasConcept C192065140 @default.
- W3018521878 hasConcept C204321447 @default.
- W3018521878 hasConcept C28826006 @default.
- W3018521878 hasConcept C33923547 @default.
- W3018521878 hasConcept C39890363 @default.
- W3018521878 hasConcept C41008148 @default.
- W3018521878 hasConcept C500882744 @default.
- W3018521878 hasConcept C61224824 @default.
- W3018521878 hasConceptScore W3018521878C100906024 @default.
- W3018521878 hasConceptScore W3018521878C105795698 @default.
- W3018521878 hasConceptScore W3018521878C107673813 @default.
- W3018521878 hasConceptScore W3018521878C134306372 @default.
- W3018521878 hasConceptScore W3018521878C149717495 @default.
- W3018521878 hasConceptScore W3018521878C154945302 @default.
- W3018521878 hasConceptScore W3018521878C158424031 @default.
- W3018521878 hasConceptScore W3018521878C167966045 @default.
- W3018521878 hasConceptScore W3018521878C169214877 @default.
- W3018521878 hasConceptScore W3018521878C171686336 @default.
- W3018521878 hasConceptScore W3018521878C182310444 @default.
- W3018521878 hasConceptScore W3018521878C192065140 @default.
- W3018521878 hasConceptScore W3018521878C204321447 @default.
- W3018521878 hasConceptScore W3018521878C28826006 @default.
- W3018521878 hasConceptScore W3018521878C33923547 @default.
- W3018521878 hasConceptScore W3018521878C39890363 @default.
- W3018521878 hasConceptScore W3018521878C41008148 @default.
- W3018521878 hasConceptScore W3018521878C500882744 @default.
- W3018521878 hasConceptScore W3018521878C61224824 @default.
- W3018521878 hasLocation W30185218781 @default.
- W3018521878 hasOpenAccess W3018521878 @default.
- W3018521878 hasPrimaryLocation W30185218781 @default.
- W3018521878 hasRelatedWork W1880262756 @default.
- W3018521878 hasRelatedWork W1994891386 @default.
- W3018521878 hasRelatedWork W1996888654 @default.
- W3018521878 hasRelatedWork W2011859172 @default.
- W3018521878 hasRelatedWork W2016073783 @default.
- W3018521878 hasRelatedWork W2054765427 @default.
- W3018521878 hasRelatedWork W2089421272 @default.
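
The mixture assumption described in the abstract above (each document is generated from a single topic, and word occurrences are Poisson counts) can be illustrated with a minimal generative sketch. This is not the authors' implementation and it does not include the collapsed Gibbs sampler. The function names, the Gamma hyperparameters `shape` and `scale`, the uniform choice of topic per document, and the use of Knuth's method for Poisson sampling are all assumptions made for illustration.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw one Poisson(rate) count using Knuth's method (suitable for small rates)."""
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def generate_corpus(n_docs, n_topics, vocab_size, shape=0.5, scale=1.0, seed=0):
    """Generate word-count vectors under an assumed Gamma-Poisson mixture.

    Hypothetical setup: every topic-word rate is drawn from Gamma(shape, scale),
    and each document picks exactly one topic uniformly at random.
    """
    rng = random.Random(seed)
    # One Poisson rate per (topic, word) pair, drawn from the assumed Gamma prior.
    rates = [
        [rng.gammavariate(shape, scale) for _ in range(vocab_size)]
        for _ in range(n_topics)
    ]
    topics, docs = [], []
    for _ in range(n_docs):
        k = rng.randrange(n_topics)  # single topic per document (mixture, not admixture)
        topics.append(k)
        # Each word's count in the document is an independent Poisson draw.
        docs.append([sample_poisson(rates[k][v], rng) for v in range(vocab_size)])
    return rates, topics, docs


if __name__ == "__main__":
    rates, topics, docs = generate_corpus(n_docs=5, n_topics=2, vocab_size=8)
    for k, counts in zip(topics, docs):
        print(f"topic={k} counts={counts}")
```

An admixture model would instead draw a separate topic for each word or a per-document topic proportion; here the single draw of `k` per document is what encodes the one-topic-per-document assumption.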