Matches in SemOpenAlex for { <https://semopenalex.org/work/W4297103140> ?p ?o ?g. }
Showing items 1 to 78 of
78
with 100 items per page.
- W4297103140 abstract "Topic Modeling (TM) is among the most exploited approaches to extracting and organizing information from large amounts of data. Basically, these approaches aim to find semantic topics from textual documents (e.g., product reviews, tweets). Despite the good results of these approaches in English texts, we do not observe the same semantic quality when applied in Portuguese Texts since they are more verbose, presenting varied and complex verb conjugations and many homonyms, among other specific particularities. This work intends to fill this scientific gap by exploiting and evaluating different Topic Modeling Pre-processing Pipelines for Portuguese texts, which correspond to sequences of tasks that needed to be performed before the TM strategies. More specifically, we evaluate different pre-processing pipeline configurations using different semantic data representations to overcome the challenges faced by TM strategies in Portuguese Text. In our experimentation evaluation, considering two datasets collected from Twitter and Reddit related to Brazilian political discussion, we show that our proposed extended pre-processing pipeline, especially considering semantic representations, can achieve significant gains in effectiveness when compared to the TM approaches originally proposed for English texts (up to 9x better)." @default.
- W4297103140 created "2022-09-27" @default.
- W4297103140 creator A5016932986 @default.
- W4297103140 creator A5018690278 @default.
- W4297103140 creator A5021846291 @default.
- W4297103140 creator A5057646226 @default.
- W4297103140 creator A5067532521 @default.
- W4297103140 creator A5079814920 @default.
- W4297103140 date "2022-11-07" @default.
- W4297103140 modified "2023-10-16" @default.
- W4297103140 title "Evaluating Topic Modeling Pre-processing Pipelines for Portuguese Texts" @default.
- W4297103140 cites W1916023682 @default.
- W4297103140 cites W2012490931 @default.
- W4297103140 cites W2014545475 @default.
- W4297103140 cites W2341256577 @default.
- W4297103140 cites W2461271816 @default.
- W4297103140 cites W2562617836 @default.
- W4297103140 cites W2745475103 @default.
- W4297103140 cites W2788615138 @default.
- W4297103140 cites W2789440487 @default.
- W4297103140 cites W2797779452 @default.
- W4297103140 cites W2896006963 @default.
- W4297103140 cites W2907254481 @default.
- W4297103140 cites W2980053992 @default.
- W4297103140 cites W2980140466 @default.
- W4297103140 cites W3006948163 @default.
- W4297103140 cites W3014172561 @default.
- W4297103140 cites W3022371575 @default.
- W4297103140 cites W3034608479 @default.
- W4297103140 doi "https://doi.org/10.1145/3539637.3557052" @default.
- W4297103140 hasPublicationYear "2022" @default.
- W4297103140 type Work @default.
- W4297103140 citedByCount "1" @default.
- W4297103140 crossrefType "proceedings-article" @default.
- W4297103140 hasAuthorship W4297103140A5016932986 @default.
- W4297103140 hasAuthorship W4297103140A5018690278 @default.
- W4297103140 hasAuthorship W4297103140A5021846291 @default.
- W4297103140 hasAuthorship W4297103140A5057646226 @default.
- W4297103140 hasAuthorship W4297103140A5067532521 @default.
- W4297103140 hasAuthorship W4297103140A5079814920 @default.
- W4297103140 hasConcept C138885662 @default.
- W4297103140 hasConcept C154945302 @default.
- W4297103140 hasConcept C184337299 @default.
- W4297103140 hasConcept C199360897 @default.
- W4297103140 hasConcept C204321447 @default.
- W4297103140 hasConcept C23123220 @default.
- W4297103140 hasConcept C2522767166 @default.
- W4297103140 hasConcept C35219183 @default.
- W4297103140 hasConcept C41008148 @default.
- W4297103140 hasConcept C41895202 @default.
- W4297103140 hasConcept C43521106 @default.
- W4297103140 hasConceptScore W4297103140C138885662 @default.
- W4297103140 hasConceptScore W4297103140C154945302 @default.
- W4297103140 hasConceptScore W4297103140C184337299 @default.
- W4297103140 hasConceptScore W4297103140C199360897 @default.
- W4297103140 hasConceptScore W4297103140C204321447 @default.
- W4297103140 hasConceptScore W4297103140C23123220 @default.
- W4297103140 hasConceptScore W4297103140C2522767166 @default.
- W4297103140 hasConceptScore W4297103140C35219183 @default.
- W4297103140 hasConceptScore W4297103140C41008148 @default.
- W4297103140 hasConceptScore W4297103140C41895202 @default.
- W4297103140 hasConceptScore W4297103140C43521106 @default.
- W4297103140 hasLocation W42971031401 @default.
- W4297103140 hasOpenAccess W4297103140 @default.
- W4297103140 hasPrimaryLocation W42971031401 @default.
- W4297103140 hasRelatedWork W1964944391 @default.
- W4297103140 hasRelatedWork W1990120252 @default.
- W4297103140 hasRelatedWork W2317321741 @default.
- W4297103140 hasRelatedWork W2625894112 @default.
- W4297103140 hasRelatedWork W2899467429 @default.
- W4297103140 hasRelatedWork W2912615426 @default.
- W4297103140 hasRelatedWork W4299785812 @default.
- W4297103140 hasRelatedWork W4322721676 @default.
- W4297103140 hasRelatedWork W563704311 @default.
- W4297103140 hasRelatedWork W581688671 @default.
- W4297103140 isParatext "false" @default.
- W4297103140 isRetracted "false" @default.
- W4297103140 workType "article" @default.