Matches in SemOpenAlex for { <https://semopenalex.org/work/W3197518744> ?p ?o ?g. }
- W3197518744 abstract "Abstract Topic modeling using models such as Latent Dirichlet Allocation (LDA) is a text mining technique to extract human-readable semantic “topics” (i.e., word clusters) from a corpus of textual documents. In software engineering, topic modeling has been used to analyze textual data in empirical studies (e.g., to find out what developers talk about online), but also to build new techniques to support software engineering tasks (e.g., to support source code comprehension). Topic modeling needs to be applied carefully (e.g., depending on the type of textual data analyzed and modeling parameters). Our study aims at describing how topic modeling has been applied in software engineering research with a focus on four aspects: (1) which topic models and modeling techniques have been applied, (2) which textual inputs have been used for topic modeling, (3) how textual data was “prepared” (i.e., pre-processed) for topic modeling, and (4) how generated topics (i.e., word clusters) were named to give them a human-understandable meaning. We analyzed topic modeling as applied in 111 papers from ten highly-ranked software engineering venues (five journals and five conferences) published between 2009 and 2020. We found that (1) LDA and LDA-based techniques are the most frequent topic modeling techniques, (2) developer communication and bug reports have been modelled most, (3) data pre-processing and modeling parameters vary quite a bit and are often vaguely reported, and (4) manual topic naming (such as deducting names based on frequent words in a topic) is common." @default.
- W3197518744 created "2021-09-13" @default.
- W3197518744 creator A5018823718 @default.
- W3197518744 creator A5034571717 @default.
- W3197518744 creator A5055874096 @default.
- W3197518744 date "2021-09-06" @default.
- W3197518744 modified "2023-10-16" @default.
- W3197518744 title "Topic modeling in software engineering research" @default.
- W3197518744 cites W1024906986 @default.
- W3197518744 cites W1499516075 @default.
- W3197518744 cites W1581639473 @default.
- W3197518744 cites W1902027874 @default.
- W3197518744 cites W1965335252 @default.
- W3197518744 cites W1965484571 @default.
- W3197518744 cites W1965636883 @default.
- W3197518744 cites W1969657187 @default.
- W3197518744 cites W1972978214 @default.
- W3197518744 cites W1975579663 @default.
- W3197518744 cites W1975790660 @default.
- W3197518744 cites W1979991500 @default.
- W3197518744 cites W1981109290 @default.
- W3197518744 cites W1981629032 @default.
- W3197518744 cites W1985266020 @default.
- W3197518744 cites W1991867644 @default.
- W3197518744 cites W1993344876 @default.
- W3197518744 cites W1995875735 @default.
- W3197518744 cites W1996901220 @default.
- W3197518744 cites W1997885138 @default.
- W3197518744 cites W1998130038 @default.
- W3197518744 cites W1999798506 @default.
- W3197518744 cites W2000518393 @default.
- W3197518744 cites W2001082470 @default.
- W3197518744 cites W2002641269 @default.
- W3197518744 cites W2004192095 @default.
- W3197518744 cites W2005098725 @default.
- W3197518744 cites W2009977560 @default.
- W3197518744 cites W2018663431 @default.
- W3197518744 cites W2019985699 @default.
- W3197518744 cites W2022371098 @default.
- W3197518744 cites W2033771212 @default.
- W3197518744 cites W2038043464 @default.
- W3197518744 cites W2040411904 @default.
- W3197518744 cites W2042980227 @default.
- W3197518744 cites W2045837563 @default.
- W3197518744 cites W2046799248 @default.
- W3197518744 cites W2056894403 @default.
- W3197518744 cites W2057587538 @default.
- W3197518744 cites W2064153289 @default.
- W3197518744 cites W2064772995 @default.
- W3197518744 cites W2076219102 @default.
- W3197518744 cites W2076264506 @default.
- W3197518744 cites W2085597081 @default.
- W3197518744 cites W2089468765 @default.
- W3197518744 cites W2089759055 @default.
- W3197518744 cites W2093400716 @default.
- W3197518744 cites W2094662297 @default.
- W3197518744 cites W2101819268 @default.
- W3197518744 cites W2108545456 @default.
- W3197518744 cites W2110068396 @default.
- W3197518744 cites W2112143630 @default.
- W3197518744 cites W2122599295 @default.
- W3197518744 cites W2124672527 @default.
- W3197518744 cites W2135790056 @default.
- W3197518744 cites W2138824333 @default.
- W3197518744 cites W2139543149 @default.
- W3197518744 cites W2140264852 @default.
- W3197518744 cites W2142958724 @default.
- W3197518744 cites W2145700761 @default.
- W3197518744 cites W2147152072 @default.
- W3197518744 cites W2158266063 @default.
- W3197518744 cites W2165612380 @default.
- W3197518744 cites W2168649891 @default.
- W3197518744 cites W2187358498 @default.
- W3197518744 cites W2285477878 @default.
- W3197518744 cites W2287648897 @default.
- W3197518744 cites W2290968742 @default.
- W3197518744 cites W2333884379 @default.
- W3197518744 cites W2335783870 @default.
- W3197518744 cites W2341869196 @default.
- W3197518744 cites W2356036115 @default.
- W3197518744 cites W2397731355 @default.
- W3197518744 cites W2401967267 @default.
- W3197518744 cites W2402702168 @default.
- W3197518744 cites W2417608402 @default.
- W3197518744 cites W2475828198 @default.
- W3197518744 cites W2478191067 @default.
- W3197518744 cites W2483035776 @default.
- W3197518744 cites W2487391445 @default.
- W3197518744 cites W2546780486 @default.
- W3197518744 cites W2546867968 @default.
- W3197518744 cites W2553051037 @default.
- W3197518744 cites W2560766692 @default.
- W3197518744 cites W2565718479 @default.
- W3197518744 cites W2575380243 @default.
- W3197518744 cites W2587676558 @default.
- W3197518744 cites W2593635859 @default.
- W3197518744 cites W2604794021 @default.
- W3197518744 cites W2608489681 @default.
- W3197518744 cites W2611274474 @default.
- W3197518744 cites W2617730062 @default.