Matches in SemOpenAlex for { <https://semopenalex.org/work/W2113772893> ?p ?o ?g. }
- W2113772893 endingPage "1678" @default.
- W2113772893 startingPage "1663" @default.
- W2113772893 abstract "Latent Dirichlet Allocation (LDA) is a data clustering algorithm that performs especially well for text documents. In natural-language applications it automatically finds groups of related words (called “latent topics”) and clusters the documents into sets that are about the same “topic”. LDA has also been applied to source code, where the documents are natural source code units such as methods or classes, and the words are the keywords, operators, and programmer-defined names in the code. Determining the topic count that most appropriately describes a set of source code documents is an open problem. We address this empirically by constructing clusterings with different numbers of topics for a large number of software systems, and then use a pair of measures based on source code locality and topic model similarity to assess how well the topic structure identifies related source code units. Results suggest that the topic count required can be closely approximated using the number of software code fragments in the system. We extend these results to recommend appropriate topic counts for arbitrary software systems based on an analysis of a set of open source systems. • A method to estimate the appropriate number of latent topics for source code. • Using pairwise relationships between code fragments for conceptual analysis. • Estimating good latent topic counts based on experimental results. • Verification of previous estimates for latent topic counts in source code." @default.
- W2113772893 created "2016-06-24" @default.
- W2113772893 creator A5006135009 @default.
- W2113772893 creator A5054365519 @default.
- W2113772893 creator A5064499489 @default.
- W2113772893 date "2013-09-01" @default.
- W2113772893 modified "2023-10-18" @default.
- W2113772893 title "Using heuristics to estimate an appropriate number of latent topics in source code analysis" @default.
- W2113772893 cites W1539495021 @default.
- W2113772893 cites W1973756582 @default.
- W2113772893 cites W2001082470 @default.
- W2113772893 cites W2017616266 @default.
- W2113772893 cites W2022371098 @default.
- W2113772893 cites W2057870902 @default.
- W2113772893 cites W2077776641 @default.
- W2113772893 cites W2079194772 @default.
- W2113772893 cites W2099741732 @default.
- W2113772893 cites W2101154344 @default.
- W2113772893 cites W2109144580 @default.
- W2113772893 cites W2121866903 @default.
- W2113772893 cites W2128466029 @default.
- W2113772893 cites W2128581098 @default.
- W2113772893 cites W2129559874 @default.
- W2113772893 cites W2130997088 @default.
- W2113772893 cites W2131491818 @default.
- W2113772893 cites W2135541598 @default.
- W2113772893 cites W2138133615 @default.
- W2113772893 cites W2140264852 @default.
- W2113772893 cites W2143818143 @default.
- W2113772893 cites W2152161325 @default.
- W2113772893 cites W2161920802 @default.
- W2113772893 cites W4251218026 @default.
- W2113772893 doi "https://doi.org/10.1016/j.scico.2013.03.015" @default.
- W2113772893 hasPublicationYear "2013" @default.
- W2113772893 type Work @default.
- W2113772893 sameAs 2113772893 @default.
- W2113772893 citedByCount "34" @default.
- W2113772893 countsByYear W21137728932013 @default.
- W2113772893 countsByYear W21137728932014 @default.
- W2113772893 countsByYear W21137728932015 @default.
- W2113772893 countsByYear W21137728932016 @default.
- W2113772893 countsByYear W21137728932017 @default.
- W2113772893 countsByYear W21137728932018 @default.
- W2113772893 countsByYear W21137728932019 @default.
- W2113772893 countsByYear W21137728932020 @default.
- W2113772893 countsByYear W21137728932021 @default.
- W2113772893 countsByYear W21137728932022 @default.
- W2113772893 countsByYear W21137728932023 @default.
- W2113772893 crossrefType "journal-article" @default.
- W2113772893 hasAuthorship W2113772893A5006135009 @default.
- W2113772893 hasAuthorship W2113772893A5054365519 @default.
- W2113772893 hasAuthorship W2113772893A5064499489 @default.
- W2113772893 hasBestOaLocation W21137728932 @default.
- W2113772893 hasConcept C111919701 @default.
- W2113772893 hasConcept C124101348 @default.
- W2113772893 hasConcept C127705205 @default.
- W2113772893 hasConcept C137287247 @default.
- W2113772893 hasConcept C150292731 @default.
- W2113772893 hasConcept C154945302 @default.
- W2113772893 hasConcept C171686336 @default.
- W2113772893 hasConcept C177264268 @default.
- W2113772893 hasConcept C184898388 @default.
- W2113772893 hasConcept C199360897 @default.
- W2113772893 hasConcept C23123220 @default.
- W2113772893 hasConcept C2776760102 @default.
- W2113772893 hasConcept C2777904410 @default.
- W2113772893 hasConcept C41008148 @default.
- W2113772893 hasConcept C43126263 @default.
- W2113772893 hasConcept C500882744 @default.
- W2113772893 hasConcept C529173508 @default.
- W2113772893 hasConcept C73555534 @default.
- W2113772893 hasConcept C80444323 @default.
- W2113772893 hasConceptScore W2113772893C111919701 @default.
- W2113772893 hasConceptScore W2113772893C124101348 @default.
- W2113772893 hasConceptScore W2113772893C127705205 @default.
- W2113772893 hasConceptScore W2113772893C137287247 @default.
- W2113772893 hasConceptScore W2113772893C150292731 @default.
- W2113772893 hasConceptScore W2113772893C154945302 @default.
- W2113772893 hasConceptScore W2113772893C171686336 @default.
- W2113772893 hasConceptScore W2113772893C177264268 @default.
- W2113772893 hasConceptScore W2113772893C184898388 @default.
- W2113772893 hasConceptScore W2113772893C199360897 @default.
- W2113772893 hasConceptScore W2113772893C23123220 @default.
- W2113772893 hasConceptScore W2113772893C2776760102 @default.
- W2113772893 hasConceptScore W2113772893C2777904410 @default.
- W2113772893 hasConceptScore W2113772893C41008148 @default.
- W2113772893 hasConceptScore W2113772893C43126263 @default.
- W2113772893 hasConceptScore W2113772893C500882744 @default.
- W2113772893 hasConceptScore W2113772893C529173508 @default.
- W2113772893 hasConceptScore W2113772893C73555534 @default.
- W2113772893 hasConceptScore W2113772893C80444323 @default.
- W2113772893 hasIssue "9" @default.
- W2113772893 hasLocation W21137728931 @default.
- W2113772893 hasLocation W21137728932 @default.
- W2113772893 hasOpenAccess W2113772893 @default.
- W2113772893 hasPrimaryLocation W21137728931 @default.
- W2113772893 hasRelatedWork W1484035946 @default.
- W2113772893 hasRelatedWork W1522113126 @default.