Matches in SemOpenAlex for { <https://semopenalex.org/work/W4308589027> ?p ?o ?g. }
- W4308589027 abstract "Abstract In programming learning environments, the pressure of delivering many programming assignments makes plagiarism the easiest solution. This highly threatens the learning process; therefore, the need of an automatic, fast, and accurate detection of source code plagiarism becomes essential. To detect whether a pair of Java files is plagiarized, this paper proposes four classification feature sets: (i) structural histogram features, histogram-based features for summarizing similarity matrices; (ii) lexical per-class features, extracted from a lexical similarity matrix between the classes of the two compared files based on character 3-grams; (iii) structural counting features, twelve counting features representing the code structure; and (iv) modified original features: a set of modifications on the features of the used baseline. The results show that the best feature sets in F-measure are the structural histogram features and the lexical per-class features combined, which improve the F-measure by 4% compared to the baseline. The added features slow down the execution time. However, it is still efficient, given that it can classify 70k pairs in 23 min. In addition, we partially re-annotated the SOurce COde Re-use dataset. After the re-annotation, the F-measure of both the baseline and our work is improved, and our work achieves an F-measure of 93.6%, which is 7.5% higher than the new F-measure of the baseline. In addition, some remarks and recommendations are provided for using the SOurce COde Re-use dataset as a benchmark." @default.
- W4308589027 created "2022-11-12" @default.
- W4308589027 creator A5001899728 @default.
- W4308589027 creator A5013978386 @default.
- W4308589027 creator A5021627580 @default.
- W4308589027 creator A5066803745 @default.
- W4308589027 date "2022-11-08" @default.
- W4308589027 modified "2023-09-30" @default.
- W4308589027 title "Classification feature sets for source code plagiarism detection in Java" @default.
- W4308589027 cites W1971922616 @default.
- W4308589027 cites W1985205072 @default.
- W4308589027 cites W2010532254 @default.
- W4308589027 cites W2015614877 @default.
- W4308589027 cites W2066007431 @default.
- W4308589027 cites W2083278176 @default.
- W4308589027 cites W2107697055 @default.
- W4308589027 cites W2126359798 @default.
- W4308589027 cites W2141128179 @default.
- W4308589027 cites W2243493889 @default.
- W4308589027 cites W2248990385 @default.
- W4308589027 cites W2588341850 @default.
- W4308589027 cites W2609731675 @default.
- W4308589027 cites W2715879671 @default.
- W4308589027 cites W2756118627 @default.
- W4308589027 cites W2890485822 @default.
- W4308589027 cites W2897836315 @default.
- W4308589027 cites W2922712072 @default.
- W4308589027 cites W3088680525 @default.
- W4308589027 doi "https://doi.org/10.1186/s44147-022-00155-8" @default.
- W4308589027 hasPublicationYear "2022" @default.
- W4308589027 type Work @default.
- W4308589027 citedByCount "0" @default.
- W4308589027 crossrefType "journal-article" @default.
- W4308589027 hasAuthorship W4308589027A5001899728 @default.
- W4308589027 hasAuthorship W4308589027A5013978386 @default.
- W4308589027 hasAuthorship W4308589027A5021627580 @default.
- W4308589027 hasAuthorship W4308589027A5066803745 @default.
- W4308589027 hasBestOaLocation W43085890271 @default.
- W4308589027 hasConcept C103278499 @default.
- W4308589027 hasConcept C115961682 @default.
- W4308589027 hasConcept C124101348 @default.
- W4308589027 hasConcept C13280743 @default.
- W4308589027 hasConcept C138885662 @default.
- W4308589027 hasConcept C153180895 @default.
- W4308589027 hasConcept C154945302 @default.
- W4308589027 hasConcept C17426736 @default.
- W4308589027 hasConcept C177264268 @default.
- W4308589027 hasConcept C185798385 @default.
- W4308589027 hasConcept C199360897 @default.
- W4308589027 hasConcept C205649164 @default.
- W4308589027 hasConcept C2776401178 @default.
- W4308589027 hasConcept C2776517306 @default.
- W4308589027 hasConcept C2776760102 @default.
- W4308589027 hasConcept C2777212361 @default.
- W4308589027 hasConcept C2780009758 @default.
- W4308589027 hasConcept C41008148 @default.
- W4308589027 hasConcept C41895202 @default.
- W4308589027 hasConcept C43126263 @default.
- W4308589027 hasConcept C53533937 @default.
- W4308589027 hasConcept C548217200 @default.
- W4308589027 hasConceptScore W4308589027C103278499 @default.
- W4308589027 hasConceptScore W4308589027C115961682 @default.
- W4308589027 hasConceptScore W4308589027C124101348 @default.
- W4308589027 hasConceptScore W4308589027C13280743 @default.
- W4308589027 hasConceptScore W4308589027C138885662 @default.
- W4308589027 hasConceptScore W4308589027C153180895 @default.
- W4308589027 hasConceptScore W4308589027C154945302 @default.
- W4308589027 hasConceptScore W4308589027C17426736 @default.
- W4308589027 hasConceptScore W4308589027C177264268 @default.
- W4308589027 hasConceptScore W4308589027C185798385 @default.
- W4308589027 hasConceptScore W4308589027C199360897 @default.
- W4308589027 hasConceptScore W4308589027C205649164 @default.
- W4308589027 hasConceptScore W4308589027C2776401178 @default.
- W4308589027 hasConceptScore W4308589027C2776517306 @default.
- W4308589027 hasConceptScore W4308589027C2776760102 @default.
- W4308589027 hasConceptScore W4308589027C2777212361 @default.
- W4308589027 hasConceptScore W4308589027C2780009758 @default.
- W4308589027 hasConceptScore W4308589027C41008148 @default.
- W4308589027 hasConceptScore W4308589027C41895202 @default.
- W4308589027 hasConceptScore W4308589027C43126263 @default.
- W4308589027 hasConceptScore W4308589027C53533937 @default.
- W4308589027 hasConceptScore W4308589027C548217200 @default.
- W4308589027 hasIssue "1" @default.
- W4308589027 hasLocation W43085890271 @default.
- W4308589027 hasLocation W43085890272 @default.
- W4308589027 hasOpenAccess W4308589027 @default.
- W4308589027 hasPrimaryLocation W43085890271 @default.
- W4308589027 hasRelatedWork W1977863971 @default.
- W4308589027 hasRelatedWork W2040719874 @default.
- W4308589027 hasRelatedWork W2066259560 @default.
- W4308589027 hasRelatedWork W2113226963 @default.
- W4308589027 hasRelatedWork W2134786086 @default.
- W4308589027 hasRelatedWork W2172836935 @default.
- W4308589027 hasRelatedWork W2358805260 @default.
- W4308589027 hasRelatedWork W2363530787 @default.
- W4308589027 hasRelatedWork W2990472155 @default.
- W4308589027 hasRelatedWork W2181817726 @default.
- W4308589027 hasVolume "69" @default.
- W4308589027 isParatext "false" @default.
- W4308589027 isRetracted "false" @default.