Matches in SemOpenAlex for { <https://semopenalex.org/work/W4361004367> ?p ?o ?g. }
- W4361004367 endingPage "60" @default.
- W4361004367 startingPage "60" @default.
- W4361004367 abstract "To proactively mitigate malware threats, cybersecurity tools, such as anti-virus and anti-malware software, as well as firewalls, require frequent updates and proactive implementation. However, processing the vast amounts of dataset examples can be overwhelming when relying solely on traditional methods. In cybersecurity workflows, recent advances in natural language processing (NLP) models can aid in proactively detecting various threats. In this paper, we present a novel approach for representing the relevance and significance of the Malware/Goodware (MG) datasets, through the use of a pre-trained language model called MalBERTv2. Our model is trained on publicly available datasets, with a focus on the source code of the apps by extracting the top-ranked files that present the most relevant information. These files are then passed through a pre-tokenization feature generator, and the resulting keywords are used to train the tokenizer from scratch. Finally, we apply a classifier using bidirectional encoder representations from transformers (BERT) as a layer within the model pipeline. The performance of our model is evaluated on different datasets, achieving a weighted f1 score ranging from 82% to 99%. Our results demonstrate the effectiveness of our approach for proactively detecting malware threats using NLP techniques." @default.
- W4361004367 created "2023-03-30" @default.
- W4361004367 creator A5039930829 @default.
- W4361004367 creator A5076053136 @default.
- W4361004367 date "2023-03-24" @default.
- W4361004367 modified "2023-10-13" @default.
- W4361004367 title "MalBERTv2: Code Aware BERT-Based Model for Malware Identification" @default.
- W4361004367 cites W1851403712 @default.
- W4361004367 cites W1980867644 @default.
- W4361004367 cites W2122672392 @default.
- W4361004367 cites W2131988669 @default.
- W4361004367 cites W2141554582 @default.
- W4361004367 cites W2215444025 @default.
- W4361004367 cites W2493916176 @default.
- W4361004367 cites W2734718015 @default.
- W4361004367 cites W2790843555 @default.
- W4361004367 cites W2921573932 @default.
- W4361004367 cites W2962784628 @default.
- W4361004367 cites W2963250244 @default.
- W4361004367 cites W2963989339 @default.
- W4361004367 cites W2966399854 @default.
- W4361004367 cites W2972818416 @default.
- W4361004367 cites W2974072230 @default.
- W4361004367 cites W2980720901 @default.
- W4361004367 cites W2999309192 @default.
- W4361004367 cites W3002633045 @default.
- W4361004367 cites W3009194787 @default.
- W4361004367 cites W3014018380 @default.
- W4361004367 cites W3026804104 @default.
- W4361004367 cites W3034402928 @default.
- W4361004367 cites W3038275207 @default.
- W4361004367 cites W3047318188 @default.
- W4361004367 cites W3048788384 @default.
- W4361004367 cites W3080622597 @default.
- W4361004367 cites W3081627837 @default.
- W4361004367 cites W3082603384 @default.
- W4361004367 cites W3088714212 @default.
- W4361004367 cites W3094190102 @default.
- W4361004367 cites W3104008133 @default.
- W4361004367 cites W3117169309 @default.
- W4361004367 cites W3137440318 @default.
- W4361004367 cites W3170286125 @default.
- W4361004367 cites W3174361912 @default.
- W4361004367 cites W3181333984 @default.
- W4361004367 cites W3195672100 @default.
- W4361004367 cites W3202852903 @default.
- W4361004367 cites W3207653542 @default.
- W4361004367 cites W4205635956 @default.
- W4361004367 cites W4205876068 @default.
- W4361004367 cites W4206660322 @default.
- W4361004367 cites W4239025696 @default.
- W4361004367 cites W4283214976 @default.
- W4361004367 doi "https://doi.org/10.3390/bdcc7020060" @default.
- W4361004367 hasPublicationYear "2023" @default.
- W4361004367 type Work @default.
- W4361004367 citedByCount "2" @default.
- W4361004367 countsByYear W43610043672023 @default.
- W4361004367 crossrefType "journal-article" @default.
- W4361004367 hasAuthorship W4361004367A5039930829 @default.
- W4361004367 hasAuthorship W4361004367A5076053136 @default.
- W4361004367 hasBestOaLocation W43610043671 @default.
- W4361004367 hasConcept C116834253 @default.
- W4361004367 hasConcept C119857082 @default.
- W4361004367 hasConcept C124101348 @default.
- W4361004367 hasConcept C137293760 @default.
- W4361004367 hasConcept C154945302 @default.
- W4361004367 hasConcept C176982825 @default.
- W4361004367 hasConcept C199360897 @default.
- W4361004367 hasConcept C38652104 @default.
- W4361004367 hasConcept C41008148 @default.
- W4361004367 hasConcept C43126263 @default.
- W4361004367 hasConcept C541664917 @default.
- W4361004367 hasConcept C59822182 @default.
- W4361004367 hasConcept C84525096 @default.
- W4361004367 hasConcept C86803240 @default.
- W4361004367 hasConcept C95623464 @default.
- W4361004367 hasConceptScore W4361004367C116834253 @default.
- W4361004367 hasConceptScore W4361004367C119857082 @default.
- W4361004367 hasConceptScore W4361004367C124101348 @default.
- W4361004367 hasConceptScore W4361004367C137293760 @default.
- W4361004367 hasConceptScore W4361004367C154945302 @default.
- W4361004367 hasConceptScore W4361004367C176982825 @default.
- W4361004367 hasConceptScore W4361004367C199360897 @default.
- W4361004367 hasConceptScore W4361004367C38652104 @default.
- W4361004367 hasConceptScore W4361004367C41008148 @default.
- W4361004367 hasConceptScore W4361004367C43126263 @default.
- W4361004367 hasConceptScore W4361004367C541664917 @default.
- W4361004367 hasConceptScore W4361004367C59822182 @default.
- W4361004367 hasConceptScore W4361004367C84525096 @default.
- W4361004367 hasConceptScore W4361004367C86803240 @default.
- W4361004367 hasConceptScore W4361004367C95623464 @default.
- W4361004367 hasFunder F4320334593 @default.
- W4361004367 hasIssue "2" @default.
- W4361004367 hasLocation W43610043671 @default.
- W4361004367 hasOpenAccess W4361004367 @default.
- W4361004367 hasPrimaryLocation W43610043671 @default.
- W4361004367 hasRelatedWork W1994531415 @default.
- W4361004367 hasRelatedWork W2136255210 @default.