Matches in SemOpenAlex for { <https://semopenalex.org/work/W4285325872> ?p ?o ?g. }
Showing items 1 to 79 of 79 with 100 items per page.
- W4285325872 abstract "The increasing use of the Internet in everyday life and the huge number of users lead to a growing number of malicious websites, whose aim is to damage a computer system or compromise data without the owner’s consent. In this paper we propose a Natural Language Processing (NLP) model, called Malware Website Detector (MalDeWe), for malware web page identification that is trained on a domain-specific corpus. The major goal of this model is to transfer English word knowledge from the pre-trained model RoBERTa to a dataset of JavaScript code that includes the website’s text context as well as certain JavaScript expressions. With this model, we obtain a ROC AUC score of 0.95. Therefore, we can conclude that our model performs well at identifying malicious web pages. On the other hand, we may infer that one of the most essential aspects to consider while training a classification model is dataset balance. Whereas the model trained on the initial unbalanced dataset failed to detect harmful websites, the model trained on the balanced dataset correctly identified 95% of dangerous websites. It follows that the metrics used for model evaluation are critical. Therefore, we recommend using the ROC AUC score, Recall, Precision, and Confusion matrix as evaluation metrics. So, in this paper we propose a new NLP model, and we discuss why the choice of evaluation metrics is important and how dataset balance affects model efficacy." @default.
- W4285325872 created "2022-07-14" @default.
- W4285325872 creator A5009526053 @default.
- W4285325872 creator A5046161546 @default.
- W4285325872 creator A5051889893 @default.
- W4285325872 date "2021-12-01" @default.
- W4285325872 modified "2023-10-16" @default.
- W4285325872 title "MalDeWe: New Malware Website Detector Model based on Natural Language Processing using Balanced Dataset" @default.
- W4285325872 cites W1538001410 @default.
- W4285325872 cites W179875071 @default.
- W4285325872 cites W2100460448 @default.
- W4285325872 cites W2146729596 @default.
- W4285325872 cites W2160289821 @default.
- W4285325872 cites W2402268235 @default.
- W4285325872 cites W2553660799 @default.
- W4285325872 cites W2786906486 @default.
- W4285325872 cites W2789876780 @default.
- W4285325872 cites W2793336985 @default.
- W4285325872 cites W2909737018 @default.
- W4285325872 cites W2913334908 @default.
- W4285325872 doi "https://doi.org/10.1109/csci54926.2021.00043" @default.
- W4285325872 hasPublicationYear "2021" @default.
- W4285325872 type Work @default.
- W4285325872 citedByCount "2" @default.
- W4285325872 countsByYear W42853258722023 @default.
- W4285325872 crossrefType "proceedings-article" @default.
- W4285325872 hasAuthorship W4285325872A5009526053 @default.
- W4285325872 hasAuthorship W4285325872A5046161546 @default.
- W4285325872 hasAuthorship W4285325872A5051889893 @default.
- W4285325872 hasConcept C110875604 @default.
- W4285325872 hasConcept C116834253 @default.
- W4285325872 hasConcept C119857082 @default.
- W4285325872 hasConcept C136764020 @default.
- W4285325872 hasConcept C137293760 @default.
- W4285325872 hasConcept C148524875 @default.
- W4285325872 hasConcept C151730666 @default.
- W4285325872 hasConcept C154945302 @default.
- W4285325872 hasConcept C204321447 @default.
- W4285325872 hasConcept C23123220 @default.
- W4285325872 hasConcept C2779343474 @default.
- W4285325872 hasConcept C38652104 @default.
- W4285325872 hasConcept C41008148 @default.
- W4285325872 hasConcept C541664917 @default.
- W4285325872 hasConcept C544833334 @default.
- W4285325872 hasConcept C59822182 @default.
- W4285325872 hasConcept C86803240 @default.
- W4285325872 hasConceptScore W4285325872C110875604 @default.
- W4285325872 hasConceptScore W4285325872C116834253 @default.
- W4285325872 hasConceptScore W4285325872C119857082 @default.
- W4285325872 hasConceptScore W4285325872C136764020 @default.
- W4285325872 hasConceptScore W4285325872C137293760 @default.
- W4285325872 hasConceptScore W4285325872C148524875 @default.
- W4285325872 hasConceptScore W4285325872C151730666 @default.
- W4285325872 hasConceptScore W4285325872C154945302 @default.
- W4285325872 hasConceptScore W4285325872C204321447 @default.
- W4285325872 hasConceptScore W4285325872C23123220 @default.
- W4285325872 hasConceptScore W4285325872C2779343474 @default.
- W4285325872 hasConceptScore W4285325872C38652104 @default.
- W4285325872 hasConceptScore W4285325872C41008148 @default.
- W4285325872 hasConceptScore W4285325872C541664917 @default.
- W4285325872 hasConceptScore W4285325872C544833334 @default.
- W4285325872 hasConceptScore W4285325872C59822182 @default.
- W4285325872 hasConceptScore W4285325872C86803240 @default.
- W4285325872 hasLocation W42853258721 @default.
- W4285325872 hasOpenAccess W4285325872 @default.
- W4285325872 hasPrimaryLocation W42853258721 @default.
- W4285325872 hasRelatedWork W2359001871 @default.
- W4285325872 hasRelatedWork W2620763085 @default.
- W4285325872 hasRelatedWork W2947903144 @default.
- W4285325872 hasRelatedWork W2968586400 @default.
- W4285325872 hasRelatedWork W3005154454 @default.
- W4285325872 hasRelatedWork W3102852402 @default.
- W4285325872 hasRelatedWork W3194539120 @default.
- W4285325872 hasRelatedWork W4313549251 @default.
- W4285325872 hasRelatedWork W4381956280 @default.
- W4285325872 hasRelatedWork W4386214304 @default.
- W4285325872 isParatext "false" @default.
- W4285325872 isRetracted "false" @default.
- W4285325872 workType "article" @default.
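
The abstract above recommends the ROC AUC score, Recall, Precision, and the Confusion matrix as evaluation metrics and notes that a model trained on unbalanced data failed to detect harmful websites. The following is a minimal sketch of why those metrics matter on unbalanced labels. It is not the authors' code: the function names (`confusion_counts`, `precision_recall`, `roc_auc`) and the toy labels are hypothetical, and the AUC is computed with the standard pairwise-ranking definition rather than any particular library.

```python
from typing import Sequence, Tuple


def confusion_counts(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) for binary labels, where 1 = malicious."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def precision_recall(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Precision and recall; an undefined ratio is reported as 0.0 by convention."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a correct pair.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("ROC AUC needs at least one positive and one negative sample")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 95 benign pages (0) and 5 malicious pages (1).
y_true = [0] * 95 + [1] * 5
# A degenerate classifier that labels every page benign.
y_pred = [0] * 100

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
accuracy = (tp + tn) / len(y_true)
precision, recall = precision_recall(y_true, y_pred)
print(f"accuracy={accuracy:.2f} precision={precision:.2f} recall={recall:.2f}")
```

On this toy data the degenerate classifier reaches high accuracy while its recall on the malicious class is zero, which is the kind of failure that accuracy alone hides and that recall and the confusion matrix expose.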