Matches in SemOpenAlex for { <https://semopenalex.org/work/W2913694443> ?p ?o ?g. }
- W2913694443 abstract "The problem of imbalanced classes arises frequently in binary classification tasks. If one class outnumbers another, trained classifiers become heavily biased towards the majority class. For phishing URL detection, it is very natural that the number of collected benign URLs (i.e., the majority class) is much larger than the number of collected phishy URLs (i.e., the minority class). Oversampling the minority class can be a powerful tool to overcome this situation. However, existing methods perform the oversampling task in the feature space where the original data format is removed and URLs are succinctly represented by vectors. These methods are successful only if feature definitions are correct and the dataset is diverse and not too sparse. In this paper, we propose an oversampling technique in the data space. We train text generative adversarial networks (text-GANs) with URLs in the minority class and generate synthetic URLs that can be made part of the training set. We crawl a crowd-sourced URL repository to collect recently discovered phishy and benign URLs. Our experiments demonstrate significant performance improvements after using the proposed oversampling technique. Interestingly, some of the original test URLs are exactly regenerated by the proposed text generative model." @default.
- W2913694443 created "2019-02-21" @default.
- W2913694443 creator A5000244424 @default.
- W2913694443 creator A5008133675 @default.
- W2913694443 creator A5029872749 @default.
- W2913694443 creator A5046521217 @default.
- W2913694443 creator A5067253588 @default.
- W2913694443 creator A5087419212 @default.
- W2913694443 date "2018-12-01" @default.
- W2913694443 modified "2023-10-14" @default.
- W2913694443 title "Phishing URL Detection with Oversampling based on Text Generative Adversarial Networks" @default.
- W2913694443 cites W1582036582 @default.
- W2913694443 cites W1983305208 @default.
- W2913694443 cites W1987971958 @default.
- W2913694443 cites W1989957782 @default.
- W2913694443 cites W2012481173 @default.
- W2913694443 cites W2029470356 @default.
- W2913694443 cites W2064675550 @default.
- W2913694443 cites W2121990650 @default.
- W2913694443 cites W2146729596 @default.
- W2913694443 cites W2147203050 @default.
- W2913694443 cites W2156838815 @default.
- W2913694443 cites W2168508521 @default.
- W2913694443 cites W2625935159 @default.
- W2913694443 doi "https://doi.org/10.1109/bigdata.2018.8622547" @default.
- W2913694443 hasPublicationYear "2018" @default.
- W2913694443 type Work @default.
- W2913694443 sameAs 2913694443 @default.
- W2913694443 citedByCount "35" @default.
- W2913694443 countsByYear W29136944432017 @default.
- W2913694443 countsByYear W29136944432019 @default.
- W2913694443 countsByYear W29136944432020 @default.
- W2913694443 countsByYear W29136944432021 @default.
- W2913694443 countsByYear W29136944432022 @default.
- W2913694443 countsByYear W29136944432023 @default.
- W2913694443 crossrefType "proceedings-article" @default.
- W2913694443 hasAuthorship W2913694443A5000244424 @default.
- W2913694443 hasAuthorship W2913694443A5008133675 @default.
- W2913694443 hasAuthorship W2913694443A5029872749 @default.
- W2913694443 hasAuthorship W2913694443A5046521217 @default.
- W2913694443 hasAuthorship W2913694443A5067253588 @default.
- W2913694443 hasAuthorship W2913694443A5087419212 @default.
- W2913694443 hasConcept C110875604 @default.
- W2913694443 hasConcept C119857082 @default.
- W2913694443 hasConcept C124101348 @default.
- W2913694443 hasConcept C127413603 @default.
- W2913694443 hasConcept C136764020 @default.
- W2913694443 hasConcept C138885662 @default.
- W2913694443 hasConcept C154945302 @default.
- W2913694443 hasConcept C177264268 @default.
- W2913694443 hasConcept C197323446 @default.
- W2913694443 hasConcept C199360897 @default.
- W2913694443 hasConcept C201995342 @default.
- W2913694443 hasConcept C23123220 @default.
- W2913694443 hasConcept C2776257435 @default.
- W2913694443 hasConcept C2776401178 @default.
- W2913694443 hasConcept C2777212361 @default.
- W2913694443 hasConcept C2780451532 @default.
- W2913694443 hasConcept C31258907 @default.
- W2913694443 hasConcept C37736160 @default.
- W2913694443 hasConcept C41008148 @default.
- W2913694443 hasConcept C41895202 @default.
- W2913694443 hasConcept C83860907 @default.
- W2913694443 hasConceptScore W2913694443C110875604 @default.
- W2913694443 hasConceptScore W2913694443C119857082 @default.
- W2913694443 hasConceptScore W2913694443C124101348 @default.
- W2913694443 hasConceptScore W2913694443C127413603 @default.
- W2913694443 hasConceptScore W2913694443C136764020 @default.
- W2913694443 hasConceptScore W2913694443C138885662 @default.
- W2913694443 hasConceptScore W2913694443C154945302 @default.
- W2913694443 hasConceptScore W2913694443C177264268 @default.
- W2913694443 hasConceptScore W2913694443C197323446 @default.
- W2913694443 hasConceptScore W2913694443C199360897 @default.
- W2913694443 hasConceptScore W2913694443C201995342 @default.
- W2913694443 hasConceptScore W2913694443C23123220 @default.
- W2913694443 hasConceptScore W2913694443C2776257435 @default.
- W2913694443 hasConceptScore W2913694443C2776401178 @default.
- W2913694443 hasConceptScore W2913694443C2777212361 @default.
- W2913694443 hasConceptScore W2913694443C2780451532 @default.
- W2913694443 hasConceptScore W2913694443C31258907 @default.
- W2913694443 hasConceptScore W2913694443C37736160 @default.
- W2913694443 hasConceptScore W2913694443C41008148 @default.
- W2913694443 hasConceptScore W2913694443C41895202 @default.
- W2913694443 hasConceptScore W2913694443C83860907 @default.
- W2913694443 hasLocation W29136944431 @default.
- W2913694443 hasOpenAccess W2913694443 @default.
- W2913694443 hasPrimaryLocation W29136944431 @default.
- W2913694443 hasRelatedWork W2913694443 @default.
- W2913694443 hasRelatedWork W2963001579 @default.
- W2913694443 hasRelatedWork W2965556534 @default.
- W2913694443 hasRelatedWork W2969781305 @default.
- W2913694443 hasRelatedWork W2981515171 @default.
- W2913694443 hasRelatedWork W3017161950 @default.
- W2913694443 hasRelatedWork W3041381337 @default.
- W2913694443 hasRelatedWork W3156291593 @default.
- W2913694443 hasRelatedWork W3181034584 @default.
- W2913694443 hasRelatedWork W4220812973 @default.
- W2913694443 isParatext "false" @default.
- W2913694443 isRetracted "false" @default.
- W2913694443 magId "2913694443" @default.
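
The abstract in this record describes oversampling in the data space: synthetic minority-class (phishing) URLs are generated as raw strings and added to the training set, rather than interpolating feature vectors. A minimal sketch of that balancing step follows. It is not the authors' implementation. The function names `oversample_minority` and `make_toy_generator` are hypothetical, and the toy generator merely recombines parts of existing minority URLs as a stand-in for the trained text-GAN the abstract refers to.

```python
import random
from typing import Callable, List, Tuple


def oversample_minority(
    majority: List[str],
    minority: List[str],
    generate: Callable[[], str],
    max_attempts_per_sample: int = 20,
) -> List[Tuple[str, int]]:
    """Build a labeled training set (0 = benign, 1 = phishing).

    Synthetic minority URLs from `generate` are added until both classes
    have the same size, or until the attempt budget is exhausted.
    """
    seen = set(minority)
    synthetic: List[str] = []
    needed = max(0, len(majority) - len(minority))
    attempts = 0
    while len(synthetic) < needed and attempts < needed * max_attempts_per_sample:
        attempts += 1
        url = generate()
        if url not in seen:  # keep only URLs not already in the minority set
            seen.add(url)
            synthetic.append(url)
    return [(u, 0) for u in majority] + [(u, 1) for u in minority + synthetic]


def make_toy_generator(minority: List[str], rng: random.Random) -> Callable[[], str]:
    """Hypothetical stand-in for a text generative model (assumption, not a GAN).

    It joins the host of one minority URL with the path of another and
    appends a random numeric suffix.
    """
    def generate() -> str:
        a, b = rng.choice(minority), rng.choice(minority)
        host = a.split("/", 1)[0]
        path = b.split("/", 1)[1] if "/" in b else ""
        return f"{host}/{path}-{rng.randrange(10**6)}"
    return generate


if __name__ == "__main__":
    rng = random.Random(0)
    benign = [f"site{i}.test/home" for i in range(6)]
    phishing = ["a.test/login", "b.test/verify"]
    train = oversample_minority(benign, phishing, make_toy_generator(phishing, rng))
    print(len(train), sum(label for _, label in train))
```

In practice, `generate` would wrap sampling from whatever generative model was trained on the minority-class URLs; the balancing loop itself does not depend on how the strings are produced.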