Matches in SemOpenAlex for { <https://semopenalex.org/work/W2104290684> ?p ?o ?g. }
- W2104290684 endingPage "2698" @default.
- W2104290684 startingPage "2673" @default.
- W2104290684 abstract "Spam filtering poses a special problem in text categorization, of which the defining characteristic is that filters face an active adversary, which constantly attempts to evade filtering. Since spam evolves continuously and most practical applications are based on online user feedback, the task calls for fast, incremental and robust learning algorithms. In this paper, we investigate a novel approach to spam filtering based on adaptive statistical data compression models. The nature of these models allows them to be employed as probabilistic text classifiers based on character-level or binary sequences. By modeling messages as sequences, tokenization and other error-prone preprocessing steps are omitted altogether, resulting in a method that is very robust. The models are also fast to construct and incrementally updateable. We evaluate the filtering performance of two different compression algorithms: dynamic Markov compression and prediction by partial matching. The results of our empirical evaluation indicate that compression models outperform currently established spam filters, as well as a number of methods proposed in previous studies." @default.
- W2104290684 created "2016-06-24" @default.
- W2104290684 creator A5016388891 @default.
- W2104290684 creator A5016776505 @default.
- W2104290684 creator A5053155811 @default.
- W2104290684 creator A5073792028 @default.
- W2104290684 creator A5084091807 @default.
- W2104290684 date "2006-12-01" @default.
- W2104290684 modified "2023-10-18" @default.
- W2104290684 title "Spam Filtering Using Statistical Data Compression Models" @default.
- W2104290684 cites W1490796714 @default.
- W2104290684 cites W1504008138 @default.
- W2104290684 cites W1522057547 @default.
- W2104290684 cites W1550206324 @default.
- W2104290684 cites W1571468614 @default.
- W2104290684 cites W1574862351 @default.
- W2104290684 cites W1576189261 @default.
- W2104290684 cites W1585280831 @default.
- W2104290684 cites W1594013288 @default.
- W2104290684 cites W1605925311 @default.
- W2104290684 cites W164961562 @default.
- W2104290684 cites W1904228841 @default.
- W2104290684 cites W2033672007 @default.
- W2104290684 cites W2042961901 @default.
- W2104290684 cites W2054658115 @default.
- W2104290684 cites W2068782468 @default.
- W2104290684 cites W2072994259 @default.
- W2104290684 cites W2089319476 @default.
- W2104290684 cites W2089923329 @default.
- W2104290684 cites W2099606292 @default.
- W2104290684 cites W2101694047 @default.
- W2104290684 cites W2102098892 @default.
- W2104290684 cites W2107219285 @default.
- W2104290684 cites W2108313281 @default.
- W2104290684 cites W2116091861 @default.
- W2104290684 cites W2120011452 @default.
- W2104290684 cites W2132119275 @default.
- W2104290684 cites W2137012911 @default.
- W2104290684 cites W2139440668 @default.
- W2104290684 cites W2144794905 @default.
- W2104290684 cites W2148381206 @default.
- W2104290684 cites W2149741699 @default.
- W2104290684 cites W2151752770 @default.
- W2104290684 cites W2161628678 @default.
- W2104290684 cites W2163294786 @default.
- W2104290684 cites W2165340347 @default.
- W2104290684 cites W2166064672 @default.
- W2104290684 cites W2171622762 @default.
- W2104290684 cites W2171886309 @default.
- W2104290684 cites W2591589908 @default.
- W2104290684 cites W2916045930 @default.
- W2104290684 cites W3102372265 @default.
- W2104290684 cites W39304953 @default.
- W2104290684 cites W40510259 @default.
- W2104290684 cites W71139061 @default.
- W2104290684 cites W87844232 @default.
- W2104290684 cites W2141828330 @default.
- W2104290684 doi "https://doi.org/10.5555/1248547.1248644" @default.
- W2104290684 hasPublicationYear "2006" @default.
- W2104290684 type Work @default.
- W2104290684 sameAs 2104290684 @default.
- W2104290684 citedByCount "118" @default.
- W2104290684 countsByYear W21042906842012 @default.
- W2104290684 countsByYear W21042906842013 @default.
- W2104290684 countsByYear W21042906842014 @default.
- W2104290684 countsByYear W21042906842015 @default.
- W2104290684 countsByYear W21042906842016 @default.
- W2104290684 countsByYear W21042906842017 @default.
- W2104290684 countsByYear W21042906842018 @default.
- W2104290684 countsByYear W21042906842019 @default.
- W2104290684 countsByYear W21042906842020 @default.
- W2104290684 countsByYear W21042906842021 @default.
- W2104290684 countsByYear W21042906842022 @default.
- W2104290684 countsByYear W21042906842023 @default.
- W2104290684 crossrefType "journal-article" @default.
- W2104290684 hasAuthorship W2104290684A5016388891 @default.
- W2104290684 hasAuthorship W2104290684A5016776505 @default.
- W2104290684 hasAuthorship W2104290684A5053155811 @default.
- W2104290684 hasAuthorship W2104290684A5073792028 @default.
- W2104290684 hasAuthorship W2104290684A5084091807 @default.
- W2104290684 hasConcept C119857082 @default.
- W2104290684 hasConcept C124101348 @default.
- W2104290684 hasConcept C154945302 @default.
- W2104290684 hasConcept C176982825 @default.
- W2104290684 hasConcept C34736171 @default.
- W2104290684 hasConcept C41008148 @default.
- W2104290684 hasConcept C49937458 @default.
- W2104290684 hasConcept C78548338 @default.
- W2104290684 hasConcept C81081738 @default.
- W2104290684 hasConceptScore W2104290684C119857082 @default.
- W2104290684 hasConceptScore W2104290684C124101348 @default.
- W2104290684 hasConceptScore W2104290684C154945302 @default.
- W2104290684 hasConceptScore W2104290684C176982825 @default.
- W2104290684 hasConceptScore W2104290684C34736171 @default.
- W2104290684 hasConceptScore W2104290684C41008148 @default.
- W2104290684 hasConceptScore W2104290684C49937458 @default.
- W2104290684 hasConceptScore W2104290684C78548338 @default.
- W2104290684 hasConceptScore W2104290684C81081738 @default.