Matches in SemOpenAlex for { <https://semopenalex.org/work/W2965811749> ?p ?o ?g. }
- W2965811749 abstract "Outlier detection is a fundamental task in data mining and has many applications, including detecting errors in databases. While there has been extensive prior work on methods for outlier detection, modern datasets are often too large for commonly used methods to process within a reasonable time. To overcome this issue, outlier detection methods can be trained over samples of the full-sized dataset. However, it is not clear how a model trained on a sample compares with one trained on the entire dataset. In this paper, we introduce the notion of resilience to sampling for outlier detection methods. Orthogonal to traditional performance metrics such as precision/recall, resilience represents the extent to which the outliers detected by a method applied to samples from a sampling scheme match those detected when it is applied to the whole dataset. We propose a novel approach for estimating the resilience to sampling of both individual outlier methods and their ensembles. We performed an extensive experimental study on synthetic and real-world datasets in which we studied seven diverse and representative outlier detection methods, compared results obtained from samples with those obtained from the whole datasets, and evaluated the accuracy of our resilience estimates. We observed that the methods are not equally resilient to a given sampling scheme and that careful joint selection of both the sampling scheme and the outlier detection method is often necessary. It is our hope that the paper initiates research on designing outlier detection algorithms that are resilient to sampling." @default.
- W2965811749 created "2019-08-13" @default.
- W2965811749 creator A5001576272 @default.
- W2965811749 creator A5066004392 @default.
- W2965811749 creator A5091872345 @default.
- W2965811749 date "2019-07-30" @default.
- W2965811749 modified "2023-09-23" @default.
- W2965811749 title "Are outlier detection methods resilient to sampling" @default.
- W2965811749 cites W1977556410 @default.
- W2965811749 cites W1990106316 @default.
- W2965811749 cites W2056081083 @default.
- W2965811749 cites W2056972503 @default.
- W2965811749 cites W2061240327 @default.
- W2965811749 cites W2097714558 @default.
- W2965811749 cites W2101549186 @default.
- W2965811749 cites W2110784166 @default.
- W2965811749 cites W2113060550 @default.
- W2965811749 cites W2122646361 @default.
- W2965811749 cites W2124536999 @default.
- W2965811749 cites W2129281431 @default.
- W2965811749 cites W2134518716 @default.
- W2965811749 cites W2144182447 @default.
- W2965811749 cites W2160200253 @default.
- W2965811749 cites W2170651405 @default.
- W2965811749 cites W2282861635 @default.
- W2965811749 cites W2298388144 @default.
- W2965811749 cites W2319794630 @default.
- W2965811749 cites W2338990760 @default.
- W2965811749 cites W2784359816 @default.
- W2965811749 cites W9014458 @default.
- W2965811749 hasPublicationYear "2019" @default.
- W2965811749 type Work @default.
- W2965811749 sameAs 2965811749 @default.
- W2965811749 citedByCount "0" @default.
- W2965811749 crossrefType "posted-content" @default.
- W2965811749 hasAuthorship W2965811749A5001576272 @default.
- W2965811749 hasAuthorship W2965811749A5066004392 @default.
- W2965811749 hasAuthorship W2965811749A5091872345 @default.
- W2965811749 hasConcept C119857082 @default.
- W2965811749 hasConcept C121332964 @default.
- W2965811749 hasConcept C124101348 @default.
- W2965811749 hasConcept C127413603 @default.
- W2965811749 hasConcept C134306372 @default.
- W2965811749 hasConcept C140779682 @default.
- W2965811749 hasConcept C153180895 @default.
- W2965811749 hasConcept C154945302 @default.
- W2965811749 hasConcept C185592680 @default.
- W2965811749 hasConcept C198531522 @default.
- W2965811749 hasConcept C201995342 @default.
- W2965811749 hasConcept C2779585090 @default.
- W2965811749 hasConcept C2780451532 @default.
- W2965811749 hasConcept C33923547 @default.
- W2965811749 hasConcept C41008148 @default.
- W2965811749 hasConcept C43617362 @default.
- W2965811749 hasConcept C739882 @default.
- W2965811749 hasConcept C76155785 @default.
- W2965811749 hasConcept C77618280 @default.
- W2965811749 hasConcept C79337645 @default.
- W2965811749 hasConcept C81917197 @default.
- W2965811749 hasConcept C94915269 @default.
- W2965811749 hasConcept C97355855 @default.
- W2965811749 hasConceptScore W2965811749C119857082 @default.
- W2965811749 hasConceptScore W2965811749C121332964 @default.
- W2965811749 hasConceptScore W2965811749C124101348 @default.
- W2965811749 hasConceptScore W2965811749C127413603 @default.
- W2965811749 hasConceptScore W2965811749C134306372 @default.
- W2965811749 hasConceptScore W2965811749C140779682 @default.
- W2965811749 hasConceptScore W2965811749C153180895 @default.
- W2965811749 hasConceptScore W2965811749C154945302 @default.
- W2965811749 hasConceptScore W2965811749C185592680 @default.
- W2965811749 hasConceptScore W2965811749C198531522 @default.
- W2965811749 hasConceptScore W2965811749C201995342 @default.
- W2965811749 hasConceptScore W2965811749C2779585090 @default.
- W2965811749 hasConceptScore W2965811749C2780451532 @default.
- W2965811749 hasConceptScore W2965811749C33923547 @default.
- W2965811749 hasConceptScore W2965811749C41008148 @default.
- W2965811749 hasConceptScore W2965811749C43617362 @default.
- W2965811749 hasConceptScore W2965811749C739882 @default.
- W2965811749 hasConceptScore W2965811749C76155785 @default.
- W2965811749 hasConceptScore W2965811749C77618280 @default.
- W2965811749 hasConceptScore W2965811749C79337645 @default.
- W2965811749 hasConceptScore W2965811749C81917197 @default.
- W2965811749 hasConceptScore W2965811749C94915269 @default.
- W2965811749 hasConceptScore W2965811749C97355855 @default.
- W2965811749 hasOpenAccess W2965811749 @default.
- W2965811749 hasRelatedWork W1512833232 @default.
- W2965811749 hasRelatedWork W1556666131 @default.
- W2965811749 hasRelatedWork W1990106316 @default.
- W2965811749 hasRelatedWork W2056081083 @default.
- W2965811749 hasRelatedWork W2109634084 @default.
- W2965811749 hasRelatedWork W2167910929 @default.
- W2965811749 hasRelatedWork W2282861635 @default.
- W2965811749 hasRelatedWork W2361902984 @default.
- W2965811749 hasRelatedWork W2510696606 @default.
- W2965811749 hasRelatedWork W2913474760 @default.
- W2965811749 hasRelatedWork W2936342010 @default.
- W2965811749 hasRelatedWork W2986348730 @default.
- W2965811749 hasRelatedWork W3039137796 @default.
- W2965811749 hasRelatedWork W3080145627 @default.
- W2965811749 hasRelatedWork W3117098906 @default.