Matches in SemOpenAlex for { <https://semopenalex.org/work/W50348840> ?p ?o ?g. }
Showing items 1 to 84 of
84
with 100 items per page.
- W50348840 endingPage "98" @default.
- W50348840 startingPage "84" @default.
- W50348840 abstract "This paper describes a methodology for the application of hierarchical clustering methods to the task of outlier detection. The methodology is tested on the problem of cleaning Official Statistics data. The goal is to detect erroneous foreign trade transactions in data collected by the Portuguese Institute of Statistics (INE). These transactions are a minority, but still they have an important impact on the statistics produced by the institute. The detectiong of these rare errors is a manual, time-consuming task. This type of tasks is usually constrained by a limited amount of available resources. Our proposal addresses this issue by producing a ranking of outlyingness that allows a better management of the available resources by allocating them to the cases which are most different from the other and, thus, have a higher probability of being errors. Our method is based on the output of standard agglomerative hierarchical clustering algorithms, resulting in no significant additional computational costs. Our results show that it enables large savings by selecting a small subset of suspicious transactions for manual inspection, which, nevertheless, includes most of the erroneous transactions. In this study we compare our proposal to a state of the art outlier ranking method (LOF) and show that our method achieves better results on this particular application. The results of our experiments are also competitive with previous results on the same data. Finally, the outcome of our experiments raises important questions concerning the method currently followed at INE concerning items with small number of transactions." @default.
- W50348840 created "2016-06-24" @default.
- W50348840 creator A5058183984 @default.
- W50348840 creator A5080727549 @default.
- W50348840 date "2010-08-07" @default.
- W50348840 modified "2023-09-23" @default.
- W50348840 title "Resource-bounded Outlier Detection using Clustering Methods" @default.
- W50348840 cites W1498047526 @default.
- W50348840 cites W1552339598 @default.
- W50348840 cites W1574715085 @default.
- W50348840 cites W1575476631 @default.
- W50348840 cites W1594487429 @default.
- W50348840 cites W1970655212 @default.
- W50348840 cites W1997648776 @default.
- W50348840 cites W2049058890 @default.
- W50348840 cites W2133160781 @default.
- W50348840 cites W2137130182 @default.
- W50348840 cites W2144182447 @default.
- W50348840 cites W2331052961 @default.
- W50348840 cites W2995395573 @default.
- W50348840 cites W2999729612 @default.
- W50348840 cites W3041834803 @default.
- W50348840 hasPublicationYear "2010" @default.
- W50348840 type Work @default.
- W50348840 sameAs 50348840 @default.
- W50348840 citedByCount "5" @default.
- W50348840 countsByYear W503488402013 @default.
- W50348840 countsByYear W503488402015 @default.
- W50348840 countsByYear W503488402017 @default.
- W50348840 countsByYear W503488402018 @default.
- W50348840 crossrefType "journal-article" @default.
- W50348840 hasAuthorship W50348840A5058183984 @default.
- W50348840 hasAuthorship W50348840A5080727549 @default.
- W50348840 hasConcept C119857082 @default.
- W50348840 hasConcept C124101348 @default.
- W50348840 hasConcept C127413603 @default.
- W50348840 hasConcept C154945302 @default.
- W50348840 hasConcept C189430467 @default.
- W50348840 hasConcept C201995342 @default.
- W50348840 hasConcept C2780451532 @default.
- W50348840 hasConcept C41008148 @default.
- W50348840 hasConcept C73555534 @default.
- W50348840 hasConcept C739882 @default.
- W50348840 hasConcept C79337645 @default.
- W50348840 hasConcept C92835128 @default.
- W50348840 hasConceptScore W50348840C119857082 @default.
- W50348840 hasConceptScore W50348840C124101348 @default.
- W50348840 hasConceptScore W50348840C127413603 @default.
- W50348840 hasConceptScore W50348840C154945302 @default.
- W50348840 hasConceptScore W50348840C189430467 @default.
- W50348840 hasConceptScore W50348840C201995342 @default.
- W50348840 hasConceptScore W50348840C2780451532 @default.
- W50348840 hasConceptScore W50348840C41008148 @default.
- W50348840 hasConceptScore W50348840C73555534 @default.
- W50348840 hasConceptScore W50348840C739882 @default.
- W50348840 hasConceptScore W50348840C79337645 @default.
- W50348840 hasConceptScore W50348840C92835128 @default.
- W50348840 hasOpenAccess W50348840 @default.
- W50348840 hasRelatedWork W1975369827 @default.
- W50348840 hasRelatedWork W2002298107 @default.
- W50348840 hasRelatedWork W2037758579 @default.
- W50348840 hasRelatedWork W2074205536 @default.
- W50348840 hasRelatedWork W2111949697 @default.
- W50348840 hasRelatedWork W2140217547 @default.
- W50348840 hasRelatedWork W2777205967 @default.
- W50348840 hasRelatedWork W2791401766 @default.
- W50348840 hasRelatedWork W2811882588 @default.
- W50348840 hasRelatedWork W2875652255 @default.
- W50348840 hasRelatedWork W2900023892 @default.
- W50348840 hasRelatedWork W2907435612 @default.
- W50348840 hasRelatedWork W2912881731 @default.
- W50348840 hasRelatedWork W2952551828 @default.
- W50348840 hasRelatedWork W2962942694 @default.
- W50348840 hasRelatedWork W3049447432 @default.
- W50348840 hasRelatedWork W3088258822 @default.
- W50348840 hasRelatedWork W3103821869 @default.
- W50348840 hasRelatedWork W3107366521 @default.
- W50348840 hasRelatedWork W8455847 @default.
- W50348840 isParatext "false" @default.
- W50348840 isRetracted "false" @default.
- W50348840 magId "50348840" @default.
- W50348840 workType "article" @default.