Matches in SemOpenAlex for { <https://semopenalex.org/work/W2145503758> ?p ?o ?g. }
Showing items 1 to 92 of 92 with 100 items per page.
- W2145503758 abstract "Data mining algorithms generally assume that data will be clean and consistent. However, in practice, this is not always the case, and for this reason the detection and elimination of duplicate records is an important part of data cleaning. The presence of similar-duplicate records causes over-representation of data. If the database contains different representations of the same data, the results obtained from the data mining algorithm will be erroneous. The detection of similar-duplicate records is a difficult task, especially when the records are domain-independent. In this paper, we propose a novel domain-independent technique for better reconciling the similar-duplicate records. We also introduce new ideas for making similar-duplicate detection algorithms faster and more efficient. In addition, a significant modification of the transitivity rule is also proposed. Finally, we propose an algorithm that incorporates all these techniques for similar-duplicate detection into a domain-independent environment. The performance of the proposed method has been compared to other methods and the superiority of the proposed method has been confirmed by the experimental results." @default.
- W2145503758 created "2016-06-24" @default.
- W2145503758 creator A5014708079 @default.
- W2145503758 creator A5023057399 @default.
- W2145503758 creator A5048946105 @default.
- W2145503758 date "2010-12-01" @default.
- W2145503758 modified "2023-09-23" @default.
- W2145503758 title "A Domain-Independent Data Cleaning Algorithm for Detecting Similar-Duplicates" @default.
- W2145503758 cites W1484228408 @default.
- W2145503758 cites W1513543741 @default.
- W2145503758 cites W1559390933 @default.
- W2145503758 cites W1569123402 @default.
- W2145503758 cites W1612155886 @default.
- W2145503758 cites W1647671624 @default.
- W2145503758 cites W2010392031 @default.
- W2145503758 cites W2024770506 @default.
- W2145503758 cites W2050071106 @default.
- W2145503758 cites W2065290081 @default.
- W2145503758 cites W2087064593 @default.
- W2145503758 cites W2101939932 @default.
- W2145503758 cites W2107976925 @default.
- W2145503758 cites W2108991785 @default.
- W2145503758 cites W2111192396 @default.
- W2145503758 cites W2131576956 @default.
- W2145503758 cites W2134826720 @default.
- W2145503758 cites W2259773661 @default.
- W2145503758 cites W25706487 @default.
- W2145503758 doi "https://doi.org/10.4304/jcp.5.12.1800-1809" @default.
- W2145503758 hasPublicationYear "2010" @default.
- W2145503758 type Work @default.
- W2145503758 sameAs 2145503758 @default.
- W2145503758 citedByCount "3" @default.
- W2145503758 countsByYear W21455037582013 @default.
- W2145503758 countsByYear W21455037582015 @default.
- W2145503758 countsByYear W21455037582020 @default.
- W2145503758 crossrefType "journal-article" @default.
- W2145503758 hasAuthorship W2145503758A5014708079 @default.
- W2145503758 hasAuthorship W2145503758A5023057399 @default.
- W2145503758 hasAuthorship W2145503758A5048946105 @default.
- W2145503758 hasConcept C11413529 @default.
- W2145503758 hasConcept C124101348 @default.
- W2145503758 hasConcept C134306372 @default.
- W2145503758 hasConcept C162324750 @default.
- W2145503758 hasConcept C17744445 @default.
- W2145503758 hasConcept C187736073 @default.
- W2145503758 hasConcept C199539241 @default.
- W2145503758 hasConcept C2776359362 @default.
- W2145503758 hasConcept C2780451532 @default.
- W2145503758 hasConcept C33923547 @default.
- W2145503758 hasConcept C36503486 @default.
- W2145503758 hasConcept C41008148 @default.
- W2145503758 hasConcept C94625758 @default.
- W2145503758 hasConceptScore W2145503758C11413529 @default.
- W2145503758 hasConceptScore W2145503758C124101348 @default.
- W2145503758 hasConceptScore W2145503758C134306372 @default.
- W2145503758 hasConceptScore W2145503758C162324750 @default.
- W2145503758 hasConceptScore W2145503758C17744445 @default.
- W2145503758 hasConceptScore W2145503758C187736073 @default.
- W2145503758 hasConceptScore W2145503758C199539241 @default.
- W2145503758 hasConceptScore W2145503758C2776359362 @default.
- W2145503758 hasConceptScore W2145503758C2780451532 @default.
- W2145503758 hasConceptScore W2145503758C33923547 @default.
- W2145503758 hasConceptScore W2145503758C36503486 @default.
- W2145503758 hasConceptScore W2145503758C41008148 @default.
- W2145503758 hasConceptScore W2145503758C94625758 @default.
- W2145503758 hasLocation W21455037581 @default.
- W2145503758 hasOpenAccess W2145503758 @default.
- W2145503758 hasPrimaryLocation W21455037581 @default.
- W2145503758 hasRelatedWork W1495001529 @default.
- W2145503758 hasRelatedWork W1524850974 @default.
- W2145503758 hasRelatedWork W1556344493 @default.
- W2145503758 hasRelatedWork W1859663609 @default.
- W2145503758 hasRelatedWork W2004025009 @default.
- W2145503758 hasRelatedWork W2026770594 @default.
- W2145503758 hasRelatedWork W2055405704 @default.
- W2145503758 hasRelatedWork W205673538 @default.
- W2145503758 hasRelatedWork W2106234354 @default.
- W2145503758 hasRelatedWork W2126131234 @default.
- W2145503758 hasRelatedWork W2130735255 @default.
- W2145503758 hasRelatedWork W2148481813 @default.
- W2145503758 hasRelatedWork W2265934547 @default.
- W2145503758 hasRelatedWork W2501900035 @default.
- W2145503758 hasRelatedWork W2674513843 @default.
- W2145503758 hasRelatedWork W2753804356 @default.
- W2145503758 hasRelatedWork W2794107983 @default.
- W2145503758 hasRelatedWork W2894986625 @default.
- W2145503758 hasRelatedWork W3176117280 @default.
- W2145503758 hasRelatedWork W1566649662 @default.
- W2145503758 isParatext "false" @default.
- W2145503758 isRetracted "false" @default.
- W2145503758 magId "2145503758" @default.
- W2145503758 workType "article" @default.