Matches in SemOpenAlex for { <https://semopenalex.org/work/W2118966476> ?p ?o ?g. }
Showing items 1 to 95 of
95
with 100 items per page.
- W2118966476 endingPage "395" @default.
- W2118966476 startingPage "346" @default.
- W2118966476 abstract "To enable information integration, schema matching is a critical step for discovering semantic correspondences of attributes across heterogeneous sources. While complex matchings are common, because of their far more complex search space, most existing techniques focus on simple 1:1 matchings. To tackle this challenge, this article takes a conceptually novel approach by viewing schema matching as correlation mining , for our task of matching Web query interfaces to integrate the myriad databases on the Internet. On this “deep Web ” query interfaces generally form complex matchings between attribute groups (e.g., {author} corresponds to {first name, last name} in the Books domain). We observe that the co-occurrences patterns across query interfaces often reveal such complex semantic relationships: grouping attributes (e.g., {first name, last name}) tend to be co-present in query interfaces and thus positively correlated. In contrast, synonym attributes are negatively correlated because they rarely co-occur. This insight enables us to discover complex matchings by a correlation mining approach. In particular, we develop the DCM framework, which consists of data preprocessing , dual mining of positive and negative correlations, and finally matching construction . We evaluate the DCM framework on manually extracted interfaces and the results show good accuracy for discovering complex matchings. Further, to automate the entire matching process, we incorporate automatic techniques for interface extraction. Executing the DCM framework on automatically extracted interfaces, we find that the inevitable errors in automatic interface extraction may significantly affect the matching result. To make the DCM framework robust against such “noisy” schemas, we integrate it with a novel “ensemble” approach, which creates an ensemble of DCM matchers, by randomizing the schema data into many trials and aggregating their ranked results by taking majority voting. As a principled basis, we provide analytic justification of the robustness of the ensemble approach. Empirically, our experiments show that the “ensemblization” indeed significantly boosts the matching accuracy, over automatically extracted and thus noisy schema data. By employing the DCM framework with the ensemble approach, we thus complete an automatic process of matchings Web query interfaces." @default.
- W2118966476 created "2016-06-24" @default.
- W2118966476 creator A5014248531 @default.
- W2118966476 creator A5070408220 @default.
- W2118966476 date "2006-03-01" @default.
- W2118966476 modified "2023-09-27" @default.
- W2118966476 title "Automatic complex schema matching across Web query interfaces" @default.
- W2118966476 cites W1600537614 @default.
- W2118966476 cites W1969831559 @default.
- W2118966476 cites W2008896880 @default.
- W2118966476 cites W2027780984 @default.
- W2118966476 cites W2051834357 @default.
- W2118966476 cites W2053539645 @default.
- W2118966476 cites W2066277072 @default.
- W2118966476 cites W2089634871 @default.
- W2118966476 cites W2094930182 @default.
- W2118966476 cites W2100417212 @default.
- W2118966476 cites W2105423800 @default.
- W2118966476 cites W2108489852 @default.
- W2118966476 cites W2110686900 @default.
- W2118966476 cites W2114990184 @default.
- W2118966476 cites W2117058208 @default.
- W2118966476 cites W2123853152 @default.
- W2118966476 cites W2140897975 @default.
- W2118966476 cites W2142385580 @default.
- W2118966476 cites W2150365753 @default.
- W2118966476 cites W2163329495 @default.
- W2118966476 cites W2166559705 @default.
- W2118966476 cites W2210278139 @default.
- W2118966476 cites W2221553715 @default.
- W2118966476 cites W2912934387 @default.
- W2118966476 doi "https://doi.org/10.1145/1132863.1132872" @default.
- W2118966476 hasPublicationYear "2006" @default.
- W2118966476 type Work @default.
- W2118966476 sameAs 2118966476 @default.
- W2118966476 citedByCount "93" @default.
- W2118966476 countsByYear W21189664762012 @default.
- W2118966476 countsByYear W21189664762013 @default.
- W2118966476 countsByYear W21189664762014 @default.
- W2118966476 countsByYear W21189664762015 @default.
- W2118966476 countsByYear W21189664762016 @default.
- W2118966476 countsByYear W21189664762018 @default.
- W2118966476 countsByYear W21189664762019 @default.
- W2118966476 countsByYear W21189664762020 @default.
- W2118966476 countsByYear W21189664762021 @default.
- W2118966476 crossrefType "journal-article" @default.
- W2118966476 hasAuthorship W2118966476A5014248531 @default.
- W2118966476 hasAuthorship W2118966476A5070408220 @default.
- W2118966476 hasConcept C105795698 @default.
- W2118966476 hasConcept C124101348 @default.
- W2118966476 hasConcept C154945302 @default.
- W2118966476 hasConcept C164120249 @default.
- W2118966476 hasConcept C165064840 @default.
- W2118966476 hasConcept C23123220 @default.
- W2118966476 hasConcept C2777327318 @default.
- W2118966476 hasConcept C33923547 @default.
- W2118966476 hasConcept C34736171 @default.
- W2118966476 hasConcept C41008148 @default.
- W2118966476 hasConcept C52146309 @default.
- W2118966476 hasConcept C72634772 @default.
- W2118966476 hasConcept C97854310 @default.
- W2118966476 hasConceptScore W2118966476C105795698 @default.
- W2118966476 hasConceptScore W2118966476C124101348 @default.
- W2118966476 hasConceptScore W2118966476C154945302 @default.
- W2118966476 hasConceptScore W2118966476C164120249 @default.
- W2118966476 hasConceptScore W2118966476C165064840 @default.
- W2118966476 hasConceptScore W2118966476C23123220 @default.
- W2118966476 hasConceptScore W2118966476C2777327318 @default.
- W2118966476 hasConceptScore W2118966476C33923547 @default.
- W2118966476 hasConceptScore W2118966476C34736171 @default.
- W2118966476 hasConceptScore W2118966476C41008148 @default.
- W2118966476 hasConceptScore W2118966476C52146309 @default.
- W2118966476 hasConceptScore W2118966476C72634772 @default.
- W2118966476 hasConceptScore W2118966476C97854310 @default.
- W2118966476 hasIssue "1" @default.
- W2118966476 hasLocation W21189664761 @default.
- W2118966476 hasOpenAccess W2118966476 @default.
- W2118966476 hasPrimaryLocation W21189664761 @default.
- W2118966476 hasRelatedWork W1601704076 @default.
- W2118966476 hasRelatedWork W2036073399 @default.
- W2118966476 hasRelatedWork W2125859764 @default.
- W2118966476 hasRelatedWork W2352498822 @default.
- W2118966476 hasRelatedWork W2363027842 @default.
- W2118966476 hasRelatedWork W2371022392 @default.
- W2118966476 hasRelatedWork W2372910313 @default.
- W2118966476 hasRelatedWork W2375007105 @default.
- W2118966476 hasRelatedWork W3130973930 @default.
- W2118966476 hasRelatedWork W73343063 @default.
- W2118966476 hasVolume "31" @default.
- W2118966476 isParatext "false" @default.
- W2118966476 isRetracted "false" @default.
- W2118966476 magId "2118966476" @default.
- W2118966476 workType "article" @default.