Matches in SemOpenAlex for { <https://semopenalex.org/work/W2911568967> ?p ?o ?g. }
- W2911568967 abstract "With the expanding diversity of database technologies and database sizes, it is becoming increasingly hard to identify similar relational databases among many large databases stored in different Database Management Systems (DBMS). Therefore, we propose to use data mining techniques to automatically identify similar structures of relational databases by comparing their metadata, which is composed of physical details of the databases. The amount of metadata is proportional to the size of the schema structure. The number of possible pairwise comparisons is quadratic in the number of schemas analyzed. Looking for the most efficient technique, we propose to calculate schema similarity by evaluating the distance of every schema to just one schema, which serves as a starting point. Schemas with close distances are more similar than schemas with larger distances. We compare this proposal against two other approaches. The first approach compares every schema against all other schemas, skipping the inverse of each comparison. The second approach compares schemas within groups of schemas of similar size. To validate our proposal, an experiment is performed with 354 real schemas ranging in size from 2 to 20 thousand metadata, totaling more than 26 thousand tables and 238 thousand columns. Those schemas came from 5 different DBMS. The extracted metadata is transformed and formatted for comparing pairs of schemas. The textual features are compared using Cosine Distance and numerical features are compared using Euclidean Distance. Then, the hierarchical clustering technique is used to facilitate the visualization of the schemas that most closely resemble one another. Results showed that our proposal was the most efficient because it compared all schemas and identified the most similar schemas by structure in less than 2 minutes.
The extracted metadata was used to create the first version of the metadata repository and an initial version of a data catalog, which contributed to the knowledge of existing data. Using this procedure, duplicated schemas were discovered and then discontinued, resulting in cost savings of 10% while freeing up infrastructure resources. This solution is flexible: it supports a variety of schema sizes and DBMS." @default.
- W2911568967 created "2019-02-21" @default.
- W2911568967 creator A5026224413 @default.
- W2911568967 creator A5038955665 @default.
- W2911568967 creator A5054089208 @default.
- W2911568967 creator A5068340505 @default.
- W2911568967 date "2018-11-01" @default.
- W2911568967 modified "2023-09-27" @default.
- W2911568967 title "Large Database Schema Matching using Data Mining Techniques" @default.
- W2911568967 cites W1533117389 @default.
- W2911568967 cites W1547612978 @default.
- W2911568967 cites W1998982581 @default.
- W2911568967 cites W2008896880 @default.
- W2911568967 cites W2085478182 @default.
- W2911568967 cites W2095708598 @default.
- W2911568967 cites W2111998194 @default.
- W2911568967 cites W2120718782 @default.
- W2911568967 cites W2138745488 @default.
- W2911568967 cites W2139135093 @default.
- W2911568967 cites W2160683489 @default.
- W2911568967 cites W2168996210 @default.
- W2911568967 cites W2776474505 @default.
- W2911568967 cites W2783642575 @default.
- W2911568967 doi "https://doi.org/10.1109/icdmw.2018.00083" @default.
- W2911568967 hasPublicationYear "2018" @default.
- W2911568967 type Work @default.
- W2911568967 sameAs 2911568967 @default.
- W2911568967 citedByCount "2" @default.
- W2911568967 countsByYear W29115689672021 @default.
- W2911568967 countsByYear W29115689672022 @default.
- W2911568967 crossrefType "proceedings-article" @default.
- W2911568967 hasAuthorship W2911568967A5026224413 @default.
- W2911568967 hasAuthorship W2911568967A5038955665 @default.
- W2911568967 hasAuthorship W2911568967A5054089208 @default.
- W2911568967 hasAuthorship W2911568967A5068340505 @default.
- W2911568967 hasConcept C124101348 @default.
- W2911568967 hasConcept C136764020 @default.
- W2911568967 hasConcept C148840519 @default.
- W2911568967 hasConcept C150012506 @default.
- W2911568967 hasConcept C153048206 @default.
- W2911568967 hasConcept C190703929 @default.
- W2911568967 hasConcept C23123220 @default.
- W2911568967 hasConcept C30775581 @default.
- W2911568967 hasConcept C41008148 @default.
- W2911568967 hasConcept C52146309 @default.
- W2911568967 hasConcept C56310702 @default.
- W2911568967 hasConcept C5655090 @default.
- W2911568967 hasConcept C77088390 @default.
- W2911568967 hasConcept C93518851 @default.
- W2911568967 hasConceptScore W2911568967C124101348 @default.
- W2911568967 hasConceptScore W2911568967C136764020 @default.
- W2911568967 hasConceptScore W2911568967C148840519 @default.
- W2911568967 hasConceptScore W2911568967C150012506 @default.
- W2911568967 hasConceptScore W2911568967C153048206 @default.
- W2911568967 hasConceptScore W2911568967C190703929 @default.
- W2911568967 hasConceptScore W2911568967C23123220 @default.
- W2911568967 hasConceptScore W2911568967C30775581 @default.
- W2911568967 hasConceptScore W2911568967C41008148 @default.
- W2911568967 hasConceptScore W2911568967C52146309 @default.
- W2911568967 hasConceptScore W2911568967C56310702 @default.
- W2911568967 hasConceptScore W2911568967C5655090 @default.
- W2911568967 hasConceptScore W2911568967C77088390 @default.
- W2911568967 hasConceptScore W2911568967C93518851 @default.
- W2911568967 hasLocation W29115689671 @default.
- W2911568967 hasOpenAccess W2911568967 @default.
- W2911568967 hasPrimaryLocation W29115689671 @default.
- W2911568967 hasRelatedWork W192992363 @default.
- W2911568967 hasRelatedWork W1978969411 @default.
- W2911568967 hasRelatedWork W1985081702 @default.
- W2911568967 hasRelatedWork W1996958951 @default.
- W2911568967 hasRelatedWork W2058631927 @default.
- W2911568967 hasRelatedWork W2087376388 @default.
- W2911568967 hasRelatedWork W2129725174 @default.
- W2911568967 hasRelatedWork W2582962420 @default.
- W2911568967 hasRelatedWork W2911568967 @default.
- W2911568967 hasRelatedWork W3021142537 @default.
- W2911568967 isParatext "false" @default.
- W2911568967 isRetracted "false" @default.
- W2911568967 magId "2911568967" @default.
- W2911568967 workType "article" @default.