Matches in SemOpenAlex for { <https://semopenalex.org/work/W2031293887> ?p ?o ?g. }
- W2031293887 endingPage "59" @default.
- W2031293887 startingPage "29" @default.
- W2031293887 abstract "Clustering is a popular data analysis and data mining technique. However, applying traditional clustering algorithms directly to a database is not straightforward due to the fact that a database usually consists of structured and related data; moreover, there might be several object views of the database to be clustered, depending on a data analyst's particular interest. Finally, in many cases, there is a data model discrepancy between the format used to store the database to be analyzed and the representation format that clustering algorithms expect as their input. These discrepancies have been mostly ignored by current research. This paper focuses on identifying those discrepancies and on analyzing their impact on the application of clustering techniques to databases. We are particularly interested in the question on how clustering algorithms can be generalized to become more directly applicable to real-world databases. The paper introduces methodologies, techniques, and tools that serve this purpose. We propose a data set representation framework for database clustering that characterizes objects to be clustered through sets of tuples, and introduce preprocessing techniques and tools to generate object views based on this framework. Moreover, we introduce bag-oriented similarity measures and clustering algorithms that are suitable for our proposed data set representation framework. We also demonstrate that our approach is capable of dealing with relationship information commonly found in databases through the bag-oriented clustering. We also argue that our bag-oriented data representation framework is more suitable for database clustering than the commonly used flat file format and produce better quality of clusters." @default.
- W2031293887 created "2016-06-24" @default.
- W2031293887 creator A5019887282 @default.
- W2031293887 creator A5056584154 @default.
- W2031293887 date "2005-03-04" @default.
- W2031293887 modified "2023-09-26" @default.
- W2031293887 title "A database clustering methodology and tool" @default.
- W2031293887 cites W140846004 @default.
- W2031293887 cites W147860157 @default.
- W2031293887 cites W1496748331 @default.
- W2031293887 cites W1506989794 @default.
- W2031293887 cites W1518912663 @default.
- W2031293887 cites W1524704912 @default.
- W2031293887 cites W1527883571 @default.
- W2031293887 cites W1528368796 @default.
- W2031293887 cites W1539524531 @default.
- W2031293887 cites W1566114229 @default.
- W2031293887 cites W1575476631 @default.
- W2031293887 cites W1584003909 @default.
- W2031293887 cites W1596324102 @default.
- W2031293887 cites W1601529450 @default.
- W2031293887 cites W16159250 @default.
- W2031293887 cites W1673310716 @default.
- W2031293887 cites W195759886 @default.
- W2031293887 cites W1971784203 @default.
- W2031293887 cites W1997200791 @default.
- W2031293887 cites W1998384440 @default.
- W2031293887 cites W2004131797 @default.
- W2031293887 cites W2013550121 @default.
- W2031293887 cites W2032026209 @default.
- W2031293887 cites W2041674806 @default.
- W2031293887 cites W2056884786 @default.
- W2031293887 cites W2059975159 @default.
- W2031293887 cites W2064686951 @default.
- W2031293887 cites W2073308541 @default.
- W2031293887 cites W2082236330 @default.
- W2031293887 cites W2095897464 @default.
- W2031293887 cites W2103028401 @default.
- W2031293887 cites W2118587067 @default.
- W2031293887 cites W2125055259 @default.
- W2031293887 cites W2132651096 @default.
- W2031293887 cites W2140190241 @default.
- W2031293887 cites W2147694185 @default.
- W2031293887 cites W2149127516 @default.
- W2031293887 cites W2156049106 @default.
- W2031293887 cites W2158148818 @default.
- W2031293887 cites W2163952039 @default.
- W2031293887 cites W2166559705 @default.
- W2031293887 cites W2169237596 @default.
- W2031293887 cites W2169371330 @default.
- W2031293887 cites W2292185241 @default.
- W2031293887 cites W2612166593 @default.
- W2031293887 cites W40176063 @default.
- W2031293887 cites W42468102 @default.
- W2031293887 cites W4909282 @default.
- W2031293887 cites W54446695 @default.
- W2031293887 cites W84081692 @default.
- W2031293887 cites W1520953884 @default.
- W2031293887 doi "https://doi.org/10.1016/j.ins.2004.03.016" @default.
- W2031293887 hasPublicationYear "2005" @default.
- W2031293887 type Work @default.
- W2031293887 sameAs 2031293887 @default.
- W2031293887 citedByCount "21" @default.
- W2031293887 countsByYear W20312938872012 @default.
- W2031293887 countsByYear W20312938872014 @default.
- W2031293887 countsByYear W20312938872015 @default.
- W2031293887 countsByYear W20312938872018 @default.
- W2031293887 countsByYear W20312938872020 @default.
- W2031293887 crossrefType "journal-article" @default.
- W2031293887 hasAuthorship W2031293887A5019887282 @default.
- W2031293887 hasAuthorship W2031293887A5056584154 @default.
- W2031293887 hasConcept C118615104 @default.
- W2031293887 hasConcept C118930307 @default.
- W2031293887 hasConcept C119857082 @default.
- W2031293887 hasConcept C124101348 @default.
- W2031293887 hasConcept C154945302 @default.
- W2031293887 hasConcept C17212007 @default.
- W2031293887 hasConcept C177264268 @default.
- W2031293887 hasConcept C17744445 @default.
- W2031293887 hasConcept C186767784 @default.
- W2031293887 hasConcept C193143536 @default.
- W2031293887 hasConcept C199360897 @default.
- W2031293887 hasConcept C199539241 @default.
- W2031293887 hasConcept C23123220 @default.
- W2031293887 hasConcept C2776359362 @default.
- W2031293887 hasConcept C33704608 @default.
- W2031293887 hasConcept C33923547 @default.
- W2031293887 hasConcept C34736171 @default.
- W2031293887 hasConcept C41008148 @default.
- W2031293887 hasConcept C73555534 @default.
- W2031293887 hasConcept C77088390 @default.
- W2031293887 hasConcept C94625758 @default.
- W2031293887 hasConcept C94641424 @default.
- W2031293887 hasConceptScore W2031293887C118615104 @default.
- W2031293887 hasConceptScore W2031293887C118930307 @default.
- W2031293887 hasConceptScore W2031293887C119857082 @default.
- W2031293887 hasConceptScore W2031293887C124101348 @default.
- W2031293887 hasConceptScore W2031293887C154945302 @default.