Clustering distributed homogeneous datasets

Srinivasan Parthasarathy, Mitsunori Ogihara

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

17 Scopus citations

Abstract

In this paper, we present an elegant and effective algorithm for measuring the similarity between homogeneous datasets to enable clustering. Once similar datasets are clustered, each cluster can be mined independently to generate the rules appropriate to that cluster. The algorithm is efficient in storage and scale, can adjust to time constraints, and can provide the user with likely causes of similarity or dissimilarity. The proposed similarity measure is evaluated and validated on real datasets from the Census Bureau and Reuters, and on synthetic datasets from IBM.

Original language: English (US)
Title of host publication: Principles of Data Mining and Knowledge Discovery - 4th European Conference, PKDD 2000, Proceedings
Editors: Djamel A. Zighed, Jan Komorowski, Jan Zytkow
Publisher: Springer Verlag
Pages: 566-574
Number of pages: 9
ISBN (Print): 9783540410669
State: Published - Jan 1 2000
Externally published: Yes
Event: 4th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2000 - Lyon, France
Duration: Sep 13 2000 → Sep 16 2000

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 1910
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 4th European Conference on Principles and Practice of Knowledge Discovery in Databases, PKDD 2000
Country: France
City: Lyon
Period: 9/13/00 → 9/16/00

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

Cite this

Parthasarathy, S., & Ogihara, M. (2000). Clustering distributed homogeneous datasets. In D. A. Zighed, J. Komorowski, & J. Zytkow (Eds.), Principles of Data Mining and Knowledge Discovery - 4th European Conference, PKDD 2000, Proceedings (pp. 566-574). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 1910). Springer Verlag.