A new distributed data mining model based on similarity

Tao Li, Shenghuo Zhu, Mitsunori Ogihara

Research output: Contribution to conference (Paper, peer-reviewed)

22 Scopus citations


Distributed Data Mining (DDM) has been a very active area and has enjoyed a growing amount of attention since its inception. Current DDM techniques regard the distributed data sets as a single virtual table and assume that there exists a global model which could be generated if the data were combined or centralized. This paper proposes a similarity-based distributed data mining (SBDDM) framework which explicitly takes the differences among distributed sources into consideration. A new similarity measure is introduced, and its effectiveness is evaluated and validated. The paper also illustrates the limitations of current DDM techniques through three concrete case studies. Finally, distributed clustering within the SBDDM framework is discussed.

Original language: English (US)
Number of pages: 5
State: Published - 2003
Externally published: Yes
Event: Proceedings of the 2003 ACM Symposium on Applied Computing - Melbourne, FL, United States
Duration: Mar 9 2003 - Mar 12 2003


Keywords

  • Distributed Data Mining (DDM)
  • Similarity

ASJC Scopus subject areas

  • Software

