An Efficient Divergence and Distribution Based Similarity Measure for Clustering Of Uncertain Data

Geetha; Shyla, Mary

doi:https://dx.dx.doi.org/10.21275/13031403

An Efficient Divergence and Distribution Based Similarity Measure for Clustering Of Uncertain Data

Geetha, Mary Shyla

Abstract: Data Mining is the extraction of hidden predictive information from large databases. Clustering is one of the popular data mining techniques. Clustering on uncertain data, one of the essential tasks in mining uncertain data, posts significant challenges on both modeling similarity between uncertain objects and developing efficient computational methods. The previous methods extend traditional partitioning clustering methods. Such methods cannot handle uncertain objects that are geometrically indistinguishable, such as products with the same mean but very different variances in customer ratings. Surprisingly, probability distributions, which are essential characteristics of uncertain objects, have not been considered in measuring similarity between uncertain objects. In Existing method to use the well-known Kullback-Leibler divergence to measure similarity between uncertain objects in both the continuous and discrete cases, and integrate it into partitioning and density-based clustering methods to cluster uncertain objects. It is very costly or even infeasible. The proposed work introduces the well-known Kernel skew divergence to measure similarity between uncertain objects in both the continuous and discrete cases. Measuring the cluster similarity with Poisson distribution is a discrete probability distribution that expresses the probability of a given number of events occurring in a fixed interval of time and/or space and to further speed up the computation.

Keywords: Clustering, uncertain data, Kernel skew Divergence and distribution

How to Cite?: Geetha, Mary Shyla, "An Efficient Divergence and Distribution Based Similarity Measure for Clustering Of Uncertain Data", Volume 3 Issue 3, March 2014, International Journal of Science and Research (IJSR), Pages: 333-339, https://www.ijsr.net/getabstract.php?paperid=13031403, DOI: https://dx.dx.doi.org/10.21275/13031403

Download Citation: APA | MLA | BibTeX | EndNote | RefMan