Relative Performance of K-Means, Single Linkage and Affinity Propagation in Cluster Analysis

Srinivas Nadipalli, Venkata Dattatreya Rao Akkavajhula, Karteeka Pavan Kanadam

Abstract


Here an attempt is made to study the relative performance of K-Means, Single Linkage and Affinity Propagation in clustering six public data sets viz., Iris, Glass, Breast Cancer, Half-moon, Path based and Spiral. The performance of clustering methods is studied based on seven validation techniques viz., Rand, Adjusted Rand, Error Rate, Silhouette, Davies-Bouldin, Dunn and CS Indices. The results obtained empirically and conclusions are summarized in section 6.

References


Anderson, T.W. (1962). An introduction to Multivariate Statistical Analysis. Wiley, New York.

Andrews, H.C. (1972). Introduction to Mathematical Techniques in Pattern Recognition, John Wiley & Sons, New York.

Arjun K Pujari (2001) Data Mining Techniques. University Press, India

Bandyopadhyay. S et al (1995). Pattern classification with genetic algorithms. Patterns Recognition Letters, Vol. 16(8) pp. 801-808.

Bose N. K and Liang P (1998) Neural Network Fundamentals with Graphs, Algorithms and Applications. Tata McGraw-Hill Publishing, New Delhi, India

Christopher M. Bishop (1995).Neural Networks for Pattern Rcognition. Oxford university press.

Delbert Dueck (2009) Affinity Propagation: Clustering data by passing messages.

Fisher, R. A (1936) The use of Multiple measurements in Taxonomic Problems, Annals of Eugencies

Fukunaga, K. (1990). Introduction to Statistical Pattern Recognition. Academic Press, 2nd edition.

German B (1987) Glass identification Database, UCI Machine Learning Repository.

Jang, Sun and Mizutani (2003) Neuro-Fuzzy and Soft Computing. PHI publications, New Delhi, India

Johnson Richard A., and Wichern Dean. W(2001). Applied Multivariate Statistical Analysis, Prentice Hall International Inc., Englewood Cliffs, New Jersey.

Mangasarian O. L and Wolberg W. H: "Cancer diagnosis via linear programming", SIAM News, Volume 23, Number 5, September 1990, pp 1 & 18

Mclachlan, G. J. (1992). Discriminant Analysis and Statistical Pattern. John Wiley, New York

Patrick, E. A. (1972) Fundamentals of Pattern Recognition, Prentice Hall, Englewood Cliffs, New Jersey.

Precha Thavikulwat(2008) Affinity Propagation: A Clustering Algorithm For Computer- Assisted Business Simulations And Experiential Exercises, Developments in Business Simulation and Experiential Learning, Volume 35.

Shailendra Kumar S, J.L. Rana and Jain. R. C. Fast Affinity Propagation Clustering based on Machine Learning, IJCSI International Journal of Computer Science Issues, Vol. 10

William H. Wolberg and O.L. Mangasarian: Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proceedings of the National Academy of Sciences, U.S.A., Volume 87, December 1990, pp 9193-9196.


پاراگلایدر Full Text: PDF

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.

ISSN : 2251-1563