COMPARISON OF CLUSTERING ALGORITHMS: A PRACTICAL APPROACH
Ms Meenal Sharma
First Published December 18,2022
Authors
Ms Meenal Sharma
Affiliation
Assistant Professor, International School of Informatics & Management, Jaipu
Abstract
Data mining is a technique used for retrieving information and data from the database. This paper
focuses on clustering, a technique which is used to divide the data into interdependent clusters that
are of similar type. Clustering is a way of classifying and breaking down the data, and putting the
similar type of meaningful data/objects into a single group or cluster. The technique follows the most
common approach of finding centre of clusters and input vectors, which help to identify which cluster
centre belongs to which input vector. Hierarchical clustering is a technique of clustering the data
using predetermined way of clustering, either it is top to bottom or it is bottom up approach. It is
further divided into two ways: Agglomerative (Bottom-up approach) and Divisive(Top-down
approach). K-Means clustering is a technique which makes various partitions of the given data from n
observations of objects into k clusters, where each of the object belongs to a particular cluster of
nearest mean(also known as center of clusters).
In this paper road accidents dataset is used that contains the records of road accidents occurred in
the India on day to day basis. The dataset holds the records of the accidents happened in each state
of India. This dataset is used because the number of accidents happening daily in the India is very big
in count and the accidents happen are of different types like fatal accidents, major accidents, noninjury
accidents, etc. This paper uses clustering algorithm to differentiate the categories of accidents
in the dataset. (Ambigavathi, M., & Sridharan, D. 2020).
Narendra, S., Aman, B., & Ratnesh, L. (2012). Comparison the various clustering algorithms of weka. International Journal of Emerging Technology and Advanced Engineering, 2(5), 80.
Pamulaparty, L. (2016). Cluster analysis of medical research data using R. Global Journal of Computer Science and Technology, 16(C1), 17-22.
Singh, N., & Singh, D. (2012). Performance evaluation of k-means and heirarichal clustering in terms of accuracy and running time. IJCSIT) International Journal of Computer Science and Information Technologies, 3(3), 4119-4121.
Siddiqui, F. U., & Isa, N. A. M. (2011). Enhanced moving K-means (EMKM) algorithm for image segmentation. IEEE Transactions on Consumer Electronics, 57(2), 833-841.
Verma, M., Srivastava, M., Chack, N., Diswar, A. K., & Gupta, N. (2012). A comparative study of various clustering algorithms in data mining. International Journal of Engineering Research and Applications (IJERA), 2(3), 1379-1384.
Joshi, A., & Kaur, R. (2013). A review: Comparative study of various clustering techniques in data mining. International Journal of Advanced Research in Computer Science and Software Engineering, 3(3), 55-57.
Pande, S. R., Sambare, S. S., & Thakre, V. M. (2012). Data clustering using data mining techniques. International Journal of advanced research in computer and communication engineering, 1(8), 494-9.
Arulanandham, A., Arumugam, S., Dinesh, G., Thirukkumaran, R., & Subashmoorthy, R. (2022). Analysis of classification and clustering techniques for ambient AQI using machine learning algorithms. In 2022 4th International Conference on Smart Systems and Inventive Technology (ICSSIT) (pp. 902-908). IEEE.
Xu, H. (2022). Research on clustering algorithms in data mining. In 2022 3rd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE) (pp. 652-655). IEEE..
Oyelade, J., Isewon, I., Oladipupo, O., Emebo, O., Omogbadegun, Z., Aromolaran, O., ... & Olawole, O. (2019). Data clustering: Algorithms and its applications. In 2019 19th International Conference on Computational Science and Its Applications (ICCSA) (pp. 71-81). IEEE.
Sonawane, R. C., & Patil, H. D. (2020). Clustering Techniques and Research Challenages in Machine Learning. In 2020 Fourth International Conference on Computing Methodologies and Communication (ICCMC) (pp. 290-293). IEEE.
Ambigavathi, M., & Sridharan, D. (2020). Analysis of clustering algorithms in machine learning for healthcare data. In Advances in Computing and Data Sciences: 4th International Conference, ICACDS 2020, Valletta, Malta, April 24–25, 2020, Revised Selected Papers 4 (pp. 117-128). Springer Singapore.
Ms Meenal Sharma.
(2022) ‘COMPARISON OF CLUSTERING ALGORITHMS: A PRACTICAL APPROACH’, Assistant Professor, International School of Informatics & Management, Jaipu,
Case Report, pp. 54–63. doi: