An Improved Bisecting K-means Algorithm for Text Document Clustering
Published: 2016
Author(s) Name: Janani Balakumar |
Author(s) Affiliation: Bharathiar University, Coimbatore, Tamil Nadu, India
Locked
Subscribed
Available for All
Abstract
Cluster analysis is an unsupervised learning approach that aims to group the objects into different groups or clusters. So that each cluster can contain similar objects with respect to any predefined condition. Text document clustering is the important technique of text mining in efficiently organizing the large volume of documents into a small number of significant clusters. The main objective of this research work is to cluster the collection of documents into related groups based on the contents of the particular documents. In order to perform this clustering task, this research work makes use of two existing algorithms, namely K-means and Bisecting K-means algorithm, and also this research work proposes a new clustering algorithm namely Enhanced-Bisecting K-means algorithm. From the experimental results it is observed that the proposed algorithm gives the better clustering accuracy than other algorithms.
Keywords: Text Mining, Text Document Clustering,K-means, Bisecting K-means, Enhanced Bisecting K-means
View PDF