- Title
- Fast modified global k-means algorithm for incremental cluster construction
- Creator
- Bagirov, Adil; Ugon, Julien; Webb, Dean
- Date
- 2011
- Type
- Text; Journal article
- Identifier
- http://researchonline.federation.edu.au/vital/access/HandleResolver/1959.17/69428
- Identifier
- vital:3822
- Identifier
- ISSN:0031-3203
- Abstract
- The k-means algorithm and its variations are known to be fast clustering algorithms. However, they are sensitive to the choice of starting points and are inefficient for solving clustering problems in large datasets. Recently, incremental approaches have been developed to resolve difficulties with the choice of starting points. The global k-means and the modified global k-means algorithms are based on such an approach. They iteratively add one cluster center at a time. Numerical experiments show that these algorithms considerably improve the k-means algorithm. However, they require storing the whole affinity matrix or computing this matrix at each iteration. This makes both algorithms time consuming and memory demanding for clustering even moderately large datasets. In this paper, a new version of the modified global k-means algorithm is proposed. We introduce an auxiliary cluster function to generate a set of starting points lying in different parts of the dataset. We exploit information gathered in previous iterations of the incremental algorithm to eliminate the need of computing or storing the whole affinity matrix and thereby to reduce computational effort and memory usage. Results of numerical experiments on six standard datasets demonstrate that the new algorithm is more efficient than the global and the modified global k-means algorithms. © 2010 Elsevier Ltd. All rights reserved.
- Relation
- Pattern Recognition Vol. 44, no. 4 (2011), p. 866-876; http://purl.org/au-research/grants/arc/DP0666061
- Rights
- This metadata is freely available under a CCO license
- Subject
- Global k-means algorithm; k-means algorithm; Minimum sum-of-squares clustering; Nonsmooth optimization; Affinity matrix; Cluster centers; Cluster construction; Cluster functions; Clustering problems; Computational effort; Data sets; Global K-means algorithm; Incremental algorithm; Incremental approach; K-means; k-Means algorithm; Large datasets; matrix; Memory usage; Modified Global; Numerical experiments; Iterative methods; Matrix algebra; Optimization; Clustering algorithms
- Reviewed
- Hits: 8037
- Visitors: 7972
- Downloads: 1
Thumbnail | File | Description | Size | Format |
---|