Scalability of parallel genetic algorithm for two-mode clustering
Date
2014-05-01
Authors
Deb, Briti
Srirama, Satish Narayana
Abstract
A data matrix whose rows and columns contain the same set of entities is known as a one-mode data matrix, and traditional one-mode clustering algorithms can be used to cluster its rows (or columns) separately. As two-mode data matrices, in which the rows and columns contain different sets of entities, have become popular, the need for simultaneous clustering of rows and columns, known as two-mode clustering, has increased. In addition, the emergence of large data sets and the predicted slowdown of Moore's law have made clustering scalability a challenge. In this paper, we address the scalability of organizing an unlabelled two-mode data set into clusters using a multicore processor. We propose a two-mode clustering algorithm based on a parallel genetic algorithm (GA) heuristic, adapted from the classical Cuthill-McKee Matrix Bandwidth Minimization (MBM) algorithm. The classical MBM method aims to reduce the bandwidth of a sparse symmetric matrix; we adapt it to non-symmetric real-valued matrices. Preliminary results indicate that our algorithm scales on a multicore processor compared with a serial implementation. Future work will include more extensive experiments and evaluation of the system.
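To illustrate the bandwidth objective mentioned in the abstract, the following is a minimal sketch, not the authors' algorithm. It assumes that the bandwidth of a rectangular matrix is the largest |i - j| over its nonzero entries, and it substitutes an exhaustive search over row and column orderings for the parallel GA described in the paper. The function names `bandwidth`, `reorder`, and `best_reordering` and the sample matrix are hypothetical.

```python
from itertools import permutations


def bandwidth(matrix):
    """Largest |i - j| over nonzero entries (0 for an all-zero matrix).

    Assumption: this simple definition is used for illustration only;
    the paper's adapted objective for real-valued matrices may differ.
    """
    return max(
        (abs(i - j)
         for i, row in enumerate(matrix)
         for j, value in enumerate(row)
         if value != 0),
        default=0,
    )


def reorder(matrix, row_order, col_order):
    """Return a copy of the matrix with rows and columns permuted."""
    return [[matrix[r][c] for c in col_order] for r in row_order]


def best_reordering(matrix):
    """Exhaustive stand-in for the GA search; feasible only for tiny inputs."""
    n_rows, n_cols = len(matrix), len(matrix[0])
    best = None
    for rows in permutations(range(n_rows)):
        for cols in permutations(range(n_cols)):
            bw = bandwidth(reorder(matrix, rows, cols))
            if best is None or bw < best[0]:
                best = (bw, rows, cols)
    return best


# Hypothetical two-mode matrix: rows and columns are different entity sets.
# Rows 0 and 2 relate to columns 1 and 3; row 1 relates to columns 0 and 2.
data = [
    [0.0, 2.5, 0.0, 1.0],
    [3.0, 0.0, 1.5, 0.0],
    [0.0, 4.0, 0.0, 2.0],
]

if __name__ == "__main__":
    print("original bandwidth:", bandwidth(data))
    bw, rows, cols = best_reordering(data)
    print("reordered bandwidth:", bw)
    for line in reorder(data, rows, cols):
        print(line)
```

Reordering both modes at once pulls the nonzero entries toward the diagonal, so related rows and columns end up adjacent and can be read off as blocks. Exhaustive search grows factorially with matrix size, which is why a heuristic such as a GA is needed for realistic data.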
Keywords
Matrix reordering,
Parallel genetic algorithm,
Scalability,
Two-mode clustering
Citation
International Journal of Computers and Applications. v.94(14)