Clustering High-Dimensional Data: A Reduction-Level Fusion of PCA and Random Projection

dc.contributor.author Pasunuri, Raghunadh
dc.contributor.author Venkaiah, Vadlamudi China
dc.contributor.author Srivastava, Amit
dc.date.accessioned 2022-03-27T05:51:01Z
dc.date.available 2022-03-27T05:51:01Z
dc.date.issued 2019-01-01
dc.description.abstract Principal Component Analysis (PCA) is a very famous statistical tool for representing the data within lower dimension embedding. K-means is a prototype (centroid)-based clustering technique used in unsupervised learning tasks. Random Projection (RP) is another widely used technique for reducing the dimensionality. RP uses projection matrix to project the data into a feature space. Here, we prove the effectiveness of these methods by combining them for efficiently clustering the low as well as high-dimensional data. Our proposed algorithms works by combining Principal Component Analysis (PCA) with Random Projection (RP) to project the data into feature space, then performs K-means clustering on that reduced space (feature space). We compare the proposed algorithm’s performance with simple K-means and PCA-K-means algorithms on 12 benchmark datasets. Of these, 4 are low-dimensional and 8 are high-dimensional datasets. Our proposed algorithms outperform the other methods.
dc.identifier.citation Advances in Intelligent Systems and Computing. v.740
dc.identifier.issn 21945357
dc.identifier.uri 10.1007/978-981-13-1280-9_44
dc.identifier.uri http://link.springer.com/10.1007/978-981-13-1280-9_44
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8308
dc.subject Clustering
dc.subject High-dimensional data
dc.subject K-means
dc.subject Principal component analysis
dc.subject Random projection
dc.title Clustering High-Dimensional Data: A Reduction-Level Fusion of PCA and Random Projection
dc.type Book Series. Book Chapter
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: