A novel representative k-NN sampling-based clustering approach for an effective dimensionality reduction-based visualization of dynamic data

Date
2020-01-01
Authors
Bheekya, Dharamsotu
Rani, Kanakapodi Swarupa
Moiz, Salman Abdul
Rao, Chillarige Raghavendra
Abstract
Visualization plays a crucial role in the exploratory analysis of Big Data. Direct visualization of Big Data is challenging, and the results are difficult to analyze. Dimensionality reduction techniques extract features for visualization. Because of their unsupervised and non-parametric nature, most dimensionality reduction techniques are not evaluated quantitatively and cannot be extended to dynamic data. The proposed representative k-NN sampling-based clustering determines the underlying structure of the data by using well-known clustering techniques. An external cluster validation index ranks the clustering techniques, and the appropriate techniques are recommended for the given datasets from this ranking. From the recommended set, the samples of the best clustering technique are taken as representative samples, which are used to generate the visual representation. The t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm is applied to generate a low-dimensional embedding model of the representative samples, which is more suitable for visualization. New data samples are added to the generated model by using an interpolation technique. The low-dimensional embedding results are quantitatively evaluated by k-NN accuracy and trustworthiness. The performance of the representative k-NN sampling-based clustering results and of the embedding results is analyzed on seven datasets with different characteristics.
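The abstract does not say which interpolation scheme places new samples in the existing embedding, so the following is only a minimal sketch under stated assumptions: a new sample is positioned at the inverse-distance weighted mean of the embedded positions of its k nearest representatives, and the embedding is then scored by leave-one-out k-NN accuracy. The function names (`interpolate_embedding`, `knn_accuracy`), the weighting rule, and the toy data are hypothetical illustrations and are not taken from the paper.

```python
import math
from collections import Counter


def k_nearest(query, points, k):
    """Return indices of the k points closest to `query` (Euclidean distance)."""
    order = sorted(range(len(points)), key=lambda i: math.dist(query, points[i]))
    return order[:k]


def interpolate_embedding(new_x, reps_high, reps_low, k=3, eps=1e-12):
    """Place a new sample in the low-dimensional map.

    Assumption (not from the paper): the position is the inverse-distance
    weighted mean of the embedded positions of the k nearest representatives.
    """
    idx = k_nearest(new_x, reps_high, k)
    weights = [1.0 / (math.dist(new_x, reps_high[i]) + eps) for i in idx]
    total = sum(weights)
    dim = len(reps_low[0])
    return [
        sum(w * reps_low[i][d] for w, i in zip(weights, idx)) / total
        for d in range(dim)
    ]


def knn_accuracy(embedding, labels, k=1):
    """Leave-one-out k-NN classification accuracy measured in the embedded space."""
    correct = 0
    for i, point in enumerate(embedding):
        others = [j for j in range(len(embedding)) if j != i]
        nearest = sorted(others, key=lambda j: math.dist(point, embedding[j]))[:k]
        vote = Counter(labels[j] for j in nearest).most_common(1)[0][0]
        correct += vote == labels[i]
    return correct / len(embedding)


# Toy data: four representatives in 3-D with a made-up 2-D embedding.
reps_high = [[0, 0, 0], [0, 1, 0], [5, 5, 5], [5, 6, 5]]
reps_low = [[0, 0], [0, 1], [10, 10], [10, 11]]
labels = ["a", "a", "b", "b"]

new_point = interpolate_embedding([0, 0.5, 0], reps_high, reps_low, k=2)
score = knn_accuracy(reps_low, labels, k=1)
```

In a fuller pipeline the `reps_low` coordinates would come from running t-SNE on the representative samples rather than being written by hand, and trustworthiness would be computed alongside the k-NN accuracy.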
Keywords
Cluster validation index, Clustering, Dimensionality reduction, Exploratory analysis, Interpolation, Sampling, t-distributed stochastic neighbor embedding, Visualization
Citation
Advances in Science, Technology and Engineering Systems. v.5(4)