A novel representative k-NN sampling-based clustering approach for an effective dimensionality reduction-based visualization of dynamic data

dc.contributor.author Bheekya, Dharamsotu
dc.contributor.author Rani, Kanakapodi Swarupa
dc.contributor.author Moiz, Salman Abdul
dc.contributor.author Rao, Chillarige Raghavendra
dc.date.accessioned 2022-03-27T05:58:47Z
dc.date.available 2022-03-27T05:58:47Z
dc.date.issued 2020-01-01
dc.description.abstract Visualization plays a crucial role in the exploratory analysis of Big Data. The direct visualization of Big Data is a challenging task and difficult to analyze. Dimensionality Reduction techniques extract the features in the context of visualization. Due to the unsupervised and non-parametric nature, most of the dimensionality reduction techniques are not evaluated quantitatively and not allowed to extend for dynamic data. The proposed representative k-NN sampling-based clustering, determines the underlying structure of the data by using well-known clustering techniques. The external cluster validation index determines the order sequence of clustering techniques from which the appropriate cluster techniques are recommended for the given datasets. From the recommended set, the samples of the best clustering technique are considered as representative samples which can be used for generating the visual representation. The t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm is applied to generate a low-dimensional embedding model of representative samples, which is more suitable for visualization. The new data samples are added to the generated model by using the interpolation technique. The low-dimensional embedding results are quantitatively evaluated by k-NN accuracy and trustworthiness. The performance analysis of representative k-NN sampling-based clustering results and embedding results accomplished by seven differently characterized datasets.
dc.identifier.citation Advances in Science, Technology and Engineering Systems. v.5(4)
dc.identifier.uri 10.25046/aj050402
dc.identifier.uri https://astesj.com/v05/i04/p02/
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/8995
dc.subject Cluster validation index
dc.subject Clustering
dc.subject Dimensionality reduction
dc.subject Exploratory analysis
dc.subject Interpolation
dc.subject Sampling
dc.subject T-distributed stochastic neighbor embedding
dc.subject Visualization
dc.title A novel representative k-NN sampling-based clustering approach for an effective dimensionality reduction-based visualization of dynamic data
dc.type Journal. Article
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: