A novel representative k-NN sampling-based clustering approach for an effective dimensionality reduction-based visualization of dynamic data
A novel representative k-NN sampling-based clustering approach for an effective dimensionality reduction-based visualization of dynamic data
| dc.contributor.author | Bheekya, Dharamsotu | |
| dc.contributor.author | Rani, Kanakapodi Swarupa | |
| dc.contributor.author | Moiz, Salman Abdul | |
| dc.contributor.author | Rao, Chillarige Raghavendra | |
| dc.date.accessioned | 2022-03-27T05:50:56Z | |
| dc.date.available | 2022-03-27T05:50:56Z | |
| dc.date.issued | 2020-01-01 | |
| dc.description.abstract | Visualization plays a crucial role in the exploratory analysis of Big Data. The direct visualization of Big Data is a challenging task and difficult to analyze. Dimensionality Reduction techniques extract the features in the context of visualization. Due to the unsupervised and non-parametric nature, most of the dimensionality reduction techniques are not evaluated quantitatively and not allowed to extend for dynamic data. The proposed representative k-NN sampling-based clustering, determines the underlying structure of the data by using well-known clustering techniques. The external cluster validation index determines the order sequence of clustering techniques from which the appropriate cluster techniques are recommended for the given datasets. From the recommended set, the samples of the best clustering technique are considered as representative samples which can be used for generating the visual representation. The t-Distributed Stochastic Neighbor Embedding (t-SNE) algorithm is applied to generate a low-dimensional embedding model of representative samples, which is more suitable for visualization. The new data samples are added to the generated model by using the interpolation technique. The low-dimensional embedding results are quantitatively evaluated by k-NN accuracy and trustworthiness. The performance analysis of representative k-NN sampling-based clustering results and embedding results accomplished by seven differently characterized datasets. | |
| dc.identifier.citation | Advances in Science, Technology and Engineering Systems. v.5(4) | |
| dc.identifier.uri | 10.25046/aj050402 | |
| dc.identifier.uri | https://astesj.com/v05/i04/p02/ | |
| dc.identifier.uri | https://dspace.uohyd.ac.in/handle/1/8285 | |
| dc.subject | Cluster validation index | |
| dc.subject | Clustering | |
| dc.subject | Dimensionality reduction | |
| dc.subject | Exploratory analysis | |
| dc.subject | Interpolation | |
| dc.subject | Sampling | |
| dc.subject | T-distributed stochastic neighbor embedding | |
| dc.subject | Visualization | |
| dc.title | A novel representative k-NN sampling-based clustering approach for an effective dimensionality reduction-based visualization of dynamic data | |
| dc.type | Journal. Article | |
| dspace.entity.type |
Files
License bundle
1 - 1 of 1