K-NN Sampling for Visualization of Dynamic Data Using LION-tSNE

dc.contributor.author Dharamsotu, Bheekya
dc.contributor.author Rani, K. Swarupa
dc.contributor.author Moiz, Salman Abdul
dc.contributor.author Rao, C. Raghavendra
dc.date.accessioned 2022-03-27T06:02:33Z
dc.date.available 2022-03-27T06:02:33Z
dc.date.issued 2019-12-01
dc.description.abstract Dimensionality reduction algorithms are often used to visualize multi-dimensional data, which are mostly non-parametric. Non-parametric methods do not provide any explicit intuition for adding new data points into an existing environment which limits the applicability of visualization for Big Data scenario. The LION-tSNE (Local Interpolation with Outlier coNtrol t-Distributed Stochastic Neighbor Embedding) method was proposed to overcome the limitations of existing techniques. The LION-tSNE algorithm uses random sampling method for tSNE model design which creates an initial visual environment then new data points are added to this environment using local-IDW(Inverse Distance Weighting) interpolation method. The randomly selected sample data often suffer from non-representativeness of the whole data which creates inconsistency in the tSNE environment. To overcome this problem two new sampling methods are proposed which are based on k-NN (k-Nearest Neighbor) graph update properties. It is empirically shown that proposed methods outperform existing LION-tSNE method with 0.5 to 2% more k-NN accuracy and results are more consistent. The study is done on five differently characterized datasets with three different initial solutions of tSNE. The proposed method results are statistically significant which is done by statistical method pairwise t-test.
dc.identifier.citation Proceedings - 26th IEEE International Conference on High Performance Computing, HiPC 2019
dc.identifier.uri 10.1109/HiPC.2019.00019
dc.identifier.uri https://ieeexplore.ieee.org/document/8990391/
dc.identifier.uri https://dspace.uohyd.ac.in/handle/1/9182
dc.subject Big Data
dc.subject Dimensionality reduction
dc.subject Interpolation
dc.subject k-NN graph
dc.subject Sampling
dc.subject t-Distributed Stochastic Neighbor Embedding
dc.subject visualizatio
dc.title K-NN Sampling for Visualization of Dynamic Data Using LION-tSNE
dc.type Conference Proceeding. Conference Paper
dspace.entity.type
Files
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Plain Text
Description: