Conferences >2022 IEEE International Confe...

FVec2vec: A Fast Nonlinear Dimensionality Reduction Approach for General Data

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Dimensionality reduction is a fundamental technique to address the curse of dimensionality problem in real-world big datasets. However, most existing methods either only ...Show More

Metadata

Abstract:

Dimensionality reduction is a fundamental technique to address the curse of dimensionality problem in real-world big datasets. However, most existing methods either only target raw datasets that contain explicit relationships between data points, or construct the complete neighborhood graph of the dataset by calculating pairwise similarities, and then generate contexts of data points by random walking to measure the structure of the dataset, which are computationally expensive. In this paper, we propose a fast nonlinear locality-preserving dimensionality reduction approach called FVec2vec, which extends the Skip-gram model to embedding representation of general numerical matrices. Specifically, instead of constructing neighborhood graph by calculating pairwise similarities between data points, we approximate the k-nearest neighbors (kNN) of each data point in matrices by exploring its neighbors’ neighbors first. Then, we design a novel sampling algorithm to randomly sample on the kNN to depict the structure of the dataset. Experimental results show that FVec2vec is faster than most existing methods while achieving acceptable accuracy, and the accuracy is even higher than the state-of-the-art method under certain similarity metrics.

Published in: 2022 IEEE International Conference on Big Data (Big Data)

Date of Conference: 17-20 December 2022

Date Added to IEEE Xplore: 26 January 2023

ISBN Information:

DOI: 10.1109/BigData55660.2022.10020682

Conference Location: Osaka, Japan

Funding Agency:

Contents

References is not available for this document.

FVec2vec: A Fast Nonlinear Dimensionality Reduction Approach for General Data

Abstract:

Metadata

Abstract:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

FVec2vec: A Fast Nonlinear Dimensionality Reduction Approach for General Data

Alerts

Abstract:

Metadata

Abstract:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?