Conferences >2017 IEEE/ACM International C...

A streaming clustering approach using a heterogeneous system for big data analysis

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Data clustering is a fundamental challenge in data analytics. It is the main task in exploratory data mining and a core technique in machine learning. As the volume, vari...Show More

Metadata

Abstract:

Data clustering is a fundamental challenge in data analytics. It is the main task in exploratory data mining and a core technique in machine learning. As the volume, variety, velocity, and variability of data grows, we need more efficient data analysis methods that can scale towards increasingly large and high dimensional data sets. We develop a streaming clustering algorithm that is highly amenable to hardware acceleration. Our algorithm eliminates the need to store the data objects, which removes limits on the size of the data that we can analyze. Our algorithm is highly parameterizable, which allows it to fit to the characteristics of the data set, and scale towards the available hardware resources. Our streaming hardware core can handle more than 40 Msamples/s when processing 3-dimensional streaming data and up to 1.78 Msamples/s for 70-dimensional data. To validate the accuracy and performance of our algorithms we compare it with several common clustering techniques on several different applications. The experimental result shows that it outperforms other prior hardware accelerated clustering systems.

Published in: 2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Date of Conference: 13-16 November 2017

Date Added to IEEE Xplore: 14 December 2017

ISBN Information:

Electronic ISSN: 1558-2434

DOI: 10.1109/ICCAD.2017.8203845

Conference Location: Irvine, CA, USA

Contents

References is not available for this document.

A streaming clustering approach using a heterogeneous system for big data analysis

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

A streaming clustering approach using a heterogeneous system for big data analysis

Alerts

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?