Abstract:
To handle emerging complex data at massive scale from web, social network, and sensor network etc., “big data analytics” and “big data management” areas are emerging. Man...Show MoreMetadata
Abstract:
To handle emerging complex data at massive scale from web, social network, and sensor network etc., “big data analytics” and “big data management” areas are emerging. Many traditional assumptions are not working, instead, new query and programming interfaces are required, and new computing models are emerging. The tutorial will focus on data mining and machine learning algorithms for analyzing very large amounts of data or Big data. Map Reduce and No SQL system will be used as tools/standards for creating parallel algorithms that can process very large amounts of data. The following concepts will be covered: Hadoop, Mapreduce, NoSQL systems (Cassandra, Pig, Hive, BigTable, HBASE), Storm, Spark, Large scale supervised machine learning, Data streams, Clustering, and Applications including recommendation systems, Web and security.
Published in: Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014)
Date of Conference: 13-15 August 2014
Date Added to IEEE Xplore: 02 March 2015
Electronic ISBN:978-1-4799-5880-1