Abstract
This chapter surveys recent developments in the area of multimedia big data, the biggest big data. One core problem is how to best process this multimedia big data in an efficient and scalable way. We outline examples of the use of the MapReduce framework, including Hadoop, which has become the most common approach to a truly scalable and efficient framework for common multimedia processing tasks, e.g., content analysis and retrieval. We also examine recent developments on deep learning which has produced promising results in large-scale multimedia processing and retrieval. Overall the focus has been on empirical studies rather than the theoretical so as to highlight the most practically successful recent developments and highlight the associated caveats or lessons learned.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
(2015) Special issue on multimedia: the biggest big data. IEEE Trans Multimed 17(1):144
Smith JR (2013) Riding the multimedia big data wave. In: Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’13), Dublin, 28 July–01 Aug, pp 1–2
Video correlation and analysis suite from IBM. http://www.ibm.com/smarterplanet/ie/en/smarter_cities/solutions/solution/A863656S12083Q41.html
Chang EY (2011) Foundations of large-scale multimedia information management and retrieval: mathematics of perception. Springer, New York
McFee B, Lanckriet G (2011) Learning multi-modal similarity. J Mach Learn Res 12:491–523
Burnett I, Van de Walle R, Hill K, Bormans J, Pereira F (2003) MPEG-21: goals and achievements. IEEE MultiMed 10(4):60–70
Gurrin C, Smeaton AF, Doherty AR (2014) LifeLogging: personal big data. Found Trends Inf Retr 8(1):1–125
Moise D, Shestakov D, Gudmundsson G, Amsaleg L (2013) Indexing and searching 100M images with map-reduce. In: Proceedings of the 3rd ACM Conference on International Conference on Multimedia Retrieval (ICMR ’13), Dallas, 16–19 Apr, pp 17–24
Krishna M, Kannan B, Ramani A, Sathish SJ (2010) Implementation and performance evaluation of a hybrid distributed system for storing and processing images from the web. In: 2010 IEEE Second International Conference on Cloud Computing Technology and Science (CloudCom), Indianapolis, 30 Nov-03 Dec, pp 762–767
Meeker M (2014) Internet Trends 2014 – Code Conference
Chen SY, Lai CF, Hwang RH, Chao HC, Huang YM (2014) A multimedia parallel processing approach on GPU MapReduce framework. In: Proceedings of the 7th International Conference on Ubi-Media Computing and Workshops (UMEDIA), Ulaanbaatar, 12–14 July, pp 154–159
He B, Fang W, Luo Q, Govindaraju NK, Wang T (2008) Mars: a MapReduce framework on graphics processors. In: Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques (PACT’08), Toronto, 25–29 Oct, pp 260–269
Wang H, Shen Y, Wang L, Zhufeng K, Wang W, Cheng C (2012) Large-scale multimedia data mining using MapReduce framework. In: IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom’12), Taipei, 3–6 Dec, pp 287–292
Mera D, Batko M, Zezula P (2014) Towards fast multimedia feature extraction: Hadoop or storm. In: IEEE International Symposium on Multimedia (ISM’14), Taichung, 10–12 Dec, pp 106–109
Deng L (2014) A tutorial survey of architectures, algorithms, and applications for deep learning. APSIPA Trans Signal Inf Process 3:e2
Cadieu CF, Hong H, Yamins DLK, Pinto N, Ardila D et al (2014) Deep neural networks rival the representation of primate IT cortex for core visual object recognition. PLoS Comput Biol 10(12):e1003963
Hinton GE, Osindero S, Teh Y (2006) A fast learning algorithm for deep belief nets. Neural Comput 18:1527–1554
Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Proceedings of the Advances in Neural Information Processing Systems (NIPS’12), Lake Tahoe, Nevada
Hinton G, Deng L, Yu D, Dahl GE, Mohamed A, Jaitly N, Senior A, Vanhoucke V, Nguyen P, Sainath TN, Kingsbury B (2012) Deep neural networks for acoustic modeling in speech recognition: the shared views of four research groups. IEEE Signal Process Mag 29(6):82–97
Chen X-W, Lin X (2014) Big data deep learning: challenges and perspectives. IEEE Access 2:514–525
Ciresan D, Giusti A, Gambardella L, Schidhuber J (2012) Deep neural networks segment neuronal membranes in electron microscopy images. In: Proceedings of the Advances in Neural Information Processing Systems (NIPS’12), Lake Tahoe, 03–08 Dec, pp 2852–2860
Zeiler M, Fergus R (2013) Stochastic pooling for regularization of deep convolutional neural networks. CoRR, abs/1301.3557
Wan J, Wang D, Hoi SCH, Wu P, Zhu J, Zhang Y, Li J (2014) Deep learning for content-based image retrieval: a comprehensive study. In: Proceedings of the ACM international conference on multimedia (MM’14), Orlando. ACM, New York, pp 157–166
High-performance C++/CUDA implementation of convolutional neural networks. https://code.google.com/p/cuda-convnet/
Hua J, Shao J, Tian H, Zhao Z, Su F, Cai A (2014) An output aggregation system for large scale cross-modal retrieval. In: IEEE International Conference on Multimedia and Expo Workshops (ICMEW’14), Chengdu, 14–18 July 2014, pp 1–6
MSR-Bing Image Retrieval Challenge. http://research.microsoft.com/en-US/projects/irc/acmmm2014.aspx
Nair V, Hinton G (2009) 3-D object recognition with deep belief nets. In: Proceedings of the Advances in Neural Information Processing Systems (NIPS’12), Lake Tahoe, 03–08 Dec, pp 1339–1347
ISBI 2012 EM Segmentation Challenge. http://brainiac2.mit.edu/isbi_challenge/
Ngiam J, Khosla A, Kim M, Nam J, Lee H, Ng A (2011) Multimodal deep learning. In: Proceedings of the 28th International Conference on Machine Learning (ICML11), Bellevue, USA, 28 June-02 July, pp 689–696
Coates A, Huval B, Wang T, Wu D, Ng A, Catanzaro B (2013) Deep learning with COTS HPC systems. In: Proceedings of the 30th International Conference on Machine Learning (ICML13), Atlanta, 16–21 June, pp 1337–1345
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Hayes, J. (2015). Multimedia Big Data: Content Analysis and Retrieval. In: Trovati, M., Hill, R., Anjum, A., Zhu, S., Liu, L. (eds) Big-Data Analytics and Cloud Computing. Springer, Cham. https://doi.org/10.1007/978-3-319-25313-8_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-25313-8_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25311-4
Online ISBN: 978-3-319-25313-8
eBook Packages: Computer ScienceComputer Science (R0)