Abstract
In this paper a short overview on recent research efforts for digital media analysis and description using neural networks is given. Neural networks are very powerful in analyzing, representing and classifying digital media content through various architectures and learning algorithms. Both unsupervised and supervised algorithms can be used for digital media feature extraction. Digital media representation can be done either in a synaptic level or at the output level. The specific problem that is used as a case study for digital media analysis is the human-centered video analysis for activity and identity recognition. Several neural network topologies, such as self organizing maps, independent subspace analysis, multi-layer perceptrons, extreme learning machines and deep learning architectures are presented and results on human activity recognition are reported.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kyperountas, M., Tefas, A., Pitas, I.: Dynamic training using multistage clustering for face recognition. Pattern Recognition, 894–905 (2008)
Kyperountas, M., Tefas, A., Pitas, I.: Salient feature and reliable classifier selection for facial expression classification. Pattern Recognition, 972–986 (2010)
Gkalelis, N., Tefas, A., Pitas, I.: Combining fuzzy vector quantization with linear discriminant analysis for continuous human movement recognition. IEEE Transactions on Circuits and Systems for Video Technology, 1511–1521 (2008)
Bengio, Y., Courville, A.C., Vincent, P.: Unsupervised Feature Learning and Deep Learning: A Review and New Perspectives. Arxiv (2012)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet Classification with Deep Convolutional Neural Networks. Neural Information Processing Systems (2012)
Bordes, A., Glorot, X., Weston, J., Bengio, Y.: Joint Learning of Words and Meaning Representations for Open-Text Semantic Parsing. In: Proceedings of the 15th International Conference on Artificial Intelligence and Statistics, AISTATS (2012)
Le, Q.V., Ranzato, M., Monga, R., Devin, M., Chen, K., Corrado, G.S., Dean, J., Ng, A.Y.: Building High-level Features Using Large Scale Unsupervised Learning. In: ICML (2012)
Goodfellow, I., Courville, A., Bengio, Y.: Large-Scale Feature Learning With Spike-and-Slab Sparse Coding. In: ICML (2012)
Iosifidis, A., Tefas, A., Pitas, I.: View-invariant action recognition based on Artificial Neural Networks. IEEE Transactions on Neural Networks and Learning Systems 23(3), 412–424 (2012)
Iosifidis, A., Tefas, A., Pitas, I.: Person Identification from Actions based on Artificial Neural Networks. In: Computational Intelligence in Biometrics and Identity Management, Singapore. Symposium Series on Computational Intelligence, SSCI (2013)
Le, Q.V., Zou, W.Y., Yeung, S.Y., Ng, A.Y.: Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3361–3368. IEEE Press, Colorado (2011)
Kapsouras, I., Karanikolos, S., Nikolaidis, N., Tefas, A.: Feature Comparison and Feature Fusion for Traditional Dances Recognition. In: 14th Engineering Applications of Neural Networks Conference, Halkidiki (2013)
Haykin, S.: Neural Networks and Learning Machines. Upper Saddle River, New Jersey (2008)
Huang, G.B., Chen, L., Siew, C.: Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Transactions on Neural Networks 17(4), 879–892 (2006)
Huang, G.B., Zhou, H., Ding, X., Zhang, R.: Extreme Learning Machine for Regressiona and Multiclass Classification. IEEE Transactions on Systems, Man and Cybernetics, Part B: Cybernetics 42(2), 513–529 (2012)
Minhas, R., Baradarani, S., Seifzadeh, S., Wu, Q.J.: Human action recognition using extreme learning machine based on visual vocabularies. Neurocomputing 73(10-12), 1906–1917 (2010)
Iosifidis, A., Tefas, A., Pitas, I.: Multi-view Human Action Recognition under Occlusion based on Fuzzy Distances and Neural Networks. In: European Signal Processing Conference, pp. 1129–1133 (2012)
Iosifidis, A., Tefas, A., Pitas, I.: Minimum Class Variance Extreme Learning Machine for Human Action Recognition. IEEE Transactions on Circuits and Systems for Video Technology (accepted, 2013)
Iosifidis, A., Tefas, A., Pitas, I.: Dynamic action recognition based on dynemes and Extreme Learning Machine. Pattern Recognition Letters (accepted, 2013)
Iosifidis, A., Tefas, A., Pitas, I.: Dynamic Action Classification Based on Iterative Data Selection and Feedfforward Neural Networks. In: European Signal Processing Conference (accepted, 2013)
Scholkopf, B., Smola, A.J.: Learning with kernels: Support vector machines, regularization, optimization, and beyond. MIT Press (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tefas, A., Iosifidis, A., Pitas, I. (2013). Neural Networks for Digital Media Analysis and Description. In: Iliadis, L., Papadopoulos, H., Jayne, C. (eds) Engineering Applications of Neural Networks. EANN 2013. Communications in Computer and Information Science, vol 383. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41013-0_1
Download citation
DOI: https://doi.org/10.1007/978-3-642-41013-0_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41012-3
Online ISBN: 978-3-642-41013-0
eBook Packages: Computer ScienceComputer Science (R0)