Skip to main content

Neural Networks for Digital Media Analysis and Description

  • Conference paper
Engineering Applications of Neural Networks (EANN 2013)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 383))

  • 1764 Accesses

Abstract

In this paper a short overview on recent research efforts for digital media analysis and description using neural networks is given. Neural networks are very powerful in analyzing, representing and classifying digital media content through various architectures and learning algorithms. Both unsupervised and supervised algorithms can be used for digital media feature extraction. Digital media representation can be done either in a synaptic level or at the output level. The specific problem that is used as a case study for digital media analysis is the human-centered video analysis for activity and identity recognition. Several neural network topologies, such as self organizing maps, independent subspace analysis, multi-layer perceptrons, extreme learning machines and deep learning architectures are presented and results on human activity recognition are reported.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Kyperountas, M., Tefas, A., Pitas, I.: Dynamic training using multistage clustering for face recognition. Pattern Recognition, 894–905 (2008)

    Google Scholar 

  2. Kyperountas, M., Tefas, A., Pitas, I.: Salient feature and reliable classifier selection for facial expression classification. Pattern Recognition, 972–986 (2010)

    Google Scholar 

  3. Gkalelis, N., Tefas, A., Pitas, I.: Combining fuzzy vector quantization with linear discriminant analysis for continuous human movement recognition. IEEE Transactions on Circuits and Systems for Video Technology, 1511–1521 (2008)

    Google Scholar 

  4. Bengio, Y., Courville, A.C., Vincent, P.: Unsupervised Feature Learning and Deep Learning: A Review and New Perspectives. Arxiv (2012)

    Google Scholar 

  5. Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet Classification with Deep Convolutional Neural Networks. Neural Information Processing Systems (2012)

    Google Scholar 

  6. Bordes, A., Glorot, X., Weston, J., Bengio, Y.: Joint Learning of Words and Meaning Representations for Open-Text Semantic Parsing. In: Proceedings of the 15th International Conference on Artificial Intelligence and Statistics, AISTATS (2012)

    Google Scholar 

  7. Le, Q.V., Ranzato, M., Monga, R., Devin, M., Chen, K., Corrado, G.S., Dean, J., Ng, A.Y.: Building High-level Features Using Large Scale Unsupervised Learning. In: ICML (2012)

    Google Scholar 

  8. Goodfellow, I., Courville, A., Bengio, Y.: Large-Scale Feature Learning With Spike-and-Slab Sparse Coding. In: ICML (2012)

    Google Scholar 

  9. Iosifidis, A., Tefas, A., Pitas, I.: View-invariant action recognition based on Artificial Neural Networks. IEEE Transactions on Neural Networks and Learning Systems 23(3), 412–424 (2012)

    Article  Google Scholar 

  10. Iosifidis, A., Tefas, A., Pitas, I.: Person Identification from Actions based on Artificial Neural Networks. In: Computational Intelligence in Biometrics and Identity Management, Singapore. Symposium Series on Computational Intelligence, SSCI (2013)

    Google Scholar 

  11. Le, Q.V., Zou, W.Y., Yeung, S.Y., Ng, A.Y.: Learning hierarchical invariant spatio-temporal features for action recognition with independent subspace analysis. In: 2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3361–3368. IEEE Press, Colorado (2011)

    Google Scholar 

  12. Kapsouras, I., Karanikolos, S., Nikolaidis, N., Tefas, A.: Feature Comparison and Feature Fusion for Traditional Dances Recognition. In: 14th Engineering Applications of Neural Networks Conference, Halkidiki (2013)

    Google Scholar 

  13. Haykin, S.: Neural Networks and Learning Machines. Upper Saddle River, New Jersey (2008)

    Google Scholar 

  14. Huang, G.B., Chen, L., Siew, C.: Universal approximation using incremental constructive feedforward networks with random hidden nodes. IEEE Transactions on Neural Networks 17(4), 879–892 (2006)

    Article  Google Scholar 

  15. Huang, G.B., Zhou, H., Ding, X., Zhang, R.: Extreme Learning Machine for Regressiona and Multiclass Classification. IEEE Transactions on Systems, Man and Cybernetics, Part B: Cybernetics 42(2), 513–529 (2012)

    Article  Google Scholar 

  16. Minhas, R., Baradarani, S., Seifzadeh, S., Wu, Q.J.: Human action recognition using extreme learning machine based on visual vocabularies. Neurocomputing 73(10-12), 1906–1917 (2010)

    Article  Google Scholar 

  17. Iosifidis, A., Tefas, A., Pitas, I.: Multi-view Human Action Recognition under Occlusion based on Fuzzy Distances and Neural Networks. In: European Signal Processing Conference, pp. 1129–1133 (2012)

    Google Scholar 

  18. Iosifidis, A., Tefas, A., Pitas, I.: Minimum Class Variance Extreme Learning Machine for Human Action Recognition. IEEE Transactions on Circuits and Systems for Video Technology (accepted, 2013)

    Google Scholar 

  19. Iosifidis, A., Tefas, A., Pitas, I.: Dynamic action recognition based on dynemes and Extreme Learning Machine. Pattern Recognition Letters (accepted, 2013)

    Google Scholar 

  20. Iosifidis, A., Tefas, A., Pitas, I.: Dynamic Action Classification Based on Iterative Data Selection and Feedfforward Neural Networks. In: European Signal Processing Conference (accepted, 2013)

    Google Scholar 

  21. Scholkopf, B., Smola, A.J.: Learning with kernels: Support vector machines, regularization, optimization, and beyond. MIT Press (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tefas, A., Iosifidis, A., Pitas, I. (2013). Neural Networks for Digital Media Analysis and Description. In: Iliadis, L., Papadopoulos, H., Jayne, C. (eds) Engineering Applications of Neural Networks. EANN 2013. Communications in Computer and Information Science, vol 383. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41013-0_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-41013-0_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41012-3

  • Online ISBN: 978-3-642-41013-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics