Skip to main content

A Machine Learning Based Video Summarization Framework for Yoga-Posture Video

  • Conference paper
  • First Online:
Computational Intelligence in Communications and Business Analytics (CICBA 2023)

Abstract

Video summarization techniques aim to generate a concise but complete synopsis of a video by choosing the most informative frames of the video content without loss of interpretability. Given the abundance of video content and its complex nature, there has always been a huge demand for an effective video summarization technique to analyze various dynamic posture centric videos. Yoga session video summarization is one of the interesting application areas of dynamic posture centric video analysis that is lately drawing the attention of computer vision researchers. The majority of available general video summarizing methods fail to detect key yoga poses in a yoga session video effectively, as they do not consider posture-centric information while extracting key frames. In this paper, we propose a machine learning based video summarization framework, which is capable of extracting a series of key postures in a yoga session video by tracking a few key-posture points corresponding to vital parts of the human body. Compared to the widely used FFMPEG tool, the proposed method appears to have a higher proportion of matched keyframes but a lower proportion of missing key-frames and redundant non key-frames with respect to the ground truth set, demonstrating its potential as an effective yoga posture video summarizer.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 64.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 84.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Ajmal, M., Ashraf, M.H., Shakir, M., Abbas, Y., Shah, F.A.: Video summarization: techniques and classification. In: Bolc, L., Tadeusiewicz, R., Chmielewski, L.J., Wojciechowski, K. (eds.) ICCVG 2012. LNCS, vol. 7594, pp. 1–13. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-33564-8_1

  2. Basavarajaiah, M., Sharma, P.: Survey of compressed domain video summarization techniques. ACM Comput. Surv. 52(6), 1–29 (2019)

    Article  Google Scholar 

  3. Bazarevsky, V., Grishchenko, I., Raveendran, K., Zhu, T., Zhang, F., Grundmann, M.: Blazepose: on-device real-time body pose tracking. arXiv preprint arXiv:2006.10204 (2020)

  4. Chao, G.C., Tsai, Y.P., Jeng, S.K.: Augmented 3-d keyframe extraction for surveillance videos. IEEE Trans. Circuits Syst. Video Technol. 20(11), 1395–1408 (2010)

    Article  Google Scholar 

  5. Gygli, M., Grabner, H., Riemenschneider, H., Van Gool, L.: Creating summaries from user videos. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8695, pp. 505–520. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10584-0_33

  6. Hannane, R., Elboushaki, A., Afdel, K., Naghabhushan, P., Javed, M.: An efficient method for video shot boundary detection and keyframe extraction using sift-point distribution histogram. Int. J. Multim. Inf. Retriev. 5(2), 89–104 (2016)

    Article  Google Scholar 

  7. Liu, G., Zhao, J.: Key frame extraction from mpeg video stream. In: 2010 Third International Symposium on Information Processing, pp. 423–427. IEEE (2010)

    Google Scholar 

  8. Long, C., Jo, E., Nam, Y.: Development of a yoga posture coaching system using an interactive display based on transfer learning. J. Supercomput. 78(4), 5269–5284 (2022)

    Article  Google Scholar 

  9. Luo, J., Papin, C., Costello, K.: Towards extracting semantically meaningful key frames from personal video clips: from humans to computers. IEEE Trans. Circuits Syst. Video Technol. 19(2), 289–301 (2008)

    Google Scholar 

  10. Moir, G.L., Graham, B.W., Davis, S.E., Guers, J.J., Witmer, C.A.: An efficient method of key-frame extraction based on a cluster algorithm. J. Hum. Kinet. 39(1), 15–23 (2013)

    Article  Google Scholar 

  11. Nasreen, A., Roy, K., Roy, K., Shobha, G.: Key frame extraction and foreground modelling using k-means clustering. In: 2015 7th International Conference on Computational Intelligence, Communication Systems and Networks, pp. 141–145. IEEE (2015)

    Google Scholar 

  12. Newell, A., Yang, K., Deng, J.: Stacked hourglass networks for human pose estimation. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9912, pp. 483–499. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46484-8_29

  13. Niedermayer, M., Sabatini, S., Giovara, V.: Ffmpeg documentation (2012). http://ffmpeg.org/ffmpeg-all.html#Video-Encoders

  14. Peng, B., Lei, J., Fu, H., Jia, Y., Zhang, Z., Li, Y.: Deep video action clustering via spatio-temporal feature learning. Neurocomputing 456, 519–527 (2021)

    Article  Google Scholar 

  15. Sigal, L.: Human pose estimation. In: Computer Vision: A Reference Guide, pp. 573–592. Springer (2021)

    Google Scholar 

  16. Sze, K.W., Lam, K.M., Qiu, G.: A new key frame representation for video segment retrieval. IEEE Trans. Circuits Syst. Video Technol. 15(9), 1148–1155 (2005)

    Article  Google Scholar 

  17. Yan, C., Li, X., Li, G.: A new action recognition framework for video highlights summarization in sporting events. In: 2021 16th International Conference on Computer Science and Education (ICCSE), pp. 653–666. IEEE (2021)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Soumya Bhattacharyya .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Afreen, S., Ghosh, T., Bhattacharyya, S., Bhar, A., Saha, S. (2024). A Machine Learning Based Video Summarization Framework for Yoga-Posture Video. In: Dasgupta, K., Mukhopadhyay, S., Mandal, J.K., Dutta, P. (eds) Computational Intelligence in Communications and Business Analytics. CICBA 2023. Communications in Computer and Information Science, vol 1956. Springer, Cham. https://doi.org/10.1007/978-3-031-48879-5_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-48879-5_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-48878-8

  • Online ISBN: 978-3-031-48879-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics