Skip to main content
Log in

Automatic detection of slide transitions in lecture videos

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

This paper presents a method to automatically detect slide changes in lecture videos. For accurate detection, the regions capturing slide images are first identified from video frames. Then, SIFT features are extracted from the regions, which are invariant to image scaling and rotation. These features are used to compare similarity between frames. If the similarity is smaller than a threshold, slide transition is detected. The threshold is estimated based on the mean and standard deviation of sample frames’ similarities. Using this method, high detection accuracy can be obtained without any supplementary slide images. The proposed method also supports detection of backward slide transitions that occur when a speaker returns to a previous slide to emphasize its contents. In experiments conducted on our test collection, the proposed method showed 87 % accuracy in forward transition detection and 86 % accuracy in backward transition detection.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Notes

  1. 1 We only consider f i,j satisfying 1≤i < jk, because f i,j is equal to f j,i , and f i,i is meaningless.

References

  1. Cernekova Z, Pitas I, Nikow C (2006) Information theory-based shot cut/fade detection and video summarization. IEEE Trans Circ Syst Video Tech 16(1): 82–91

    Article  Google Scholar 

  2. Che X, Yang H, Meinel C (2013) Lecture video segmentation by automatically analyzing the synchronized slides. In: Proceedings of the 21st ACM international conference on multimedia, ACMMM’13. ACM, pp 345–348

  3. Cooper M, Liu T, Rieffel E (2007) Video segmentation via temporal pattern classification. IEEE Trans Multimed 9(3): 610–618

    Article  Google Scholar 

  4. Fan Q, Barnard K, Amir A, Efrat A, Lin M (2006) Matching slides to presentation videos using SIFT and scene background matching. In: Proceedings of the 8th ACM international workshop on multimedia information retrieval, MIR’06. ACM, pp 239–248

  5. Fan Q, Amir A, Barnard K, Swaminathan R, Efrat A (2007) Temporal modeling of slide change in presentation videos. In: Proceedings of the IEEE internation conference on acoustics, speech and signal processing, ICASSP’. IEEE, pp 989–992

  6. Hu W et al. (2001) A survey on visual content-based Video indexing and retrieval. IEEE Trans Syst Man Cybern C Appl Rev 41(6): 797–819

    Google Scholar 

  7. Lowe D G (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2): 91–110

    Article  Google Scholar 

  8. Li D, Lu H (2000) Model based video segmentation. In: IEEE workshop on signal processing systems, SiPS 2000. IEEE, pp 120–129

  9. Ma D, Agam G (2012) Lecture video segmentation and indexing. In: SPIE 8297, document recognition and retrieval XIX, 82970V

  10. Mukhopadhyay S, Smith B (1999) Passive capture and structuring of lectures. In: Proceedings of the 7th ACM international conference on multimedia, ACMMM’99. ACM, New York, pp 477–487

  11. Ngo C W, Pong T C, Chin R T (1999) Detection of gradual transitions through temporal slice analysis. In: IEEE computer society conference on computer vision and pattern recognition (vol 1)

  12. Ngo C W, Pong T C, Huang TS (2002) Detection of slide transition for topic indexing. In: Proceedings of the IEEE international conference on multimedia and expo, ICME’02. IEEE, pp 533-536

  13. Ngo C W, Wang F, Pong T C (2003) Structuring lecture videos for distance learning applications. In: Proceedings of the IEEE 5th international symposium on multimedia software engineering, MMSE’03. IEEE, pp 215–222

  14. Otzu N (1979) A threshold selection method from gray-level histograms. IEEE Trans Syst Man Cybern 9(1): 62–66

    Article  Google Scholar 

  15. Porter S (2004) Video segmentation and indexing using motion estimation. University of Bristol, Dissertation

    Google Scholar 

  16. Powers D M W (2011) Evaluation: from precision, recall and f-measure to roc., informedness, markedness & correlation. J Mach Lear Tech 2(1): 37–63

    MathSciNet  Google Scholar 

  17. Shafait F, Keysers D, Breuel T M (2008) Efficient implementation of local adaptive thresholding techniques using integral images SPIE 6815, document recognition and retrieval XV, 681510

  18. Smeaton A F, Over P, Doherty A R (2010) Video shot boundary detection: sevent years of TRECVid activity. Comput Vis Image Und 114(4): 411–418

    Article  Google Scholar 

  19. Wang X, Kankanhalli M (2009) Robust alignment of presentation videos with slides. In: Proceedings of the 10th pacific rim conference on multimedia, PCM’09. Springer pp 311–322

  20. Wu X et al. (2008) Shot boundary detection: an information saliency approach. In: Congress on the image and signal processing, CISP’08. pp 808-812

  21. Xia D, Deng X, Zeng Q (2007) Shot boundary detection based on difference sequences of mutual information. In: Proceedings of the image and graphics, ICIG’07. pp 389-394

  22. Zhao ZC, Cai AN, 2006Shot boundary detection algorithm in compressed domain based on ababoost and fuzzy theory. In: Proceedings of the international conference on advances in natural computation, ICNC’06. pp 617-626

Download references

Acknowledgments

This research was supported by the MSIP(Ministry of Science, ICT and Future Planning) of Korea under the ITRC support program(NIPA-2013-H0301-13-4009), and the National Research Foundation of Korea grant funded by the Korea government(MEST) (No. 2012R1A2A2A01046694).

This paper was supported by the Sahmyook University Research Fund in 2013.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hyeon Gyu Kim.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jeong, H.J., Kim, TE., Kim, H.G. et al. Automatic detection of slide transitions in lecture videos. Multimed Tools Appl 74, 7537–7554 (2015). https://doi.org/10.1007/s11042-014-1990-6

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-014-1990-6

Keywords

Navigation