User-Ranking Video Summarization With Multi-Stage Spatio–Temporal Representation | IEEE Journals & Magazine | IEEE Xplore

User-Ranking Video Summarization With Multi-Stage Spatio–Temporal Representation


Abstract:

Video summarization is a challenging task, mainly due to the difficulties in learning complicated semantic structural relations between videos and summaries. In this pape...Show More

Abstract:

Video summarization is a challenging task, mainly due to the difficulties in learning complicated semantic structural relations between videos and summaries. In this paper, we present a novel supervised video summarization scheme based on three-stage deep neural networks. The scheme takes a divide-and-conquer strategy to resolve the complicated task of 3D video summarization into a set of easy and flexible computational subtasks, and then to sequentially perform 2D CNNs, 1D CNNs, and long short-term memory to address the subtasks in an hierarchical fashion. The hierarchical modeling of spatio-temporal structure leads to high performance and efficiency. In addition, we propose a simple but effective user-ranking method to cope with the labeling subjectivity problem of user-created video summarization, leading to the labeling quality refinement for robust supervised learning. Experimental results show that our approach outperforms the state-of-the-art video summarization methods on two benchmark datasets.
Published in: IEEE Transactions on Image Processing ( Volume: 28, Issue: 6, June 2019)
Page(s): 2654 - 2664
Date of Publication: 21 December 2018

ISSN Information:

PubMed ID: 30582538

Funding Agency:


References

References is not available for this document.