- Sponsor: SIGMM
It is our great pleasure to welcome you to the 2nd Workshop on User-Centric Narrative Summarization of Long Videos (NarSUM 2023), which is held in conjunction with the 31st ACM International Conference on Multimedia (ACM Multimedia 2023). Through this workshop, we introduce a novel research direction for the multimedia community, namely user-centric narrative summarization of long videos. The workshop focuses on two key aspects: first, the summarization of long videos captured from multiple cameras, and second, the creation of summaries from a user-centric storytelling perspective. The workshop will also discuss various aspects of video summarization, including emerging topics, future directions, potential applications, and other open problems.
Proceeding Downloads
How People Watch Videos? Viewer Behavior Analysis for Video Archive Summarization
If viewers' behavior in watching videos can be observed, many clues useful for video summarization and other applications can be obtained. For example, which (parts of) videos draw the attention of more viewers, which contents are most watched, what kind of ...
An Empirical Study of Multilingual Scene-Text Visual Question Answering
In recent years, the focus on multilingual modeling has intensified, driven by the necessity to enable cross-lingual Text-based Visual Question Answering (TextVQA), which requires the understanding of questions and answers across diverse languages. ...
A New Approach for Evaluating Movie Summarization
An important need in many situations involving video collections (archive video search/reuse, personal video organization/search, movies, tv shows, etc.) is to summarize the video in order to reduce the size and concentrate the amount of high value ...
A Method of Image Dehazing Based on Atmospheric Veil Prediction by ResNet
Image defogging is an important prerequisite for video summarization. Existing defogging methods have weaknesses, such as the long time needed to calculate parameters like the transmission map and the atmospheric veil. We propose a new method ...
Sequential Action Retrieval for Generating Narratives from Long Videos
In this paper, we propose a novel event retrieval method called Sequential Action Retrieval, which is a work in progress, towards generating video and text narratives of long-term events from long videos. Summarizing events of user interest from long ...
Narrative Graph for Narrative Generation from Long Videos
Advancements in camera technology and cloud storage have led to a surge in video content creation, making videos more accessible. However, consuming raw, unprocessed, and lengthy videos can be unengaging. While videos with human-authored narratives (...
A Study on the Use of Attention for Explaining Video Summarization
In this paper we present our study on the use of attention for explaining video summarization. We build on a recent work that formulates the task, called XAI-SUM, and we extend it by: a) taking into account two additional network architectures and b) ...
Multimodal Video Captioning using Object-Auditory Information Fusion with Transformers
Video captioning aims to generate natural language sentences that describe an input video. Generating coherent natural language sentences is a challenging task due to the complex nature of video content such as object and scene understanding, extraction of object- ...
Story-to-Images Translation: Leveraging Diffusion Models and Large Language Models for Sequence Image Generation
Diffusion models are catalyzing breakthroughs in creative fields, with a notable impact on text-to-image generation. This study centers on the transformation of textual narratives into coherent sequences of images - a process currently hampered by ...
A Systematic Study on Video Summarization: Approaches, Challenges, and Future Directions
With the exponential growth of user-generated videos, video summarization has become a prominent research field to quickly understand the essence of video content. The goal is to automate the task of acquiring key segments from the video while retaining ...
Video Summarization at TRECVID - Past Efforts and What's Next
In recent years, the exponential growth of multimedia content, particularly movies and videos, has posed significant challenges for content consumption and comprehension. The vast amount of available audiovisual data necessitates efficient and effective ...