skip to main content
10.1145/3607540acmconferencesBook PagePublication PagesmmConference Proceedingsconference-collections
NarSUM '23: Proceedings of the 2nd Workshop on User-centric Narrative Summarization of Long Videos
ACM2023 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
MM '23: The 31st ACM International Conference on Multimedia Ottawa ON Canada 29 October 2023
ISBN:
979-8-4007-0277-8
Published:
29 October 2023
Sponsors:
Next Conference
October 28 - November 1, 2024
Melbourne , VIC , Australia
Bibliometrics
Skip Abstract Section
Abstract

It is our great pleasure to welcome you to the 2nd Workshop on User-Centric Narrative Summarization of Long Videos (- NarSUM 2023), which is held in conjunction with the 31st ACM International Conference on Multimedia (ACM Multimedia 2023). Through this workshop, we introduce a novel research direction for the multimedia community, namely user-centric narrative summarization of long videos. The main focus is on two key aspects: firstly, on summarization of long videos captured from multiple cameras and secondly, on creating summary with a user-centric storytelling perspective. This workshop will also discuss about various aspects of video summarization including emerging topics, future directions, potential applications and other open problems.

Skip Table Of Content Section
SESSION: Invited Talk 1
keynote
How People Watch Videos? Viewer Behavior Analysis for Video Archive Summarization

If viewers' behavior in watching videos can be observed, many clues useful for video summarization and other applications can be obtained. For example, which (parts of) videos draw attention of more viewers, which contents are most watched, what kind of ...

SESSION: Spotlight and Poster Session
research-article
An Empirical Study of Multilingual Scene-Text Visual Question Answering

In recent years, the focus on multilingual modeling has intensified, driven by the necessity to enable cross-lingual Text-based Visual Question Answering (TextVQA), which requires the understanding of questions and answers across diverse languages. ...

research-article
A New Approach for Evaluating Movie Summarization

An important need in many situations involving video collections (archive video search/reuse, personal video organization/search, movies, tv shows, etc.) is to summarize the video in order to reduce the size and concentrate the amount of high value ...

research-article
A Method of Image Dehazing Based on Atmospheric Veil Prediction by ResNet

Image defogging is an important prerequisite for video summary. In existing defogging methods, there are some weaknesses such as too long parameters calculation time such as transmission map and atmospheric veil estimation. We propose a new method ...

research-article
Sequential Action Retrieval for Generating Narratives from Long Videos

In this paper, we propose a novel event retrieval method called Sequential Action Retrieval, which is a work in progress, towards generating video and text narratives of long-term events from long videos. Summarizing events of user interest from long ...

research-article
Open Access
Narrative Graph for Narrative Generation from Long Videos

Advancements in camera technology and cloud storage have led to a surge in video content creation, making videos more accessible. However, consuming raw, unprocessed, and lengthy videos can be unengaging. While videos with human-authored narratives (...

research-article
A Study on the Use of Attention for Explaining Video Summarization

In this paper we present our study on the use of attention for explaining video summarization. We build on a recent work that formulates the task, called XAI-SUM, and we extend it by: a) taking into account two additional network architectures and b) ...

research-article
Multimodal Video Captioning using Object-Auditory Information Fusion with Transformers

Video captioning aims to generate natural language sentences of an input video. Generating coherent natural language sentences is a challenging task due to the complex nature of video content such as object and scene understanding, extraction of object- ...

research-article
Story-to-Images Translation: Leveraging Diffusion Models and Large Language Models for Sequence Image Generation

Diffusion models are catalyzing breakthroughs in creative fields, with a notable impact on text-to-image generation. This study centers on the transformation of textual narratives into coherent sequences of images - a process currently hampered by ...

research-article
Open Access
A Systematic Study on Video Summarization: Approaches, Challenges, and Future Directions

With the exponential growth of user-generated videos, video summarization has become a prominent research field to quickly understand the essence of video content. The goal is to automate the task of acquiring key segments from the video while retaining ...

SESSION: Invited Talk 2
keynote
Video Summarization at TRECVID - Past Efforts and What's Next

In recent years, the exponential growth of multimedia content, particularly movies and videos, has posed significant challenges for content consumption and comprehension. The vast amount of available audiovisual data necessitates efficient and effective ...

Contributors
  • National University of Singapore
  • Queen Mary University of London
  • NEC Corporation
  • National University of Singapore
  • Nagoya University
Index terms have been assigned to the content through auto-classification.

Recommendations