research-article

Salient Time Slice Pruning and Boosting for Person-Scene Instance Search in TV Series

Authors:
Zheng Wang

National Institute of Informatics, Japan

National Institute of Informatics, Japan
View Profile

,
Fan Yang

The University of Tokyo, Japan

The University of Tokyo, Japan
View Profile

,
Shin'ichi Satoh

National Institute of Informatics, Japan The University of Tokyo, Japan

National Institute of Informatics, Japan The University of Tokyo, Japan
View Profile

MMAsia '19: Proceedings of the 1st ACM International Conference on Multimedia in AsiaDecember 2019Article No.: 27Pages 1–6https://doi.org/10.1145/3338533.3366594

Published:10 January 2020Publication History

MMAsia '19: Proceedings of the 1st ACM International Conference on Multimedia in Asia

Pages 1–6

ABSTRACT

It is common that TV audiences want to quickly browse scenes with certain actors in TV series. Since 2016, the TREC Video Retrieval Evaluation (TRECVID) Instance Search (INS) task has started to focus on identifying a target person in a target scene simultaneously. In this paper, we name this kind of task as P-S INS (Person-Scene Instance Search). To find out P-S instances, most approaches search person and scene separately, and then directly combine the results together by addition or multiplication. However, we find that person and scene INS modules are not always effective at the same time, or they may suppress each other in some situations. Aggregating the results shot after shot is not a good choice. Luckily, for the TV series, video shots are arranged in chronological order. We extend our focus from time point (single video shot) to time slice (multiple consecutive video shots) in the time-line. Through detecting salient time slices, we prune the data. Through evaluating the importance of salient time slices, we boost the aggregation results. Extensive experiments on the large-scale TRECVID INS dataset demonstrate the effectiveness of the proposed method.

References

George Awad, Wessel Kraaij, Paul Over, and Shin'ichi Satoh. 2017. Instance search retrospective with focus on TRECVID. International journal of multimedia information retrieval (2017).Google ScholarCross Ref
Mika Fischer, Hazım Kemal Ekenel, and Rainer Stiefelhagen. 2011. Person re-identification in tv series using robust face recognition and user feedback. Multimedia Tools and Applications (2011).Google Scholar
Haiyun Guo, Jinqiao Wang, Yue Gao, Jianqiang Li, and Hanqing Lu. 2016. Multi-view 3d object retrieval with deep embedding network. TIP (2016).Google Scholar
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In CVPR.Google Scholar
Luis Herranz, Shuqiang Jiang, and Xiangyang Li. 2016. Scene recognition with CNNs: objects, scales and dataset bias. In CVPR.Google Scholar
Jiamei Lan, Jun Chen, Zheng Wang, Chao Liang, and Shin'ichi Satoh. 2017. PS Instance Retrieval via Early Elimination and Late Expansion. In ACM MM Workshop.Google Scholar
Duy-Dinh Le, Sang Phan, and Shin'ichi Satoh. 2016. NII-HITACHI-UIT at TRECVID 2016. In TRECVID Workshop.Google Scholar
Duy-Dinh Le, Sebastien Poullot, Xiaomeng Wu, Bertrand Nouvel, and Shin'ichi Satoh. 2010. National Institute of Informatics, Japan at TRECVID 2010.. In TRECVID Workshop.Google Scholar
Jingjing Meng, Junsong Yuan, Yap-Peng Tan, and Gang Wang. 2015. Fast object instance search in videos from one example. In ICIP.Google Scholar
Vinh-Tiep Nguyen, Dinh-Luan Nguyen, Minh-Triet Tran, Duy-Dinh Le, Duc Anh Duong, and Shin'ichi Satoh. 2015. Query-adaptive late fusion with neural network for instance search. In MMSP.Google Scholar
Yuxin Peng, Xin Huang, and Jinwei Qi. 2016. Pku-icst at trecvid 2016: Instance search task. In TRECVID Workshop.Google Scholar
Gerard Salton and Donna Harman. 2003. Information retrieval. John Wiley and Sons Ltd.Google Scholar
Alan F Smeaton, Paul Over, and Wessel Kraaij. 2006. Evaluation campaigns and TRECVid. In ACM international workshop on Multimedia information retrieval.Google ScholarDigital Library
Zheng Wang, Yang Yang, Shuosen Guan, and Chenxia Han. 2016. Whu-nercms at trecvid2016: Instance search task. In TRECVID Workshop.Google Scholar
Wei Zhang, Hongzhi Li, Chong-Wah Ngo, and Shih-Fu Chang. 2014. Scalable visual instance mining with threads of features. In ACM MM.Google Scholar
W Zhang, CC Tan, SA Zhu, T Yao, L Pang, and CW Ngo. 2012. Vireo@ trecvid 2012: Searching with topology, recounting will small concepts, learning with free examples. In TRECVID Workshop.Google Scholar
Zhenxing Zhang, Rami Albatal, Cathal Gurrin, and Alan F Smeaton. 2013. Trecvid 2013 experiments at dublin city university. In TRECVID Workshop.Google Scholar
Zhicheng Zhao, Menglai Wang, and Rui Xiang. 2016. Bupt-mcprl at trecvid 2016. In TRECVID Workshop.Google Scholar
Liang Zheng, Yi Yang, and Qi Tian. 2017. SIFT meets CNN: A decade survey of instance retrieval. TPAMI (2017).Google Scholar
Yousong Zhu, Jinqiao Wang, Chaoyang Zhao, Haiyun Guo, and Hanqing Lu. 2016. Scale-adaptive deconvolutional regression network for pedestrian detection. In ACCV.Google Scholar

Index Terms

Salient Time Slice Pruning and Boosting for Person-Scene Instance Search in TV Series
1. Applied computing
  1. Computers in other domains
    1. Digital libraries and archives
2. Information systems
  1. Information retrieval
    1. Retrieval models and ranking
      1. Rank aggregation
    2. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Video search
  2. Information systems applications
    1. Multimedia information systems
      1. Multimedia databases

Recommendations

Inferring Attention Shifts for Salient Instance Ranking
Abstract
The human visual system has limited capacity in simultaneously processing multiple visual inputs. Consequently, humans rely on shifting their attention from one location to another. When viewing an image of complex scenes, psychology studies and ...
Read More
Salient object detection via boosting object-level distinctiveness and saliency refinement

We detect saliency via boosting object-level distinctiveness and saliency refinement.Our approach can better uniformly highlight heterogeneous regions of salient objects.A new method only using object-level features to detect coarse saliency is ...
Read More
Extraction of salient contours from cluttered scenes

The responses of neurons in the primary visual cortex (V1) to stimulus inside the receptive field (RF) can be markedly modulated by stimuli outside the classical receptive field. The modulation, relying on contextual configurations, yields excitatory ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

MMAsia '19: Proceedings of the 1st ACM International Conference on Multimedia in Asia
December 2019
403 pages
ISBN:9781450368414
DOI:10.1145/3338533

Copyright © 2019 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 10 January 2020
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Instance Search
Saliency
TV Series
Time-line
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
MMAsia '19 Paper Acceptance Rate59of204submissions,29%Overall Acceptance Rate59of204submissions,29%
More
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 64
  Total Downloads
- Downloads (Last 12 months)1
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Salient Time Slice Pruning and Boosting for Person-Scene Instance Search in TV Series

MMAsia '19: Proceedings of the 1st ACM International Conference on Multimedia in Asia

ABSTRACT

References

Cited By

Index Terms

Recommendations

Inferring Attention Shifts for Salient Instance Ranking

Salient object detection via boosting object-level distinctiveness and saliency refinement

Extraction of salient contours from cluttered scenes

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Salient Time Slice Pruning and Boosting for Person-Scene Instance Search in TV Series

MMAsia '19: Proceedings of the 1st ACM International Conference on Multimedia in Asia

ABSTRACT

References

Cited By

Index Terms

Recommendations

Inferring Attention Shifts for Salient Instance Ranking

Salient object detection via boosting object-level distinctiveness and saliency refinement

Extraction of salient contours from cluttered scenes

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media