abstract

MORE'24 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities

Authors:

Liang ZhengAuthors Info & Claims

ICMR '24: Proceedings of the 2024 International Conference on Multimedia Retrieval

Pages 1336 - 1338

https://doi.org/10.1145/3652583.3658892

Published: 07 June 2024 Publication History

Abstract

Object re-identification (or object re-id) has gained significant attention in recent years, fueled by the increasing demand for advanced video analysis and safety systems. In object re-id, a query can be of different modalities, such as an image, a video, or natural language, containing or describing the object of interest. This workshop aims to bring together researchers, practitioners, and enthusiasts interested in object re-id to delve into the latest advancements, challenges, and opportunities in this dynamic field. The workshop covers a spectrum of topics related to object re-id, including but not limited to deep metric learning, multi-view data generation, video-based object re-id, cross-domain object re-id and real-world applications. The workshop provides a platform for researchers to showcase their work, exchange ideas, and foster potential collaborations. Additionally, it serves as a valuable opportunity for practitioners to stay abreast of the latest developments in object re-id technology.

References

[1]

Shutao Bai, Bingpeng Ma, Hong Chang, Rui Huang, and Xilin Chen. 2022. Salient-to-broad transition for video person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7339--7348.

[2]

Rotimi-Williams Bello, Ahmad Mohamed, and Abdullah Talib. 2022. Smart animal husbandry: A review of its data, applications, techniques, challenges and opportunities. Applications, Techniques, Challenges and Opportunities (May 8, 2022) (2022).

[3]

Chuchu Han, Zhedong Zheng, Kai Su, Dongdong Yu, Zehuan Yuan, Changxin Gao, Nong Sang, and Yi Yang. 2022. DMRNet: Learning discriminative features with decoupled networks and enriched pairs for one-step person search. IEEE Transactions on Pattern Analysis and Machine Intelligence (2022).

[4]

Minyue Jiang, Xuanmeng Zhang, Yue Yu, Zechen Bai, Zhedong Zheng, Zhigang Wang, Jian Wang, Xiao Tan, Hao Sun, Errui Ding, et al. 2021. Robust vehicle re-identification via rigid structure prior. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4026--4033.

[5]

Xintong Jiang, Yaxiong Wang, Yujiao Wu, Bingwen Hu, and Xueming Qian. 2024. CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval. SIGIR (2024).

[6]

Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, and Xiaogang Wang. 2017. Person search with natural language description. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1970--1979.

[7]

Xin Lin, Li Zhu, Shuyu Yang, and Yaxiong Wang. 2023. Diff attention: A novel attention scheme for person re-identification. Comput. Vis. Image Underst., Vol. 228 (2023), 103623. https://doi.org/10.1016/J.CVIU.2023.103623

Digital Library

[8]

Yutian Lin, Liang Zheng, Zhedong Zheng, Yu Wu, Zhilan Hu, Chenggang Yan, and Yi Yang. 2019. Improving person re-identification by attribute and identity learning. Pattern Recognition, Vol. 95 (2019), 151--161.

Digital Library

[9]

Yutian Lin, Zhedong Zheng, Hong Zhang, Chenqiang Gao, and Yi Yang. 2020. Bayesian query expansion for multi-camera person re-identification. Pattern Recognition Letters, Vol. 130 (2020), 284--292.

Digital Library

[10]

Qian Liu, Xiubo Geng, Heyan Huang, Tao Qin, Jie Lu, and Daxin Jiang. 2021. MGRC: An End-to-End Multigranularity Reading Comprehension Model for Question Answering. IEEE Transactions on Neural Networks and Learning Systems (2021).

[11]

Xuelin Qian, Yanwei Fu, Yu-Gang Jiang, Tao Xiang, and Xiangyang Xue. 2017. Multi-scale deep learning architectures for person re-identification. In Proceedings of the IEEE international conference on computer vision. 5399--5408.

[12]

Xuelin Qian, Yanwei Fu, Tao Xiang, Yu-Gang Jiang, and Xiangyang Xue. 2019. Leader-based multi-scale attention deep architecture for person re-identification. IEEE transactions on pattern analysis and machine intelligence, Vol. 42, 2 (2019), 371--385.

[13]

Xuelin Qian, Yanwei Fu, Tao Xiang, Wenxuan Wang, Jie Qiu, Yang Wu, Yu-Gang Jiang, and Xiangyang Xue. 2018. Pose-normalized image generation for person re-identification. In Proceedings of the European conference on computer vision (ECCV). 650--667.

Digital Library

[14]

Xuelin Qian, Wenxuan Wang, Li Zhang, Fangrui Zhu, Yanwei Fu, Tao Xiang, Yu-Gang Jiang, and Xiangyang Xue. 2020. Long-term cloth-changing person re-identification. In Proceedings of the Asian Conference on Computer Vision.

[15]

Leigang Qu, Meng Liu, Wenjie Wang, Zhedong Zheng, Liqiang Nie, and Tat-Seng Chua. 2023. Learnable Pillar-based Re-ranking for Image-Text Retrieval. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1252--1261.

Digital Library

[16]

Chenggui Sun, Li Bin Song, and Lihang Ying. 2022. Product Re-identification System in Fully Automated Defect Detection. In International Conference on Smart Multimedia. Springer, 144--156.

[17]

Tingyu Wang, Zhedong Zheng, Yaoqi Sun, Chenggang Yan, Yi Yang, and Tat-Seng Chua. 2024. Multiple-environment Self-adaptive Network for Aerial-view Geo-localization. Pattern Recognition, Vol. 152 (2024), 110363.

Digital Library

[18]

Tingyu Wang, Zhedong Zheng, Chenggang Yan, Jiyong Zhang, Yaoqi Sun, Bolun Zheng, and Yi Yang. 2022b. Each part matters: Local patterns facilitate cross-view geo-localization. IEEE Transactions on Circuits and Systems for Video Technology (2022).

[19]

Tingyu Wang, Zhedong Zheng, Zunjie Zhu, Yaoqi Sun, Yi Yang, and Chenggang Yan. 2022c. Learning cross-view geo-localization embeddings via dynamic weighted decorrelation regularization. arXiv preprint arXiv:2211.05296 (2022).

[20]

Xiaodong Wang, Zhedong Zheng, Yang He, Fei Yan, Zhiqiang Zeng, and Yi Yang. 2021b. Soft Person Reidentification Network Pruning via Blockwise Adjacent Filter Decaying. IEEE Transactions on Cybernetics (2021).

[21]

Yaxiong Wang, Hao Yang, Xiuxiu Bai, Xueming Qian, Lin Ma, Jing Lu, Biao Li, and Xin Fan. 2021a. PFAN: Bi-Directional Image-Text Retrieval With Position Focused Attention Network. IEEE Trans. Multim., Vol. 23 (2021), 3362--3376. https://doi.org/10.1109/TMM.2020.3024822

[22]

Yaxiong Wang, Hao Yang, Xueming Qian, Lin Ma, Jing Lu, Biao Li, and Xin Fan. 2019. Position Focused Attention Network for Image-Text Matching. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10--16, 2019, Sarit Kraus (Ed.). ijcai.org, 3792--3798. https://doi.org/10.24963/IJCAI.2019/526

[23]

Zheng Wang, Dan Xu, Zhedong Zheng, and Kui Jiang. 2022a. Multimedia Content Understanding in Harsh Environments. In Proceedings of the 30th ACM International Conference on Multimedia. 7372--7373.

Digital Library

[24]

Zhongdao Wang, Liang Zheng, Yixuan Liu, Yali Li, and Shengjin Wang. 2020. Towards real-time multi-object tracking. In European Conference on Computer Vision. Springer, 107--122.

Digital Library

[25]

Longhui Wei, Xiaobin Liu, Jianing Li, and Shiliang Zhang. 2018. VP-ReID: Vehicle and person re-identification system. In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval. 501--504.

Digital Library

[26]

Fan Yang, Zheng Wang, Yang Wu, Sakriani Sakti, and Satoshi Nakamura. 2022. Tackling multiple object tracking with complicated motions-re-designing the integration of motion and appearance. Image and Vision Computing, Vol. 124 (2022), 104514.

Digital Library

[27]

Shuyu Yang, Yinan Zhou, Zhedong Zheng, Yaxiong Wang, Li Zhu, and Yujiao Wu. 2023. Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark. In Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023, Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, and M. Shamim Hossain (Eds.). ACM, 4492--4501. https://doi.org/10.1145/3581783.3611709

Digital Library

[28]

Xuanmeng Zhang, Minyue Jiang, Zhedong Zheng, Xiao Tan, Errui Ding, and Yi Yang. 2020. Understanding image retrieval re-ranking: A graph neural network perspective. arXiv preprint arXiv:2012.07620 (2020).

[29]

Guoshuai Zhao, Chaofeng Zhang, Heng Shang, Yaxiong Wang, Li Zhu, and Xueming Qian. 2023. Generative label fused network for image-text matching. Knowl. Based Syst., Vol. 263 (2023), 110280. https://doi.org/10.1016/J.KNOSYS.2023.110280

Digital Library

[30]

Liang Zheng, Zhi Bie, Yifan Sun, Jingdong Wang, Chi Su, Shengjin Wang, and Qi Tian. 2016. Mars: A video benchmark for large-scale person re-identification. In European conference on computer vision. Springer, 868--884.

[31]

Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable person re-identification: A benchmark. In Proceedings of the IEEE international conference on computer vision. 1116--1124.

Digital Library

[32]

Zhedong Zheng, Minyue Jiang, Zhigang Wang, Jian Wang, Zechen Bai, Xuanmeng Zhang, Xin Yu, Xiao Tan, Yi Yang, Shilei Wen, et al. 2020a. Going beyond real data: A robust visual representation for vehicle re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 598--599.

[33]

Zhedong Zheng, Tao Ruan, Yunchao Wei, Yi Yang, and Tao Mei. 2020b. VehicleNet: Learning robust visual representation for vehicle re-identification. IEEE Transactions on Multimedia, Vol. 23 (2020), 2683--2693.

Digital Library

[34]

Zhedong Zheng, Yujiao Shi, Tingyu Wang, Jun Liu, Jianwu Fang, Yunchao Wei, and Tat-seng Chua. 2023 a. UAVM '23: 2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. In Proceedings of the 31st ACM International Conference on Multimedia. 9715--9717. https://doi.org/10.1145/3581783.3610937

Digital Library

[35]

Zhedong Zheng, Xiaohan Wang, Nenggan Zheng, and Yi Yang. 2022a. Parameter-efficient person re-identification in the 3d space. IEEE Transactions on Neural Networks and Learning Systems (2022).

[36]

Zhedong Zheng, Yunchao Wei, and Yi Yang. 2020c. University-1652: A multi-view multi-source benchmark for drone-based geo-localization. In Proceedings of the 28th ACM international conference on Multimedia. 1395--1403.

Digital Library

[37]

Zhedong Zheng and Liang Zheng. 2024. Object Re-identification: Problems, Algorithms and Responsible Research Practice. In The Boundaries of Data. Amsterdam University Press.

[38]

Zhedong Zheng, Liang Zheng, and Yi Yang. 2017. Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In Proceedings of the IEEE international conference on computer vision. 3754--3762.

[39]

Zhedong Zheng, Liang Zheng, Yi Yang, and Fei Wu. 2023 b. U-Turn: Crafting Adversarial Queries with Opposite-Direction Features. International Journal of Computer Vision, Vol. 131, 4 (2023), 835--854.

Digital Library

[40]

Zhedong Zheng, Jiayin Zhu, Wei Ji, Yi Yang, and Tat-Seng Chua. 2022b. 3D Magic Mirror: Clothing Reconstruction from a Single Image via a Causal Perspective. arXiv preprint arXiv:2204.13096 (2022).

Index Terms

MORE'24 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision representations
        Appearance and texture representations
      2. Computer vision tasks
        Visual content-based indexing and retrieval

Recommendations

Multimedia Retrieval Conference Enjoys Texas Hospitality

The Third ACM International Conference on Multimedia Retrieval (ICMR) was held in Dallas, Texas, from 16-19 April 2013. The conference aims to promote intellectual exchanges and interactions among scientists, engineers, students, multimedia researchers ...
MORE '24: Proceedings of the 1st ICMR Workshop on Multimedia Object Re-Identification
A fast multi-scale covariance descriptor for object re-identification

In many surveillance systems, there is a need to determine if a given object (person, group of persons, vehicle, ...) has already been observed over a network of cameras. It is the object re-identification problem. Solving this problem involves matching ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

ICMR '24: Proceedings of the 2024 International Conference on Multimedia Retrieval

May 2024

1379 pages

ISBN:9798400706196

DOI:10.1145/3652583

General Chairs:
Cathal Gurrin
Dublin City University, Ireland
,
Rachada Kongkachandra
Thammasat University, Thailand
,
Klaus Schoeffmann
Klagenfurt University, Austria
,
Program Chairs:
Duc-Tien Dang-Nguyen
University of Bergen, Norway
,
Luca Rossetto
University of Zurich, Switzerland
,
Shin'ichi Satoh
National Institute of Informatics, Japan
,
Liting Zhou
Dublin City University, Ireland

Copyright © 2024 Owner/Author.

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 June 2024

Check for updates

Author Tags

Qualifiers

Abstract

Conference

ICMR '24

Sponsor:

ICMR '24: International Conference on Multimedia Retrieval

June 10 - 14, 2024

Phuket, Thailand

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
218
Total Downloads

Downloads (Last 12 months)218
Downloads (Last 6 weeks)30

Reflects downloads up to 28 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten