skip to main content
10.1145/3652583.3658892acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
abstract

MORE'24 Multimedia Object Re-ID: Advancements, Challenges, and Opportunities

Published: 07 June 2024 Publication History

Abstract

Object re-identification (or object re-id) has gained significant attention in recent years, fueled by the increasing demand for advanced video analysis and safety systems. In object re-id, a query can be of different modalities, such as an image, a video, or natural language, containing or describing the object of interest. This workshop aims to bring together researchers, practitioners, and enthusiasts interested in object re-id to delve into the latest advancements, challenges, and opportunities in this dynamic field. The workshop covers a spectrum of topics related to object re-id, including but not limited to deep metric learning, multi-view data generation, video-based object re-id, cross-domain object re-id and real-world applications. The workshop provides a platform for researchers to showcase their work, exchange ideas, and foster potential collaborations. Additionally, it serves as a valuable opportunity for practitioners to stay abreast of the latest developments in object re-id technology.

References

[1]
Shutao Bai, Bingpeng Ma, Hong Chang, Rui Huang, and Xilin Chen. 2022. Salient-to-broad transition for video person re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7339--7348.
[2]
Rotimi-Williams Bello, Ahmad Mohamed, and Abdullah Talib. 2022. Smart animal husbandry: A review of its data, applications, techniques, challenges and opportunities. Applications, Techniques, Challenges and Opportunities (May 8, 2022) (2022).
[3]
Chuchu Han, Zhedong Zheng, Kai Su, Dongdong Yu, Zehuan Yuan, Changxin Gao, Nong Sang, and Yi Yang. 2022. DMRNet: Learning discriminative features with decoupled networks and enriched pairs for one-step person search. IEEE Transactions on Pattern Analysis and Machine Intelligence (2022).
[4]
Minyue Jiang, Xuanmeng Zhang, Yue Yu, Zechen Bai, Zhedong Zheng, Zhigang Wang, Jian Wang, Xiao Tan, Hao Sun, Errui Ding, et al. 2021. Robust vehicle re-identification via rigid structure prior. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 4026--4033.
[5]
Xintong Jiang, Yaxiong Wang, Yujiao Wu, Bingwen Hu, and Xueming Qian. 2024. CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval. SIGIR (2024).
[6]
Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, and Xiaogang Wang. 2017. Person search with natural language description. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1970--1979.
[7]
Xin Lin, Li Zhu, Shuyu Yang, and Yaxiong Wang. 2023. Diff attention: A novel attention scheme for person re-identification. Comput. Vis. Image Underst., Vol. 228 (2023), 103623. https://doi.org/10.1016/J.CVIU.2023.103623
[8]
Yutian Lin, Liang Zheng, Zhedong Zheng, Yu Wu, Zhilan Hu, Chenggang Yan, and Yi Yang. 2019. Improving person re-identification by attribute and identity learning. Pattern Recognition, Vol. 95 (2019), 151--161.
[9]
Yutian Lin, Zhedong Zheng, Hong Zhang, Chenqiang Gao, and Yi Yang. 2020. Bayesian query expansion for multi-camera person re-identification. Pattern Recognition Letters, Vol. 130 (2020), 284--292.
[10]
Qian Liu, Xiubo Geng, Heyan Huang, Tao Qin, Jie Lu, and Daxin Jiang. 2021. MGRC: An End-to-End Multigranularity Reading Comprehension Model for Question Answering. IEEE Transactions on Neural Networks and Learning Systems (2021).
[11]
Xuelin Qian, Yanwei Fu, Yu-Gang Jiang, Tao Xiang, and Xiangyang Xue. 2017. Multi-scale deep learning architectures for person re-identification. In Proceedings of the IEEE international conference on computer vision. 5399--5408.
[12]
Xuelin Qian, Yanwei Fu, Tao Xiang, Yu-Gang Jiang, and Xiangyang Xue. 2019. Leader-based multi-scale attention deep architecture for person re-identification. IEEE transactions on pattern analysis and machine intelligence, Vol. 42, 2 (2019), 371--385.
[13]
Xuelin Qian, Yanwei Fu, Tao Xiang, Wenxuan Wang, Jie Qiu, Yang Wu, Yu-Gang Jiang, and Xiangyang Xue. 2018. Pose-normalized image generation for person re-identification. In Proceedings of the European conference on computer vision (ECCV). 650--667.
[14]
Xuelin Qian, Wenxuan Wang, Li Zhang, Fangrui Zhu, Yanwei Fu, Tao Xiang, Yu-Gang Jiang, and Xiangyang Xue. 2020. Long-term cloth-changing person re-identification. In Proceedings of the Asian Conference on Computer Vision.
[15]
Leigang Qu, Meng Liu, Wenjie Wang, Zhedong Zheng, Liqiang Nie, and Tat-Seng Chua. 2023. Learnable Pillar-based Re-ranking for Image-Text Retrieval. In Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1252--1261.
[16]
Chenggui Sun, Li Bin Song, and Lihang Ying. 2022. Product Re-identification System in Fully Automated Defect Detection. In International Conference on Smart Multimedia. Springer, 144--156.
[17]
Tingyu Wang, Zhedong Zheng, Yaoqi Sun, Chenggang Yan, Yi Yang, and Tat-Seng Chua. 2024. Multiple-environment Self-adaptive Network for Aerial-view Geo-localization. Pattern Recognition, Vol. 152 (2024), 110363.
[18]
Tingyu Wang, Zhedong Zheng, Chenggang Yan, Jiyong Zhang, Yaoqi Sun, Bolun Zheng, and Yi Yang. 2022b. Each part matters: Local patterns facilitate cross-view geo-localization. IEEE Transactions on Circuits and Systems for Video Technology (2022).
[19]
Tingyu Wang, Zhedong Zheng, Zunjie Zhu, Yaoqi Sun, Yi Yang, and Chenggang Yan. 2022c. Learning cross-view geo-localization embeddings via dynamic weighted decorrelation regularization. arXiv preprint arXiv:2211.05296 (2022).
[20]
Xiaodong Wang, Zhedong Zheng, Yang He, Fei Yan, Zhiqiang Zeng, and Yi Yang. 2021b. Soft Person Reidentification Network Pruning via Blockwise Adjacent Filter Decaying. IEEE Transactions on Cybernetics (2021).
[21]
Yaxiong Wang, Hao Yang, Xiuxiu Bai, Xueming Qian, Lin Ma, Jing Lu, Biao Li, and Xin Fan. 2021a. PFAN: Bi-Directional Image-Text Retrieval With Position Focused Attention Network. IEEE Trans. Multim., Vol. 23 (2021), 3362--3376. https://doi.org/10.1109/TMM.2020.3024822
[22]
Yaxiong Wang, Hao Yang, Xueming Qian, Lin Ma, Jing Lu, Biao Li, and Xin Fan. 2019. Position Focused Attention Network for Image-Text Matching. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI 2019, Macao, China, August 10--16, 2019, Sarit Kraus (Ed.). ijcai.org, 3792--3798. https://doi.org/10.24963/IJCAI.2019/526
[23]
Zheng Wang, Dan Xu, Zhedong Zheng, and Kui Jiang. 2022a. Multimedia Content Understanding in Harsh Environments. In Proceedings of the 30th ACM International Conference on Multimedia. 7372--7373.
[24]
Zhongdao Wang, Liang Zheng, Yixuan Liu, Yali Li, and Shengjin Wang. 2020. Towards real-time multi-object tracking. In European Conference on Computer Vision. Springer, 107--122.
[25]
Longhui Wei, Xiaobin Liu, Jianing Li, and Shiliang Zhang. 2018. VP-ReID: Vehicle and person re-identification system. In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval. 501--504.
[26]
Fan Yang, Zheng Wang, Yang Wu, Sakriani Sakti, and Satoshi Nakamura. 2022. Tackling multiple object tracking with complicated motions-re-designing the integration of motion and appearance. Image and Vision Computing, Vol. 124 (2022), 104514.
[27]
Shuyu Yang, Yinan Zhou, Zhedong Zheng, Yaxiong Wang, Li Zhu, and Yujiao Wu. 2023. Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark. In Proceedings of the 31st ACM International Conference on Multimedia, MM 2023, Ottawa, ON, Canada, 29 October 2023- 3 November 2023, Abdulmotaleb El-Saddik, Tao Mei, Rita Cucchiara, Marco Bertini, Diana Patricia Tobon Vallejo, Pradeep K. Atrey, and M. Shamim Hossain (Eds.). ACM, 4492--4501. https://doi.org/10.1145/3581783.3611709
[28]
Xuanmeng Zhang, Minyue Jiang, Zhedong Zheng, Xiao Tan, Errui Ding, and Yi Yang. 2020. Understanding image retrieval re-ranking: A graph neural network perspective. arXiv preprint arXiv:2012.07620 (2020).
[29]
Guoshuai Zhao, Chaofeng Zhang, Heng Shang, Yaxiong Wang, Li Zhu, and Xueming Qian. 2023. Generative label fused network for image-text matching. Knowl. Based Syst., Vol. 263 (2023), 110280. https://doi.org/10.1016/J.KNOSYS.2023.110280
[30]
Liang Zheng, Zhi Bie, Yifan Sun, Jingdong Wang, Chi Su, Shengjin Wang, and Qi Tian. 2016. Mars: A video benchmark for large-scale person re-identification. In European conference on computer vision. Springer, 868--884.
[31]
Liang Zheng, Liyue Shen, Lu Tian, Shengjin Wang, Jingdong Wang, and Qi Tian. 2015. Scalable person re-identification: A benchmark. In Proceedings of the IEEE international conference on computer vision. 1116--1124.
[32]
Zhedong Zheng, Minyue Jiang, Zhigang Wang, Jian Wang, Zechen Bai, Xuanmeng Zhang, Xin Yu, Xiao Tan, Yi Yang, Shilei Wen, et al. 2020a. Going beyond real data: A robust visual representation for vehicle re-identification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 598--599.
[33]
Zhedong Zheng, Tao Ruan, Yunchao Wei, Yi Yang, and Tao Mei. 2020b. VehicleNet: Learning robust visual representation for vehicle re-identification. IEEE Transactions on Multimedia, Vol. 23 (2020), 2683--2693.
[34]
Zhedong Zheng, Yujiao Shi, Tingyu Wang, Jun Liu, Jianwu Fang, Yunchao Wei, and Tat-seng Chua. 2023 a. UAVM '23: 2023 Workshop on UAVs in Multimedia: Capturing the World from a New Perspective. In Proceedings of the 31st ACM International Conference on Multimedia. 9715--9717. https://doi.org/10.1145/3581783.3610937
[35]
Zhedong Zheng, Xiaohan Wang, Nenggan Zheng, and Yi Yang. 2022a. Parameter-efficient person re-identification in the 3d space. IEEE Transactions on Neural Networks and Learning Systems (2022).
[36]
Zhedong Zheng, Yunchao Wei, and Yi Yang. 2020c. University-1652: A multi-view multi-source benchmark for drone-based geo-localization. In Proceedings of the 28th ACM international conference on Multimedia. 1395--1403.
[37]
Zhedong Zheng and Liang Zheng. 2024. Object Re-identification: Problems, Algorithms and Responsible Research Practice. In The Boundaries of Data. Amsterdam University Press.
[38]
Zhedong Zheng, Liang Zheng, and Yi Yang. 2017. Unlabeled samples generated by gan improve the person re-identification baseline in vitro. In Proceedings of the IEEE international conference on computer vision. 3754--3762.
[39]
Zhedong Zheng, Liang Zheng, Yi Yang, and Fei Wu. 2023 b. U-Turn: Crafting Adversarial Queries with Opposite-Direction Features. International Journal of Computer Vision, Vol. 131, 4 (2023), 835--854.
[40]
Zhedong Zheng, Jiayin Zhu, Wei Ji, Yi Yang, and Tat-Seng Chua. 2022b. 3D Magic Mirror: Clothing Reconstruction from a Single Image via a Causal Perspective. arXiv preprint arXiv:2204.13096 (2022).

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICMR '24: Proceedings of the 2024 International Conference on Multimedia Retrieval
May 2024
1379 pages
ISBN:9798400706196
DOI:10.1145/3652583
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 June 2024

Check for updates

Author Tags

  1. deep metric learning
  2. multi-view generation
  3. multimedia retrieval
  4. object re-identification
  5. representation learning

Qualifiers

  • Abstract

Conference

ICMR '24
Sponsor:

Acceptance Rates

Overall Acceptance Rate 254 of 830 submissions, 31%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 218
    Total Downloads
  • Downloads (Last 12 months)218
  • Downloads (Last 6 weeks)30
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media