skip to main content
10.1145/3581783.3611880acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

FeaCo: Reaching Robust Feature-Level Consensus in Noisy Pose Conditions

Published: 27 October 2023 Publication History

Abstract

Collaborative perception offers a promising solution to overcome challenges such as occlusion and long-range data processing. However, limited sensor accuracy leads to noisy poses that misalign observations among vehicles. To address this problem, we propose the FeaCo, which achieves robust Feature-level Consensus among collaborating agents in noisy pose conditions without additional training. We design an efficient Pose-error Rectification Module (PRM) to align derived feature maps from different vehicles, reducing the adverse effect of noisy pose and bandwidth requirements. We also provide an effective multi-scale Cross-level Attention Module (CAM) to enhance information aggregation and interaction between various scales. Our FeaCo outperforms all other localization rectification methods, as validated on both the collaborative perception simulation dataset OPV2V and real-world dataset V2V4Real, reducing heading error and enhancing localization accuracy across various error levels. Our code is available at: https://github.com/jmgu0212/FeaCo.git.

Supplemental Material

MP4 File
This is the presentation video of FeaCo. The field of collaborative perception and the specific situations in noisy pose scenario will be shown first. To address the misaligned observations, we propose the FeaCo, which achieves robust Feature-level Consensus among collaborating agents in noisy pose conditions without additional training. The overall framework and two novel modules will be explained. Pose-error Rectification Module (PRM) is proposed to align derived feature maps from different vehicles and multi-scale Cross-level Attention Module (CAM) is provided to enhance information aggregation between various scales. Experiments under different noise settings were conducted in both simulation and real-world datasets to better demonstrate the robustness and effectiveness of the proposed method.

References

[1]
David Astély, Erik Dahlman, Anders Furuskär, Ylva Jading, Magnus Lindström, and Stefan Parkvall. n Parkvall. 2009. LTE: the evolution of mobile broadband. IEEE Communications magazine 47, 4 (2009)44--51.
[2]
Azzedine Boukerche, Horacio ABF Oliveira, Eduardo F Nakamura, and Antonio AF Loureiro. 2008. Vehicular ad hoc networks: A new challenge for localization-based systems. Computer communications 31, 12 (2008), 2838--2849.
[3]
Qi Chen, Xu Ma, Sihai Tang, Jingda Guo, Qing Yang, and Song Fu. 2019. F-cooper: Feature based cooperative perception for autonomous vehicle edge computing system using 3D point clouds. In Proceedings of the 4th ACM/IEEE Symposium on Edge Computing. 88--100.
[4]
Qi Chen, Sihai Tang, Qing Yang, and Song Fu. 2019. Cooper: Cooperative perception for connected autonomous vehicles based on 3d point clouds. In 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS). IEEE, 514--524.
[5]
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, et al. 2020. An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. In International Conference on Learning Representations.
[6]
Alexey Dosovitskiy, German Ros, Felipe Codevilla, Antonio Lopez, and Vladlen Koltun. 2017. CARLA: An open urban driving simulator. In Conference on Robot Learning. PMLR, 1--16.
[7]
Ross Girshick. 2015. Fast r-cnn. In Proceedings of the IEEE International Conference on Computer Vision. 1440--1448.
[8]
Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 770--778.
[9]
Yue Hu, Shaoheng Fang, Zixing Lei, Yiqi Zhong, and Siheng Chen. 2022. Where2comm: Communication-efficient collaborative perception via spatial confidence maps. Advances in Neural Information Processing Systems 35 (2022),4874--4886.
[10]
Daniel Jiang and Luca Delgrossi. 2008. IEEE 802.11 p: Towards an international standard for wireless access in vehicular environments. In VTC Spring 2008-IEEE vehicular technology conference. IEEE, 2036--2040.
[11]
Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2017. Imagenet classification with deep convolutional neural networks. Commun. ACM 60, 6 (2017), 84--90.
[12]
Alex H Lang, Sourabh Vora, Holger Caesar, Lubing Zhou, Jiong Yang, and Oscar Beijbom. 2019. Pointpillars: Fast encoders for object detection from point clouds. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 12697--12705.
[13]
Yiming Li, Dekun Ma, Ziyan An, Zixun Wang, Yiqi Zhong, Siheng Chen, and Chen Feng. 2022. V2X-Sim: Multi-agent collaborative perception dataset and benchmark for autonomous driving. IEEE Robotics and Automation Letters 7, 4 (2022), 10914--10921.
[14]
Yiming Li, Shunli Ren, Pengxiang Wu, Siheng Chen, Chen Feng, and Wenjun Zhang. 2021. Learning distilled collaboration graph for multi-agent perception. Advances in Neural Information Processing Systems 34 (2021), 29541--29552.
[15]
Zechen Liu, Zizhang Wu, and Roland Tóth. 2020. Smoke: Single-stage monocular 3d object detection via keypoint estimation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 996--997.
[16]
Yifan Lu, Quanhao Li, Baoan Liu, Mehrdad Dianati, Chen Feng, Siheng Chen, and Yanfeng Wang. 2023. Robust collaborative 3d object detection in presence of pose errors. In 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 4812--4818.
[17]
Szymon Rusinkiewicz and Marc Levoy. 2001. Efficient variants of the ICP algorithm. In Proceedings third international conference on 3-D digital imaging and modeling. IEEE, 145--152.
[18]
Zhiying Song, Fuxi Wen, Hailiang Zhang, and Jun Li. 2022. An Efficient and Robust Object-Level Cooperative Perception Framework for Connected and Automated Driving. arXiv preprint arXiv:2210.06289 (2022).
[19]
Nicholas Vadivelu, Mengye Ren, James Tu, Jingkang Wang, and Raquel Urtasun.2021. Learning to communicate and correct pose errors. In Conference on Robot Learning. PMLR, 1195--1210.
[20]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in Neural Information Processing Systems 30 (2017).
[21]
Tsun-Hsuan Wang, Sivabalan Manivasagam, Ming Liang, Bin Yang, Wenyuan Zeng, and Raquel Urtasun. 2020. V2vnet: Vehicle-to-vehicle communication for joint perception and prediction. In Computer Vision-ECCV 2020: 16th European Conference, Glasgow, UK, August 23-28, 2020, Proceedings, Part II 16. Springer, 605--621.
[22]
Runsheng Xu, Yi Guo, Xu Han, Xin Xia, Hao Xiang, and Jiaqi Ma. 2021. OpenCDA: an open cooperative driving automation framework integrated with co-simulation. In 2021 IEEE International Intelligent Transportation Systems Conference (ITSC). IEEE, 1155--1162.
[23]
Runsheng Xu, Zhengzhong Tu, Hao Xiang, Wei Shao, Bolei Zhou, and Jiaqi Ma. 2022. CoBEVT: Cooperative Bird's Eye View Semantic Segmentation with Sparse Transformers. In 6th Annual Conference on Robot Learning.
[24]
Runsheng Xu, Xin Xia, Jinlong Li, Hanzhao Li, Shuo Zhang, Zhengzhong Tu, Zonglin Meng, Hao Xiang, Xiaoyu Dong, Rui Song, et al. 2023. V2v4real: A real-world large-scale dataset for vehicle-to-vehicle cooperative perception. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 13712--13722.
[25]
Runsheng Xu, Hao Xiang, Zhengzhong Tu, Xin Xia, Ming-Hsuan Yang, and Jiaqi Ma. 2022. V2X-ViT: Vehicle-to-everything cooperative perception with vision transformer. In Computer Vision-ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXXIX. Springer, 107--124.
[26]
Runsheng Xu, Hao Xiang, Xin Xia, Xu Han, Jinlong Li, and Jiaqi Ma. 2022. Opv2v: An open benchmark dataset and fusion pipeline for perception with vehicle-to-vehicle communication. In 2022 International Conference on Robotics and Automation (ICRA). IEEE, 2583--2589.
[27]
Yan Yan, Yuxing Mao, and Bo Li. 2018. Second: Sparsely embedded convolutional detection. Sensors 18, 10 (2018), 3337.
[28]
Haibao Yu, Yizhen Luo, Mao Shu, Yiyi Huo, Zebang Yang, Yifeng Shi, Zhenglong Guo, Hanyu Li, Xing Hu, Jirui Yuan, et al. 2022. Dair-v2x: A large-scale dataset for vehicle-infrastructure cooperative 3d object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 21361--21370.
[29]
Yunshuang Yuan, Hao Cheng, and Monika Sester. 2022. Keypoints-based deep feature fusion for cooperative vehicle detection of autonomous driving. IEEE Robotics and Automation Letters 7, 2 (2022), 3054--3061.
[30]
Yin Zhou and Oncel Tuzel. 2018. Voxelnet: End-to-end learning for point cloud based 3d object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 4490--4499.

Cited By

View all
  • (2025)Towards Communication-Efficient Cooperative Perception via Planning-Oriented Feature SharingIEEE Transactions on Mobile Computing10.1109/TMC.2024.349685624:4(2551-2563)Online publication date: Apr-2025
  • (2024)FeaKM: Robust Collaborative Perception under Noisy Pose ConditionsProceedings of the 2024 4th International Joint Conference on Robotics and Artificial Intelligence10.1145/3696474.3696686(98-102)Online publication date: 13-Sep-2024
  • (2024)RoCo: Robust Cooperative Perception By Iterative Object Matching and Pose AdjustmentProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680559(7833-7842)Online publication date: 28-Oct-2024
  • Show More Cited By

Index Terms

  1. FeaCo: Reaching Robust Feature-Level Consensus in Noisy Pose Conditions

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '23: Proceedings of the 31st ACM International Conference on Multimedia
    October 2023
    9913 pages
    ISBN:9798400701085
    DOI:10.1145/3581783
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 October 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. collaborative perception
    2. feature-level rectification
    3. intermediate fusion
    4. noisy pose conditions
    5. robustness

    Qualifiers

    • Research-article

    Conference

    MM '23
    Sponsor:
    MM '23: The 31st ACM International Conference on Multimedia
    October 29 - November 3, 2023
    Ottawa ON, Canada

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)154
    • Downloads (Last 6 weeks)18
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2025)Towards Communication-Efficient Cooperative Perception via Planning-Oriented Feature SharingIEEE Transactions on Mobile Computing10.1109/TMC.2024.349685624:4(2551-2563)Online publication date: Apr-2025
    • (2024)FeaKM: Robust Collaborative Perception under Noisy Pose ConditionsProceedings of the 2024 4th International Joint Conference on Robotics and Artificial Intelligence10.1145/3696474.3696686(98-102)Online publication date: 13-Sep-2024
    • (2024)RoCo: Robust Cooperative Perception By Iterative Object Matching and Pose AdjustmentProceedings of the 32nd ACM International Conference on Multimedia10.1145/3664647.3680559(7833-7842)Online publication date: 28-Oct-2024
    • (2024)Efficient Vehicular Collaborative Perception Based on Saptial-Temporal Feature CompressionIEEE Transactions on Vehicular Technology10.1109/TVT.2024.340326373:11(16125-16133)Online publication date: Nov-2024
    • (2024)End-to-end Cooperative Localization via Neural Feature Sharing2024 IEEE Intelligent Vehicles Symposium (IV)10.1109/IV55156.2024.10588492(553-558)Online publication date: 2-Jun-2024
    • (2024)A Survey on Intermediate Fusion Methods for Collaborative Perception Categorized by Real World Challenges2024 IEEE Intelligent Vehicles Symposium (IV)10.1109/IV55156.2024.10588382(2226-2233)Online publication date: 2-Jun-2024
    • (2024)ERMVP: Communication-Efficient and Collaboration-Robust Multi-Vehicle Perception in Challenging Environments2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)10.1109/CVPR52733.2024.01195(12575-12584)Online publication date: 16-Jun-2024

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media