short-paper

DMPCANet: A Low Dimensional Aggregation Network for Visual Place Recognition

Authors:

Yinghao Wang,

Haonan Chen,

Jiong Wang,

Yingying Zhu

ICMR '22: Proceedings of the 2022 International Conference on Multimedia Retrieval

Pages 24 - 28

https://doi.org/10.1145/3512527.3531427

Published: 27 June 2022

Abstract

Visual place recognition (VPR) aims to estimate the geographical location of a query image by finding its nearest reference images in a large geo-tagged database. Most existing methods use convolutional neural networks to extract feature maps from images. Such feature maps are high-dimensional tensors, however, and aggregating them effectively into a compact vector representation for efficient retrieval is a challenge. To tackle this challenge, we develop an end-to-end convolutional neural network architecture named DMPCANet. The network adopts a regional pooling module to generate feature tensors of the same size from images of different sizes. The core component of our network, the Differentiable Multilinear Principal Component Analysis (DMPCA) module, acts directly on tensor data and uses convolution operations to generate projection matrices for dimensionality reduction, reducing the dimensionality to one-sixteenth. This module preserves crucial information while reducing data dimensions. Experiments on two widely used place recognition datasets demonstrate that our proposed DMPCANet can generate low-dimensional discriminative global descriptors and achieve state-of-the-art results.
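The multilinear projection mentioned in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the tensor shape (16, 4, 4), the target shape (4, 2, 2), and the helper names `mode_product`, `project`, and `flatten` are assumptions chosen only so that the reduction comes out to one-sixteenth, and the random projection matrices stand in for the matrices that DMPCA generates with convolution operations, which the abstract does not specify.

```python
import random


def mode_product(tensor, matrix, mode):
    """Multiply a 3-way tensor (nested lists) by `matrix` along axis `mode`.

    `matrix` has shape (new_dim, old_dim): the chosen axis shrinks from
    old_dim to new_dim while the other two axes keep their size.
    """
    dims = [len(tensor), len(tensor[0]), len(tensor[0][0])]
    old = dims[mode]
    assert all(len(row) == old for row in matrix), "matrix/tensor size mismatch"
    dims[mode] = len(matrix)

    def entry(i, j, k):
        idx = [i, j, k]
        total = 0.0
        for t in range(old):
            src = idx.copy()
            src[mode] = t  # walk along the axis being projected
            total += matrix[idx[mode]][t] * tensor[src[0]][src[1]][src[2]]
        return total

    return [[[entry(i, j, k) for k in range(dims[2])]
             for j in range(dims[1])]
            for i in range(dims[0])]


def project(tensor, matrices):
    """Apply one projection matrix per mode, in order (modes 0, 1, 2)."""
    for mode, matrix in enumerate(matrices):
        tensor = mode_product(tensor, matrix, mode)
    return tensor


def flatten(tensor):
    """Unroll the projected tensor into a global descriptor vector."""
    return [v for plane in tensor for row in plane for v in row]


rng = random.Random(0)
C, H, W = 16, 4, 4  # hypothetical size of the pooled feature tensor

feature = [[[rng.gauss(0.0, 1.0) for _ in range(W)]
            for _ in range(H)]
           for _ in range(C)]

# Random stand-ins for the learned projection matrices (assumption).
projections = [
    [[rng.gauss(0.0, 1.0) for _ in range(C)] for _ in range(4)],  # 16 -> 4
    [[rng.gauss(0.0, 1.0) for _ in range(H)] for _ in range(2)],  # 4 -> 2
    [[rng.gauss(0.0, 1.0) for _ in range(W)] for _ in range(2)],  # 4 -> 2
]

descriptor = flatten(project(feature, projections))
ratio = len(descriptor) / (C * H * W)
print(len(descriptor), ratio)
```

Because each mode is projected separately, the tensor never has to be flattened into one long vector before reduction, which is the usual motivation for multilinear PCA over ordinary PCA. In this sketch the 256-element tensor becomes a 16-element descriptor.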

Supplementary Material

MP4 File (MPCA-ICMR2022-v2.mp4)

DMPCANet Presentation video-short version


MP4 File (icmr22-sp099.mp4)

We develop an end-to-end convolutional neural network architecture named DMPCANet for visual place recognition. The network adopts the regional pooling module to generate feature tensors of the same size from images of different sizes. The core component of our network, the DMPCA module, preserves crucial information while reducing data dimensions. The module directly acts on tensor data and utilizes convolution operations to generate projection matrices for dimensionality reduction, thereby reducing the dimensionality to one-sixteenth. Extensive experiments on two widely used place recognition datasets demonstrate that our proposed DMPCANet can generate low-dimensional discriminative global descriptors and achieve state-of-the-art results.


