research-article

Deep Fisher-Vector Descriptors for Image Retrieval and Scene Recognition

Authors:

Syed Sameed Husain,

Eng-Jon Ong,

Mohamed Faheem Thanveer,

Lisa Silva,

Miroslaw BoberAuthors Info & Claims

MVRMLM '24: Proceedings of 2024 ACM ICMR Workshop on Multimodal Video Retrieval

Pages 20 - 26

https://doi.org/10.1145/3664524.3675365

Published: 28 August 2024 Publication History

Get Access

Abstract

This study presents a novel architecture that significantly enhances the capabilities of large-scale image retrieval and recognition systems. We introduce a novel multi-stream Fisher vector network that integrates a convolutional neural network (CNN) with a Fisher Vector (FV) framework to optimize feature extraction and aggregation. The CNN component generates dense, deep convolutional descriptors, which are subsequently aggregated by the Fisher Vector method to enhance recognition accuracy. Importantly, the CNN and Fisher Vector model parameters are learnt simultaneously in an end-to-end manner. This allows us to account for the evolving distribution of deep descriptors over the course of the learning process. This integrated learning strategy results in a robust model that achieves excellent performance in both image retrieval and recognition tasks, as demonstrated on standard datasets.

References

[1]

R. Arandjelovic, P. Gronat, A. Torii, T. Pajdla, and J. Sivic. 2018. NetVLAD: CNN Architecture for Weakly Supervised Place Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 40, 6 (June 2018), 1437–1451.

Crossref

Google Scholar

[2]

J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. 2009. ImageNet: A Large-Scale Hierarchical Image Database. In IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 248–255.

Google Scholar

[3]

Albert Gordo, Jon Almazán, Jerome Revaud, and Diane Larlus. 2017. End-to-End Learning of Deep Visual Representations for Image Retrieval. International Journal of Computer Vision 124, 2 (Sep 2017).

Digital Library

Google Scholar

[4]

Syed Sameed Husain, Eng-Jon Ong, and Miroslaw Bober. 2019. ACTNET: End-to-End Learning of Feature Activations and Multi-stream Aggregation for Effective Instance Image Retrieval. International Journal of Computer Vision 129 (2019), 1432 – 1450.

Crossref

Google Scholar

[5]

Seongwon Lee, Suhyeon Lee, Hongje Seong, and Euntai Kim. 2023. Revisiting Self-Similarity: Structural Embedding for Image Retrieval. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 23412–23421.

Crossref

Google Scholar

[6]

Florent Perronnin, Yan Liu, Jorge Sanchez, and Herve Poirier. 2010. Large-scale image retrieval with compressed Fisher vectors. In IEEE Conference on Computer Vision and Pattern Recognition. 3384–3391.

Crossref

Google Scholar

[7]

F. Radenovic, A. Iscen, G. Tolias, Y. Avrithis, and O. Chum. 2018. Revisiting Oxford and Paris: Large-Scale Image Retrieval Benchmarking. In IEEE Conference on Computer Vision and Pattern Recognition. 5706–5715.

Google Scholar

[8]

F. Radenovic, G. Tolias, and O. Chum. 2018. Fine-tuning CNN Image Retrieval with No Human Annotation. IEEE Transactions on Pattern Analysis and Machine Intelligence (2018), 1–1.

Google Scholar

[9]

Mingxing Tan and Quoc Le. 2019. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In Proceedings of the 36th International Conference on Machine Learning(Proceedings of Machine Learning Research, Vol. 97), Kamalika Chaudhuri and Ruslan Salakhutdinov (Eds.). 6105–6114.

Google Scholar

[10]

Marvin Teichmann, Andre Araujo, Menglong Zhu, and Jack Sim. 2018. Detect-to-Retrieve: Efficient Regional Aggregation for Image Search. CoRR (2018).

Google Scholar

[11]

X. Wu, G. Irie, K. Hiramatsu, and K. Kashino. 2018. Weighted Generalized Mean Pooling for Deep Image Retrieval. In IEEE International Conference on Image Processing. 495–499.

Google Scholar

[12]

Jian Xu, Chunheng Wang, Cunzhao Shi, and Baihua Xiao. 2018. Weakly Supervised Soft-detection-based Aggregation Method for Image Retrieval. CoRR (2018).

Google Scholar

[13]

A. B. Yandex and V. Lempitsky. 2015. Aggregating Local Deep Features for Image Retrieval. In IEEE International Conference on Computer Vision. 1269–1277.

Google Scholar

[14]

M. Yang, D. He, M. Fan, B. Shi, X. Xue, F. Li, E. Ding, and J. Huang. 2021. DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV). 11752–11761.

Google Scholar

Index Terms

Deep Fisher-Vector Descriptors for Image Retrieval and Scene Recognition
1. Information systems
  1. Information retrieval
    1. Retrieval models and ranking

Recommendations

Image Retrieval Using Fused Deep Convolutional Features

This paper proposes an image retrieval using fused deep convolutional features to solve the semantic gap between low-level features and high-level semantic features of traditional contend-based image retrieval method. Firstly, the improved network ...
Deep convolutional features for image retrieval
Highlights
- A comprehensive study that explores deep convolutional features for CBIR.
- The ...
Abstract
Nowadays, the use of Convolutional Neural Networks (CNNs) has led to tremendous achievements in several computer vision challenges. CNN-based image retrieval methods vary in complexity, growing capacity, and execution time. This work ...
Food image recognition with deep convolutional features
UbiComp '14 Adjunct: Proceedings of the 2014 ACM International Joint Conference on Pervasive and Ubiquitous Computing: Adjunct Publication

In this paper, we report the feature obtained from the Deep Convolutional Neural Network boosts food recognition accuracy greatly by integrating it with conventional hand-crafted image features, Fisher Vectors with HoG and Color patches. In the ...

Comments

Information & Contributors

Information

Published In

MVRMLM '24: Proceedings of 2024 ACM ICMR Workshop on Multimodal Video Retrieval

June 2024

56 pages

ISBN:9798400706844

DOI:10.1145/3664524

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 28 August 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

Engineering and Physical Sciences Research Council

Conference

ICMR '24

Sponsor:

SIGMM

ICMR '24: International Conference on Multimedia Retrieval

June 10 - 14, 2024

Phuket, Thailand

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
62
Total Downloads

Downloads (Last 12 months)62
Downloads (Last 6 weeks)8

Reflects downloads up to 18 Feb 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Abstract

References

Index Terms

Recommendations

Image Retrieval Using Fused Deep Convolutional Features

Deep convolutional features for image retrieval

Food image recognition with deep convolutional features

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Funding Sources

Conference

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Login options

Full Access

View options

PDF

eReader

HTML Format

Share

Share this Publication link

Share on social media

Affiliations