skip to main content
10.1145/3581783.3613466acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
short-paper

pyUDLF: A Python Framework for Unsupervised Distance Learning Tasks

Published: 27 October 2023 Publication History

Abstract

The representation of multimedia content experienced tremendous advances in the last decades. Mainly supported by deep learning models, impressive results have been obtained. However, despite such advances in representation, the definition of similarity has been neglected. Effectively computing the similarity between representations remains a challenge. Traditional distance functions, such as the Euclidean distance, are not able to properly consider the relevant similarity information encoded in the dataset manifold. In fact, manifolds are essential to perception in many scenarios, such that exploiting the underlying structure of dataset manifolds plays a central role in multimedia content understanding and retrieval. In this paper, we present a framework for unsupervised distance learning which provides easy and uniform access to methods capable of considering the dataset manifold for redefining similarity. Such methods perform context-sensitive similarity learning based on more global measures, capable of improving the effectiveness of retrieval and machine learning tasks. The framework can use distance, similarity, or ranking information both as input and output and compute traditional retrieval effectiveness measures. Implemented as a wrapper in Python, the framework allows integration with a large number of Python libraries while keeping a back-end in C++ for efficiency. The paper also discusses diverse applications of the methods available in the pyUDLF framework, including image re-ranking, video retrieval, person re-ID, and pre-processing of distance measurements for clustering and classification.

References

[1]
S. Bai and X. Bai. 2016. Sparse Contextual Activation for Efficient Visual Re-Ranking. IEEE Transactions on Image Processing, Vol. 25, 3 (March 2016), 1056--1069.
[2]
Song Bai, Xiang Bai, Qi Tian, and Longin Jan Latecki. 2019. Regularized Diffusion Process on Bidirectional Context for Object Retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 41, 5 (2019), 1213--1226.
[3]
Xiang Bai, Song Bai, and Xinggang Wang. 2015. Beyond diffusion process: Neighbor set similarity for fast re-ranking. Information Sciences, Vol. 325 (2015), 342--354.
[4]
Lucas Barbosa de Almeida, Lucas Pascotti Valem, and Daniel Carlos Guimarães Pedronette. 2022. Graph Convolutional Networks and Manifold Ranking for Multimodal Video Retrieval. In 2022 IEEE International Conference on Image Processing, ICIP. 2811--2815.
[5]
M. Donoser and H. Bischof. 2013. Diffusion Processes for Retrieval Revisited. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 1320--1327.
[6]
Filipe Marcel Fernandes Goncc alves, Daniel Carlos Guimarães Pedronette, and Ricardo da Silva Torres. 2023. Regression by Re-Ranking. Pattern Recognit., Vol. 140 (2023), 109577.
[7]
Jiayan Jiang, Bo Wang, and Zhuowen Tu. 2011. Unsupervised metric learning by Self-Smoothing Operator. In Internation Conference on Computer Vision (ICCV). 794--801.
[8]
Chundi Liu, Guangwei Yu, Maksims Volkovs, Cheng Chang, Himanshu Rai, Junwei Ma, and Satya Krishna Gorti. 2019. Guided Similarity Separation for Image Retrieval. In Advances in Neural Information Processing Systems, Vol. 32.
[9]
Leonardo Tadeu Lopes and Daniel Carlos Guimarães Pedronette. 2023. Self-Supervised Clustering Based on Manifold Learning and Graph Convolutional Networks. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV). 5634--5643.
[10]
Leonardo Tadeu Lopes, Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette, Ivan Rizzo Guilherme, Joao Paulo Papa, Marcos Cleison Silva Santana, Danilo Colombo, GM Farinella, P Radeva, and J Braz. 2020. Manifold Learning-based Clustering Approach Applied to Anomaly Detection in Surveillance Videos. In Proceedings Of The 15th International Joint Conference On Computer Vision, Imaging And Computer Graphics Theory And Applications, Vol 5: Visapp. Scitepress, 404--412.
[11]
Daniel Carlos Guimarães Pedronette, Jurandy Almeida, and Ricardo da S. Torres. 2016. A graph-based ranked-list model for unsupervised distance learning on shape retrieval. Pattern Recognition Letters, Vol. 83, Part 3 (2016), 357 -- 367.
[12]
Daniel Carlos Guimarães Pedronette and Ricardo da S. Torres. 2012. Exploiting Contextual Information for Image Re-Ranking and Rank Aggregation. International Journal of Multimedia Information Retrieval, Vol. 1, 2 (2012), 115--128.
[13]
Daniel Carlos Guimarães Pedronette and Ricardo da S. Torres. 2013. Image Re-Ranking and Rank Aggregation based on Similarity of Ranked Lists. Pattern Recognition, Vol. 46, 8 (2013), 2350--2360.
[14]
Daniel Carlos Guimarães Pedronette and Ricardo da S. Torres. 2016. A correlation graph approach for unsupervised manifold learning in image retrieval tasks. Neurocomputing, Vol. 208 (2016), 66--79.
[15]
Daniel Carlos Guimarães Pedronette, Otávio A.B. Penatti, and Ricardo da S. Torres. 2014. Unsupervised manifold learning using Reciprocal kNN Graphs in image re-ranking and rank aggregation tasks. Image and Vision Computing, Vol. 32, 2 (2014), 120--130.
[16]
Daniel Carlos Guimarães Pedronette, Lucas Pascotti Valem, Jurandy Almeida, and Ricardo da Silva Torres. 2019. Multimedia Retrieval Through Unsupervised Hypergraph-Based Manifold Ranking. IEEE Trans. Image Process., Vol. 28, 12 (2019), 5824--5838.
[17]
Daniel Carlos Guimarães Pedronette, Lucas Pascotti Valem, and Ricardo da Silva Torres. 2021a. A BFS-Tree of ranking references for unsupervised manifold learning. Pattern Recognit., Vol. 111 (2021), 107666.
[18]
Daniel Carlos Guimarães Pedronette, Lucas Pascotti Valem, and Longin Jan Latecki. 2021b. Efficient Rank-Based Diffusion Process with Assured Convergence. J. Imaging, Vol. 7, 3 (2021), 49.
[19]
Vanessa Helena Pereira-Ferrero, Lucas Pascotti Valem, and Daniel Carlos Guimarães Pedronette. 2023. Feature augmentation based on manifold ranking and LS? for image classification. Expert Syst. Appl., Vol. 213, Part (2023), 118995.
[20]
João Gabriel Camacho Presotto, Samuel Felipe dos Santos, Lucas Pascotti Valem, Fábio Augusto Faria, João Paulo Papa, Jurandy Almeida, and Daniel Carlos Guimarães Pedronette. 2022. Weakly supervised learning based on hypergraph manifold ranking. J. Vis. Commun. Image Represent., Vol. 89 (2022), 103666.
[21]
Bionda Rozin, Vanessa Helena Pereira-Ferrero, Leonardo Tadeu Lopes, and Daniel Carlos Guimarães Pedronette. 2021. A rank-based framework through manifold learning for improved clustering tasks. Inf. Sci., Vol. 580 (2021), 202--220.
[22]
Lucas Pascotti Valem, Carlos Renan De Oliveira, Daniel Carlos Guimarães Pedronette, and Jurandy Almeida. 2018a. Unsupervised Similarity Learning through Rank Correlation and kNN Sets. ACM Trans. Multim. Comput. Commun. Appl., Vol. 14, 4 (2018), 80:1--80:23.
[23]
Lucas Pascotti Valem and Daniel Carlos Guimarães Pedronette. 2017. An Unsupervised Distance Learning Framework for Multimedia Retrieval. In ACM on International Conference on Multimedia Retrieval, ICMR. 107--111.
[24]
Lucas Pascotti Valem and Daniel Carlos Guimarães Pedronette. 2022. Person Re-ID through unsupervised hypergraph rank selection and fusion. Image Vis. Comput., Vol. 123 (2022), 104473.
[25]
Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette, and Jurandy Almeida. 2018b. Unsupervised similarity learning through Cartesian product of ranking references. Pattern Recognition Letters, Vol. 114 (2018), 41--52. Data Representation and Representation Learning for Video Analysis.
[26]
Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette, and Longin Jan Latecki. 2023 a. Graph Convolutional Networks based on manifold learning for semi-supervised image classification. Comput. Vis. Image Underst., Vol. 227 (2023), 103618.
[27]
Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette, and Longin Jan Latecki. 2023 b. Rank Flow Embedding for Unsupervised and Semi-Supervised Manifold Learning. IEEE Trans. Image Process., Vol. 32 (2023), 2811--2826.
[28]
Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette, Ricardo da S. Torres, Edson Borin, and Jurandy Almeida. 2015. Effective, Efficient, and Scalable Unsupervised Distance Learning in Image Retrieval Tasks. ACM International Conference on Multimedia Retrieval (ICMR) (2015).
[29]
Xingwei Yang, Xiang Bai, Longin Jan Latecki, and Zhuowen Tu. 2008. Improving Shape Retrieval by Learning Graph Transduction. In European Conference on Computer Vision (ECCV), Vol. 4. 788--801.
[30]
Xingwei Yang, Suzan Koknar-Tezel, and Longin Jan Latecki. 2009. Locally constrained diffusion process on locally densified distance spaces with applications to shape retrieval. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 357--364.
[31]
Xingwei Yang, L. Prasad, and L.J. Latecki. 2013. Affinity Learning with Diffusion on Tensor Product Graph. IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 35, 1 (2013), 28--38.

Cited By

View all
  • (2024)Neighbor Embedding Projection and Rank-Based Manifold Learning for Image Retrieval2024 37th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)10.1109/SIBGRAPI62404.2024.10716269(1-6)Online publication date: 30-Sep-2024

Index Terms

  1. pyUDLF: A Python Framework for Unsupervised Distance Learning Tasks

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MM '23: Proceedings of the 31st ACM International Conference on Multimedia
    October 2023
    9913 pages
    ISBN:9798400701085
    DOI:10.1145/3581783
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 27 October 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. distance learning
    2. framework
    3. multimedia retrieval
    4. unsupervised

    Qualifiers

    • Short-paper

    Funding Sources

    • Brazilian National Council for Scientific and Technological Development - CNPq
    • Petrobras
    • São Paulo Research Foundation - FAPESP

    Conference

    MM '23
    Sponsor:
    MM '23: The 31st ACM International Conference on Multimedia
    October 29 - November 3, 2023
    Ottawa ON, Canada

    Acceptance Rates

    Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)26
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Neighbor Embedding Projection and Rank-Based Manifold Learning for Image Retrieval2024 37th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)10.1109/SIBGRAPI62404.2024.10716269(1-6)Online publication date: 30-Sep-2024

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media