keynote

Graphs in Computer Vision then and now: how Deep Learning has reinvigorated Structural Pattern Recognition

Author:
Donatello Conte

Computer Science Laboratory (LIFAT - EA 6300), Université de Tours, France

Computer Science Laboratory (LIFAT - EA 6300), Université de Tours, France
View Profile

WWW '22: Companion Proceedings of the Web Conference 2022April 2022Pages 1007–1008https://doi.org/10.1145/3487553.3526096

Published:16 August 2022Publication History

WWW '22: Companion Proceedings of the Web Conference 2022

Pages 1007–1008

ABSTRACT

Computer Vision Problems, such as object detection, object tracking, action recognition and so on, have been, in the past, usually addressed through Statistical Pattern Recognition techniques. SVM, Regression or Neural Networks, are some examples of classical statistical techniques that have been used, quite effectively, in many application contexts of computer vision.

Nevertheless, some attempts have been proposed using more complex data structures (notably graphs) for solving Computer Vision Tasks. However, in terms of performances, their use did not have the same success as techniques based on vector representations. First part of this talk will present some of these proposals, in the context of object tracking ([1]), people re-identification ([3]) and action recognition ([2]). An graph representation is proposed in [1] to deal with occlusion problem. The representation is based on a graph pyramid, namely, each moving region is represented at different levels of resolution using a graph for each level. The algorithm compares the topmost levels of each pyramid in the association phase between moving objects in two consecutive frames. If the comparison outcome is sufficient to assign a label to each node the tracking algorithm stops. Instead, if some ambiguities arise (as it is the case when two objects over- lap), the algorithm is repeated using the next levels of the pyramids, until either a consistent labelling is found. The purpose of re-identification (re-id) is to identify people coming back into the field of view of a camera or to recognize an individual through different cameras in a distributed network. At the heart of the process there is a comparison between signatures given probe and gallery sets. In [3] graphs are used to represent people appearance and comparison is done by means of Graph Kernels. Finally, action recognition is a classification problem in which each video representing an action has to be classified with the correct action label. In [2] we proposed to represent videos using graph sequences and proposed a model inspired from bag-of-words techniques to classify a sequence.

Recently, graphs have gained a lot of attention in the Computer Vision community thanks to the use of this kind of data within deep learning techniques. Graph Neural Networks have demonstrated their effectiveness in solving Computer Vision problems, and in some cases recent proposals have bridged the gap between statistical and structural pattern recognition. Second part of the talk will be devoted to illustrate some of these examples ([4, 5, 6]). Starting from the already mentioned applications in Computer Vision (object tracking, action recognition), we will discuss the new proposals based on Deep Learning with graphs and the open problems in this context.

References

Donatello Conte, Pasquale Foggia, Jean-Michel Jolion, and Mario Vento. 2006. A graph-based, multi-resolution algorithm for tracking objects in presence of occlusions. Pattern Recognition 39, 4 (2006), 562–572.Google ScholarDigital Library
Xavier Cortés, Donatello Conte, and Hubert Cardot. 2018. Bags of graphs for human action recognition. In Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR). Springer, 429–438.Google ScholarDigital Library
Amal Mahboubi, Luc Brun, and Donatelo Conte. 2018. A structural approach to Person Re-identification problem. In 2018 24th International Conference on Pattern Recognition (ICPR). IEEE, 1616–1621.Google ScholarCross Ref
Akshay Rangesh, Pranav Maheshwari, Mez Gebre, Siddhesh Mhatre, Vahid Ramezani, and Mohan M Trivedi. 2021. Trackmpnn: A message passing graph neural architecture for multi-object tracking. arXiv preprint arXiv:2101.04206(2021).Google Scholar
Zonghan Wu, Shirui Pan, Fengwen Chen, Guodong Long, Chengqi Zhang, and S Yu Philip. 2020. A comprehensive survey on graph neural networks. IEEE transactions on neural networks and learning systems 32, 1(2020), 4–24.Google ScholarCross Ref
Sijie Yan, Yuanjun Xiong, and Dahua Lin. 2018. Spatial temporal graph convolutional networks for skeleton-based action recognition. In Thirty-second AAAI conference on artificial intelligence.Google ScholarCross Ref

Index Terms

Graphs in Computer Vision then and now: how Deep Learning has reinvigorated Structural Pattern Recognition
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
  2. Machine learning
    1. Machine learning approaches

Recommendations

Three-dimensional representations for computer graphics and computer vision

Representing complex three-dimensional objects in a computer involves more than just evaluating its display capabilities. Other factors are the uses and costs of the representation, what operations can be performed on it and, ultimately, how useful it ...
Read More
Three-dimensional representations for computer graphics and computer vision
SIGGRAPH '78: Proceedings of the 5th annual conference on Computer graphics and interactive techniques

Representing complex three-dimensional objects in a computer involves more than just evaluating its display capabilities. Other factors are the uses and costs of the representation, what operations can be performed on it and, ultimately, how useful it ...
Read More
Facial Expression Recognition from Occluded Images Using Deep Convolution Neural Network with Vision Transformer
Image and Graphics
Abstract
Facial expression recognition (FER) is a challenging task due to various unrestricted conditions. Normal facial expression algorithms work well on frontal faces. However, detection expression from the occluded faces is still a challenging task. In ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WWW '22: Companion Proceedings of the Web Conference 2022
April 2022
1338 pages
ISBN:9781450391306
DOI:10.1145/3487553
Editors:
Frédérique Laforest
INSA Lyon, France
,
Raphaël Troncy
EURECOM, France
,
Lionel Médini
Université Lyon 1, France
,
Ivan Herman
W3C / retired
Copyright © 2022 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 16 August 2022
Check for updates
Author Tags
Computer Vision
Graph Neural Networks
Structural Pattern Recognition
Qualifiers
- keynote
- Research
- Refereed limited
Conference

Acceptance Rates
Overall Acceptance Rate1,899of8,196submissions,23%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 62
  Total Downloads
- Downloads (Last 12 months)17
- Downloads (Last 6 weeks)4
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format .

View HTML Format

Graphs in Computer Vision then and now: how Deep Learning has reinvigorated Structural Pattern Recognition

WWW '22: Companion Proceedings of the Web Conference 2022

ABSTRACT

References

Cited By

Index Terms

Recommendations

Three-dimensional representations for computer graphics and computer vision

Three-dimensional representations for computer graphics and computer vision

Facial Expression Recognition from Occluded Images Using Deep Convolution Neural Network with Vision Transformer

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

HTML Format

Caption

Graphs in Computer Vision then and now: how Deep Learning has reinvigorated Structural Pattern Recognition

WWW '22: Companion Proceedings of the Web Conference 2022

ABSTRACT

References

Cited By

Index Terms

Recommendations

Three-dimensional representations for computer graphics and computer vision

Three-dimensional representations for computer graphics and computer vision

Facial Expression Recognition from Occluded Images Using Deep Convolution Neural Network with Vision Transformer

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

HTML Format

Share this Publication link

Share on Social Media