Gaze Point Estimation for Four-Person-Table-Meeting Using Omnidirectional Camera

Published: 30 May 2024

Abstract

A group meeting around a table involves multiple people, and analyzing the communication in such a meeting requires understanding the interactions among them. In this paper, we propose a gaze estimation method for four-person table meetings that uses an omnidirectional camera. The camera is placed at the center of a four-sided table, and one participant sits at each side. The advantage of this setup is that the behavior of all participants can be observed simultaneously with a single camera; however, the problems caused by the equirectangular image representation, namely distortion and disconnectivity, must be addressed. We therefore generate four perspective images from each equirectangular image so that each perspective image, with a 90-degree field of view, covers one participant. Gaze estimation for a group meeting is then reformulated as two sub-tasks for a given subject: a classification task that determines which generated perspective image the subject is gazing at, and a regression task that computes the gaze point position within that image. A neural network is developed for this purpose, and experimental results demonstrate the effectiveness of the proposed method.
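
The abstract does not give the projection details, so the following is only a minimal sketch of how four 90-degree perspective views might be sampled from one equirectangular frame. The function names, the pinhole model, the nearest-neighbour sampling, and the view directions (yaw 0, 90, 180, 270 degrees) are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np

def equirect_to_perspective(equi, yaw_deg, fov_deg=90.0, out_size=256):
    """Sample a square pinhole view from an equirectangular image.

    Hypothetical helper: nearest-neighbour lookup, rotation about the
    vertical axis only (no pitch or roll).
    """
    h, w = equi.shape[:2]
    # Focal length in pixels for the requested horizontal field of view.
    f = 0.5 * out_size / np.tan(np.radians(fov_deg) / 2.0)
    # Output pixel grid with the origin at the principal point.
    coords = np.arange(out_size) - (out_size - 1) / 2.0
    x, y = np.meshgrid(coords, coords)
    z = np.full_like(x, f)
    # Ray direction of each pixel as longitude/latitude; yaw shifts longitude.
    lon = np.arctan2(x, z) + np.radians(yaw_deg)
    lat = np.arctan2(-y, np.hypot(x, z))  # image y points down
    # Longitude maps to columns (wrapping around), latitude to rows.
    u = np.floor((lon / (2.0 * np.pi) + 0.5) * w).astype(int) % w
    v = np.clip(np.floor((0.5 - lat / np.pi) * h).astype(int), 0, h - 1)
    return equi[v, u]

def four_views(equi, out_size=256):
    """Return four views, 90 degrees apart, that together cover the full circle."""
    return [equirect_to_perspective(equi, yaw, 90.0, out_size)
            for yaw in (0, 90, 180, 270)]

if __name__ == "__main__":
    # Synthetic frame whose pixel value equals its column index.
    frame = np.tile(np.arange(360), (180, 1))
    views = four_views(frame, out_size=5)
    print([view.shape for view in views])
```

Because the four views are 90 degrees apart and each spans 90 degrees horizontally, every longitude falls into exactly one view, which is what allows the gazed-at view to be treated as a four-way class label.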

Published In

ACM Other conferences
ICSCA '24: Proceedings of the 2024 13th International Conference on Software and Computer Applications
February 2024
395 pages
ISBN: 9798400708329
DOI: 10.1145/3651781
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. deep learning
  2. gaze estimation
  3. omnidirectional camera
  4. table meeting

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICSCA 2024
