skip to main content
10.1145/1452392.1452425acmconferencesArticle/Chapter ViewAbstractPublication Pagesicmi-mlmiConference Proceedingsconference-collections
research-article

Deducing the visual focus of attention from head pose estimation in dynamic multi-view meeting scenarios

Published: 20 October 2008 Publication History

Abstract

This paper presents our work on recognizing the visual focus of attention during dynamic meeting scenarios. We collected a new dataset of meetings, in which acting participants were to follow a predefined script of events, to enforce focus shifts of the remaining, unaware meeting members. Including the whole room, all in all, a total of 35 potential focus targets were annotated, of which some were moved or introduced spontaneously during the meeting. On this dynamic dataset, we present a new approach to deduce the visual focus by means of head orientation as a first clue and show, that our system recognizes the correct visual target in over 57% of all frames, compared to 47% when mapping head pose to the first-best intersecting focus target directly.

References

[1]
S. Ba and J. Odobez. A Cognitive and Unsupervised Map Adaptation Approach to the Recognition of Focus of Attention from Head Pose. In Proceedings of International Conference on Multimedia and Expo, 2007.
[2]
S. Ba and J. Odobez. Multi-party Focus of Attention Recognition in Meetings from Head Pose and Multimodal Contextual Cues. In Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2008.
[3]
C. Canton Ferrer, J. Casas, and M. Pardàs. Head Orientation Estimation using Particle Filtering in Multiview Scenarios. In Proceedings of the Second International Evaluation Workshop on Classification of Events, Activities and Relationships, 2007.
[4]
H. Ekenel, M. Fischer, and R. Stiefelhagen. Face Recognition in Smart Rooms. In Proceedings of 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, 2007.
[5]
Harold L. Kundel and Marcia Polansky. Measurement of Observer Agreement. Radiology, 228--303.
[6]
Keni Bernardin and Rainer Stiefelhagen. Audio-Visual Multi-Person Tracking and Identification for Smart Environments. In Proceedings of ACM Multimedia, 2007.
[7]
O. Lanz and R. Brunelli. Joint Bayesian Tracking of Head Location and Pose from Low-resolution Video. In Proceedings of the Second International Evaluation Workshop on Classification of Events, Activities and Relationships, 2007.
[8]
O. Lanz, P. Chippendale, and R. Brunelli. An Appearance-based Particle Filter for Visual Tracking in Smart Rooms. In Proceedings of the Second International Evaluation Workshop on Classification of Events, Activities and Relationships, 2007.
[9]
Michael Voit, Kai Nickel, and Rainer Stiefelhagen. Head Pose Estimation in Single- and Multi-view Environments - Results on the CLEAR'07 Benchmarks. In Proceedings of the Second International Evaluation Workshop on Classification of Events, Activities and Relationships, 2007.
[10]
Rainer Stiefelhagen. Tracking Focus of Attention in Meetings. In Proceedings of IEEE International Conference on Multimodal Interfaces, page 273, 2002.
[11]
Rainer Stiefelhagen, Rachel Bowers, and John Garofolo. Classification of Events, Activities and Relationships - Evaluation and Workshop. http://www.clear-evaluation.org/.
[12]
K. C. Smith, S. O. Ba, D. Gatica Perez, and J.-M. Odobez. Tracking the Multi-Person Wandering Visual Focus of Attention. In Proceedings of International Conference on Multimodal Interfaces, 2006.

Cited By

View all
  • (2024)Infants' Developing Environment: Integration of Computer Vision and Human Annotation to Quantify where Infants Go, what They Touch, and what They See2024 IEEE International Conference on Development and Learning (ICDL)10.1109/ICDL61372.2024.10644441(1-8)Online publication date: 20-May-2024
  • (2024)Evaluating Applicability of Machine Learning Models in Navigated Transcranial Magnetic Stimulation Systems2024 IEEE 25th International Conference of Young Professionals in Electron Devices and Materials (EDM)10.1109/EDM61683.2024.10615211(1790-1796)Online publication date: 28-Jun-2024
  • (2022)Automatic engagement estimation in smart education/learning settings: a systematic review of engagement definitions, datasets, and methodsSmart Learning Environments10.1186/s40561-022-00212-y9:1Online publication date: 12-Nov-2022
  • Show More Cited By

Index Terms

  1. Deducing the visual focus of attention from head pose estimation in dynamic multi-view meeting scenarios

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ICMI '08: Proceedings of the 10th international conference on Multimodal interfaces
      October 2008
      322 pages
      ISBN:9781605581989
      DOI:10.1145/1452392
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 20 October 2008

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. data collection
      2. dynamic meetings
      3. eye gaze
      4. head orientation
      5. visual focus of attention

      Qualifiers

      • Research-article

      Conference

      ICMI '08
      Sponsor:
      ICMI '08: INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES
      October 20 - 22, 2008
      Crete, Chania, Greece

      Acceptance Rates

      Overall Acceptance Rate 453 of 1,080 submissions, 42%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)10
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 15 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Infants' Developing Environment: Integration of Computer Vision and Human Annotation to Quantify where Infants Go, what They Touch, and what They See2024 IEEE International Conference on Development and Learning (ICDL)10.1109/ICDL61372.2024.10644441(1-8)Online publication date: 20-May-2024
      • (2024)Evaluating Applicability of Machine Learning Models in Navigated Transcranial Magnetic Stimulation Systems2024 IEEE 25th International Conference of Young Professionals in Electron Devices and Materials (EDM)10.1109/EDM61683.2024.10615211(1790-1796)Online publication date: 28-Jun-2024
      • (2022)Automatic engagement estimation in smart education/learning settings: a systematic review of engagement definitions, datasets, and methodsSmart Learning Environments10.1186/s40561-022-00212-y9:1Online publication date: 12-Nov-2022
      • (2021)Head pose estimation: A survey of the last ten yearsSignal Processing: Image Communication10.1016/j.image.2021.11647999(116479)Online publication date: Nov-2021
      • (2020)How Can a Robot Calculate the Level of Visual Focus of Human’s AttentionProceedings of International Joint Conference on Computational Intelligence10.1007/978-981-15-3607-6_27(329-342)Online publication date: 23-May-2020
      • (2019)Predicting the visual focus of attention in multi-person discussion videosProceedings of the 28th International Joint Conference on Artificial Intelligence10.5555/3367471.3367669(4504-4510)Online publication date: 10-Aug-2019
      • (2019)Visual Focus of Attention and Spontaneous Smile Recognition Based on Continuous Head Pose Estimation by Cascaded Multi-Task LearningInternational Journal of Pattern Recognition and Artificial Intelligence10.1142/S021800141940006833:07(1940006)Online publication date: 7-Jun-2019
      • (2019)Walking Your Virtual Dog: Analysis of Awareness and Proxemics with Simulated Support Animals in Augmented Reality2019 IEEE International Symposium on Mixed and Augmented Reality (ISMAR)10.1109/ISMAR.2019.000-8(157-168)Online publication date: Oct-2019
      • (2019)Classroom Attention Analysis Based on Multiple Euler Angles Constraint and Head Pose EstimationMultiMedia Modeling10.1007/978-3-030-37731-1_27(329-340)Online publication date: 24-Dec-2019
      • (2018)Physical-Virtual Agents for Healthcare SimulationProceedings of the 18th International Conference on Intelligent Virtual Agents10.1145/3267851.3267876(99-106)Online publication date: 5-Nov-2018
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media