ABSTRACT
We present a method for analyzing the relationship between eye movements and saliency dynamics in videos in order to estimate the attentive states of users as they watch the videos. We introduce the multi-mode saliency-dynamics model (MMSDM), which segments spatio-temporal patterns of saliency dynamics into multiple sequences of primitive modes underlying the saliency patterns. The MMSDM lets us describe the relationship in terms of the local saliency dynamics around gaze points, modeled as a set of distances between the gaze points and the salient regions characterized by the extracted modes. Experimental results demonstrate the effectiveness of the proposed model in classifying the attentive states of users by learning the statistical differences in the local saliency dynamics along gaze paths at each level of attentiveness.
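The abstract describes the core feature as a set of distances between gaze points and salient regions along a gaze path. The paper's exact feature construction is not given here, so the following is only a minimal sketch under assumed conventions: gaze points and salient-region centers are 2-D image coordinates, one gaze point and a fixed number of region centers per frame, and the per-frame feature is the sorted vector of gaze-to-region distances (all names, e.g. `local_saliency_features`, are hypothetical).

```python
import numpy as np

def local_saliency_features(gaze, regions):
    """Distances from one gaze point to each salient-region center.

    gaze: (2,) array of image coordinates.
    regions: (k, 2) array of salient-region centers for the same frame.
    Returns the k distances sorted nearest-first, so the feature does
    not depend on an arbitrary ordering of the regions.
    """
    d = np.linalg.norm(regions - gaze, axis=1)
    return np.sort(d)

def gaze_path_features(gaze_path, region_paths):
    """Stack per-frame distance features along a whole gaze path.

    gaze_path: (T, 2) gaze point per frame.
    region_paths: (T, k, 2) region centers per frame.
    Returns a (T, k) feature matrix; per-class statistics of such
    matrices could then feed an attentive-state classifier.
    """
    return np.stack([local_saliency_features(g, r)
                     for g, r in zip(gaze_path, region_paths)])

# Toy example: 5 frames, 3 salient regions per frame, synthetic data.
rng = np.random.default_rng(0)
gaze_path = rng.uniform(0, 100, size=(5, 2))
region_paths = rng.uniform(0, 100, size=(5, 3, 2))
feats = gaze_path_features(gaze_path, region_paths)
print(feats.shape)  # (5, 3)
```

In this sketch the classifier itself is left out; the point is only the shape of the "local saliency dynamics" representation: one distance vector per frame, aggregated over the gaze path.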