Skip to main content

Advertisement

Log in

Cross-task cognitive workload estimation using eye tracking

  • Original Paper
  • Published:
Signal, Image and Video Processing Aims and scope Submit manuscript

Abstract

Cognitive workload is a key factor in understanding human cognitive performance, especially in scenarios that require intensive information processing. This study introduces an innovative method to estimate cognitive workload using eye-tracking data and proposes a novel deep learning model called BiTCADNet (Bidirectional Temporal Convolutional self-Attention Dense Network). Experiments using the newly created dataset "Cognitive-Eye-Movement" and the publicly available dataset "CL-Drive" show that BiTCADNet significantly outperforms traditional deep learning models in terms of accuracy, precision, recall, and F1 scores are significantly better than traditional machine learning methods. The proposed method provides a more effective way to monitor and evaluate cognitive workload in real-time, opening the way for its applications in various human-computer interaction environments.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Algorithm 1
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Explore related subjects

Discover the latest articles and news from researchers in related subjects, suggested using machine learning.

Data Availability

All the data contained in this study can be obtained upon request to the corresponding author.

References

  1. Kosch, T., et al.: A survey on measuring cognitive workload in human–computer interaction. ACM Comput. Surv. 55, 283:1-283:39 (2023)

    Article  MATH  Google Scholar 

  2. Ehrmann, D.E., et al.: Evaluating and reducing cognitive load should be a priority for machine learning in healthcare. Nat. Med. 28, 1331–1333 (2022)

    Article  MATH  Google Scholar 

  3. Sweller, J., Van Merriënboer, J.J.G., Paas, F.: Cognitive architecture and instructional design: 20 years later. Educ. Psychol. Rev. 31, 261–292 (2019)

    Article  Google Scholar 

  4. Hart, S.G.: Nasa-task load index (NASA-TLX); 20 years later. Proc. Hum. Fact. Ergon. Soc. Annu. Meet. 50, 904–908 (2006)

    Article  Google Scholar 

  5. Ktistakis, E., et. al.: COLET: A dataset for COgnitive workLoad estimation based on eye-tracking. Comput. Methods Programs Biomed. 224 (2022)

  6. Gjoreski, M., et al.: Datasets for cognitive load inference using wearable sensors and psychological traits. Appl. Sci. 10, 3843 (2020)

    Article  MATH  Google Scholar 

  7. Oppelt, M.P., et al.: ADABase: a multimodal dataset for cognitive load estimation. Sensors 23, 340 (2023)

    Article  MATH  Google Scholar 

  8. Salehi Fadardi, M., Salehi Fadardi, J., Mahjoob, M. Doosti, H.: Post-saccadic eye movement indices under cognitive load: a path analysis to determine visual performance. J. Ophthalm. Vis. Res. (2022)

  9. Müller, D., Mann, D.: Algorithmic gaze classification for mobile eye-tracking (2021)

  10. Appel, T., et al.: Cross-task and cross-participant classification of cognitive load in an emergency simulation game. IEEE Trans. Affect. Comput. 14, 1558–1571 (2023)

    Article  MATH  Google Scholar 

  11. Lu, X., Zhao, Z., Ke, W., Yan, Q., Liu, Z.: Young-gaze: an appearance-based gaze estimation solution for adolescents. SIViP 18, 7145–7155 (2024)

  12. Deng, F., Bi, Y., Liu, Y., Yang, S.: Remaining useful life prediction of machinery: a new multiscale temporal convolutional network framework. IEEE Trans. Instrum. Meas. 71, 1–13 (2022)

    MATH  Google Scholar 

  13. Castilla, D., et al.: Improving the understanding of web user behaviors through machine learning analysis of eye-tracking data. User Model. User-Adapt. Interact. (2023)

  14. Zhao, Q., Yang, L., Lyu, N.: A driver stress detection model via data augmentation based on deep convolutional recurrent neural network. Expert Syst. Appl. 238 (2024)

  15. Bengio, Y., Courville, A., Vincent, P.: A Review and New Perspectives, Representation Learning (2014). arXiv:1206.5538

  16. Chen, Y., Jiang, H., Li, C., Jia, X., Ghamisi, P.: Deep feature extraction and classification of hyperspectral images based on convolutional neural networks. IEEE Trans. Geosci. Remote Sens. 54, 6232–6251 (2016)

    Article  MATH  Google Scholar 

  17. Tang, P., Du, P., Xia, J., Zhang, P., Zhang, W.: Channel attention-based temporal convolutional network for satellite image time series classification. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2022)

    MATH  Google Scholar 

  18. Elmadjian, C., Gonzales, C., da Costa, R.L., Morimoto, C.H.: Online eye-movement classification with temporal convolutional networks. Behav. Res. Methods (2022)

  19. Bai, S., Kolter, J.Z., Koltun, V.: An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. (2018). arXiv:1803.01271

  20. Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks (2017)

  21. Yu, F., Koltun, V.: Multi-Scale Context Aggregation by Dilated Convolutions (2016). arXiv:1511.07122

  22. Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift (2015)

  23. Nair, V., Hinton, G.E.: Rectified linear units improve restricted Boltzmann machines (2010)

  24. Angkan, P., et al.: Multimodal brain–computer interface for in-vehicle driver cognitive load measurement: dataset and baselines. IEEE Trans. Intell. Transp. Syst. 1–16 (2024)

  25. Zhong, X., Hou, W.: A method for classifying cognitive load of visual tasks based on eye tracking features 131–138 (2023)

  26. Powers, D.M.W.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation (2020). arXiv:2010.16061

  27. Setianto, E.J., Contessa Djamal, E., Nugraha, F., Kasyidi, F.: Eye tracking and emotion recognition using multiple spatial-temporal networks (2022)

  28. Ansari, S., Naghdy, F., Du, H., Pahnwar, Y.: Driver mental fatigue detection based on head posture using new modified reLU-BiLSTM deep neural network. IEEE Trans. Intell. Transp. Syst. 23, 10957–10969 (2022)

    Article  Google Scholar 

  29. Tang, W., et al.: Omni-scale CNNs: a simple and effective kernel size configuration for time series classification (2022). https://openreview.net/forum?id=PDYs7Z2XFGv

Download references

Funding

This work was supported by the 1. National Key Research and Development Program of China (2022YFC2407006) and 2. Key Research and Development Plan of Shandong Province(2021CXGC011103).

Author information

Authors and Affiliations

Authors

Contributions

LinYang was responsible for the conceptualization and methodology of the study, and wrote the main manuscript text. HanbinRen and BiaoWang contributed to the investigation and validation of the research findings. LeiWang, AijuanYang, and WenchangXu were involved in funding acquisition and supervision. LeiWang and WenchangXu were responsible for the review and editing of the main manuscript. All authors reviewed and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Lei Wang.

Ethics declarations

Conflict of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors without the proper ethical approval.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yang, L., Wang, L., Xu, W. et al. Cross-task cognitive workload estimation using eye tracking. SIViP 19, 354 (2025). https://doi.org/10.1007/s11760-025-03931-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11760-025-03931-0

Keywords