Abstract
Cognitive workload is a key factor in understanding human cognitive performance, especially in scenarios that require intensive information processing. This study introduces an innovative method to estimate cognitive workload using eye-tracking data and proposes a novel deep learning model called BiTCADNet (Bidirectional Temporal Convolutional self-Attention Dense Network). Experiments using the newly created dataset "Cognitive-Eye-Movement" and the publicly available dataset "CL-Drive" show that BiTCADNet significantly outperforms traditional deep learning models in terms of accuracy, precision, recall, and F1 scores are significantly better than traditional machine learning methods. The proposed method provides a more effective way to monitor and evaluate cognitive workload in real-time, opening the way for its applications in various human-computer interaction environments.








Similar content being viewed by others
Explore related subjects
Discover the latest articles and news from researchers in related subjects, suggested using machine learning.Data Availability
All the data contained in this study can be obtained upon request to the corresponding author.
References
Kosch, T., et al.: A survey on measuring cognitive workload in human–computer interaction. ACM Comput. Surv. 55, 283:1-283:39 (2023)
Ehrmann, D.E., et al.: Evaluating and reducing cognitive load should be a priority for machine learning in healthcare. Nat. Med. 28, 1331–1333 (2022)
Sweller, J., Van Merriënboer, J.J.G., Paas, F.: Cognitive architecture and instructional design: 20 years later. Educ. Psychol. Rev. 31, 261–292 (2019)
Hart, S.G.: Nasa-task load index (NASA-TLX); 20 years later. Proc. Hum. Fact. Ergon. Soc. Annu. Meet. 50, 904–908 (2006)
Ktistakis, E., et. al.: COLET: A dataset for COgnitive workLoad estimation based on eye-tracking. Comput. Methods Programs Biomed. 224 (2022)
Gjoreski, M., et al.: Datasets for cognitive load inference using wearable sensors and psychological traits. Appl. Sci. 10, 3843 (2020)
Oppelt, M.P., et al.: ADABase: a multimodal dataset for cognitive load estimation. Sensors 23, 340 (2023)
Salehi Fadardi, M., Salehi Fadardi, J., Mahjoob, M. Doosti, H.: Post-saccadic eye movement indices under cognitive load: a path analysis to determine visual performance. J. Ophthalm. Vis. Res. (2022)
Müller, D., Mann, D.: Algorithmic gaze classification for mobile eye-tracking (2021)
Appel, T., et al.: Cross-task and cross-participant classification of cognitive load in an emergency simulation game. IEEE Trans. Affect. Comput. 14, 1558–1571 (2023)
Lu, X., Zhao, Z., Ke, W., Yan, Q., Liu, Z.: Young-gaze: an appearance-based gaze estimation solution for adolescents. SIViP 18, 7145–7155 (2024)
Deng, F., Bi, Y., Liu, Y., Yang, S.: Remaining useful life prediction of machinery: a new multiscale temporal convolutional network framework. IEEE Trans. Instrum. Meas. 71, 1–13 (2022)
Castilla, D., et al.: Improving the understanding of web user behaviors through machine learning analysis of eye-tracking data. User Model. User-Adapt. Interact. (2023)
Zhao, Q., Yang, L., Lyu, N.: A driver stress detection model via data augmentation based on deep convolutional recurrent neural network. Expert Syst. Appl. 238 (2024)
Bengio, Y., Courville, A., Vincent, P.: A Review and New Perspectives, Representation Learning (2014). arXiv:1206.5538
Chen, Y., Jiang, H., Li, C., Jia, X., Ghamisi, P.: Deep feature extraction and classification of hyperspectral images based on convolutional neural networks. IEEE Trans. Geosci. Remote Sens. 54, 6232–6251 (2016)
Tang, P., Du, P., Xia, J., Zhang, P., Zhang, W.: Channel attention-based temporal convolutional network for satellite image time series classification. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2022)
Elmadjian, C., Gonzales, C., da Costa, R.L., Morimoto, C.H.: Online eye-movement classification with temporal convolutional networks. Behav. Res. Methods (2022)
Bai, S., Kolter, J.Z., Koltun, V.: An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. (2018). arXiv:1803.01271
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks (2017)
Yu, F., Koltun, V.: Multi-Scale Context Aggregation by Dilated Convolutions (2016). arXiv:1511.07122
Ioffe, S., Szegedy, C.: Batch normalization: accelerating deep network training by reducing internal covariate shift (2015)
Nair, V., Hinton, G.E.: Rectified linear units improve restricted Boltzmann machines (2010)
Angkan, P., et al.: Multimodal brain–computer interface for in-vehicle driver cognitive load measurement: dataset and baselines. IEEE Trans. Intell. Transp. Syst. 1–16 (2024)
Zhong, X., Hou, W.: A method for classifying cognitive load of visual tasks based on eye tracking features 131–138 (2023)
Powers, D.M.W.: Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation (2020). arXiv:2010.16061
Setianto, E.J., Contessa Djamal, E., Nugraha, F., Kasyidi, F.: Eye tracking and emotion recognition using multiple spatial-temporal networks (2022)
Ansari, S., Naghdy, F., Du, H., Pahnwar, Y.: Driver mental fatigue detection based on head posture using new modified reLU-BiLSTM deep neural network. IEEE Trans. Intell. Transp. Syst. 23, 10957–10969 (2022)
Tang, W., et al.: Omni-scale CNNs: a simple and effective kernel size configuration for time series classification (2022). https://openreview.net/forum?id=PDYs7Z2XFGv
Funding
This work was supported by the 1. National Key Research and Development Program of China (2022YFC2407006) and 2. Key Research and Development Plan of Shandong Province(2021CXGC011103).
Author information
Authors and Affiliations
Contributions
LinYang was responsible for the conceptualization and methodology of the study, and wrote the main manuscript text. HanbinRen and BiaoWang contributed to the investigation and validation of the research findings. LeiWang, AijuanYang, and WenchangXu were involved in funding acquisition and supervision. LeiWang and WenchangXu were responsible for the review and editing of the main manuscript. All authors reviewed and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors without the proper ethical approval.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yang, L., Wang, L., Xu, W. et al. Cross-task cognitive workload estimation using eye tracking. SIViP 19, 354 (2025). https://doi.org/10.1007/s11760-025-03931-0
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11760-025-03931-0