Towards a Continuous Method for Mental Workload Registration

Radüntz, Thea; Freude, Gabriele

doi:10.1007/978-3-319-20373-7_17

Thea Radüntz⁵ &
Gabriele Freude⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9174))

Included in the following conference series:

International Conference on Engineering Psychology and Cognitive Ergonomics

3093 Accesses
2 Citations

Abstract

Continuous mental workload registration is a key technology for evaluating and optimizing work conditions in human-machine systems. Despite the urgent need for this technology, its technical measurement is still lacking. The long-term goal of this work is the establishment of precisely such an objective method. The article describes the development of a continuous method for neuronal mental workload registration during the execution of cognitive tasks. The sample consists of 54 people in paid work. The electroencephalogram as well as further workload relevant biosignal data and the NASA-TLX as a subjective questionnaire method are registered. Results from the workload classification of the EEG segments are presented. They are in concordance with the results expected from different task requirements on the executive functions. Findings from the subjective ratings, accuracy rates, and cardiovascular parameters underscore this fact.

You have full access to this open access chapter, Download conference paper PDF

A New Method for the Objective Registration of Mental Workload

A New Method for Mental Workload Registration

Neuronal Mental Workload Registration during Execution of Cognitive Tasks

Keywords

1 Introduction

High demands on cognitive capacity and the ability to cope with workload are increasingly imposed on employees due to advanced information and communication technology, highly interactive work environments, and work assistance systems. Although the main goal of automatization is to simplify work, employees increasingly complain about high mental workload and stress. Problems arise from information overload, frequent work interruptions or from a multitude of irrelevant information [5, 6, 9]. Simultaneously, automation and supervisory control tasks can be linked to monotonous work that reduce employees’ arousal [2, 3, 8, 10, 11]. Hence, the long-term negative consequences of inappropriate workload on individual’s health constitute a serious problem of our modern society. Furthermore, increased error rates due to inappropriate workload constitute a safety risk for other persons [12].

An objective method for continuous mental workload registration is therefore absolutely essential. Efficient execution of work tasks is only possible in an optimal workload range, which can be effectively measured where information processing takes place, i.e. the brain. Neuronal workload measurement is therefore a key technology for optimizing work conditions in human-machine systems.

On the basis of monitoring the neuronal brain state it is possible for instance to define optimal task sharing between human and machine with efficient cognitive processing for the operator. The benefits for employees are maintenance of autonomy and working ability resulting from the moderate levels of mental workload when working with a human-machine system. Another important benefit is the prevention of negative impacts due to sustained overload or underload on the mental health and cognitive capacity of the working population.

This article describes the development of a continuous method for neuronal mental workload registration during the execution of cognitive tasks.

2 Methods

Cognitive tasks were conducted in a laboratory setting with the long-term goal of implementing a system capable of continuously monitoring an operator’s mental workload and recognizing critical states (e.g. high load and low load). The tests took place in the shielded lab of the Federal Institute of Occupational Safety and Health in Berlin.

During the execution of the tasks, which had diverse levels of complexity and difficulty, we registered the electroencephalogram (EEG), as well as further workload relevant biosignal data (e.g. heart rate, blood pressure). The NASA-TLX was conducted as subjective questionnaire method [4]. Based on the implementation of a MATLAB toolbox consisting of modules for EEG pre-processing, segmentation, analysis and template generation, mental workload can be indexed according to Lei’s Logistic Function Model (LFM) and workload can be individually classified in the ranges of low, moderate and high load [7].

2.1 Procedure

The experiment was carried out with each subject fully in a single day. It consisted of two parts: a training phase and the main experiment. During the training phase subjects were familiarized with the cognitive tasks. The cognitive tasks were the same as those of the main experiment but shorter in time. They were repeated until the subject reached an accuracy index of at least 80 %. The training phase was to create similar individual starting conditions in respect to the performance, so that we could investigate the workload’s effect independent from learning effects.

The main experiment started after a short break subsequent to the training phase. The tasks were presented in the same counterbalanced order as presented during the training phase. At the beginning and at the end of the main experiment biosignal rest measurements of about 3 min took place. The experiments were controlled remotely through a remote desktop connection, an intercommunication system and a video monitoring system.

2.2 Subjects

The sample consists of 57 people in paid work and shows high variability in respect to the cognitive capacity and hence to the experienced mental workload. Three persons did not complete all cognitive tasks and had to be excluded from further analysis. Table 1 describes the sample set of 54 people used.

Table 1. Sample set

Full size table

2.3 Tasks

The simulation of different cognitive task requirements is realized through the implementation of a task battery in the E-Prime application suite. The battery consists of tasks with diverse complexity and difficulty inducing different levels of mental workload. The implemented tasks are listed in Table 2.

Table 2. Task battery

Full size table

In this paper we concentrate on the analysis and evaluation of four tasks: switch-PAR and switch-NUM as the easiest ones, switch-XXX as a switching task with working memory requirements and moderate workload [1], and AOSPAN as a demanding dual task (see Figs. 1, 2, 3, 4). The latter is a translated version of the AOSPAN task developed by [13]. The analysis of the rest measurements serves as a reference point measurement.

2.4 Subjective Ratings

Subjective workload was captured with a computerized version of the NASA-TLX. After each task during the training phase, subjects were asked to rate the workload sources in 15 pairwise comparisons of NASA-TLX’s six workload dimensions: mental demand, physical demand, temporal demand, performance, effort, frustration. This required the subject to choose which dimension is more relevant to workload in the specific task. Hence, we gained an individual weighting of these subscales based on their perceived importance.

After each task during the main experiment, subjects were asked again to rate the task within a 100-point range with 5-point steps. They indicated their rating by clicking on a 5-point step box with an optical mouse.

2.5 Physiological Measures

The electroencephalogram as well as the blood pressure (BP) and the heart rate (HR) were digitally recorded only during the main task.

EEG. The EEG was captured by 25 electrodes placed at positions according to the 10-20-system and recorded with reference to Cz and at a sample rate of 500 Hz. For signal recording we used an amplifier from BrainProducts GmbH and their BrainRecorder software.

The recorded EEG signal is widowed with a Hamming function and filtered with a bandpass filter (order 100) between 0.5 and 40 Hz. Subsequently, independent component analysis (ICA) is applied to the signal and the calculated independent components are visually inspected and classified as either an artifact or signal component. The signal components are projected back onto the scalp channels. The artifact-corrected EEG signal is transformed to average reference and cut into segments of 10 s length, overlapping by 5 s. Subsequently, the workload relevant frequency bands ($\theta $: 4-8 Hz, $\alpha $: 8-12 Hz) are computed over the segments using the Fast Fourier Transformation (FFT).

Individual system training for each person is done on the basis of the $\theta $- and $\alpha $-band power distributions over the segments of the first minute of each task. The mean values computed for each person, task and frequency band are stored. Next, the cumulative distribution function over all training segments of each person is built and the previously stored mean values are used to extract the corresponding p-values from the cumulative distribution function. These p-values are averaged over all persons per task and a task specific overall p-value is gained for each frequency band. These overall p-values are then used for extracting the individual task specific q-values from the cumulative distribution functions of each person. Finally, the individual q-values and the NASA-TLX ratings are used for the individual parametrization of the system.

Hence, after successful system training and generation of individual parameters for $b_0$, $b_1$, and $b_2$ we get a personalized Logistic Function Model (LFM) [7] for each person:

$$\begin{aligned} W = \frac{1}{1 + e^{-\left( b_0 + b_1 \cdot \theta + b_2 \cdot \alpha \right) } } \end{aligned}$$

(1)

Here, the relative frequency values ($\theta $, $\alpha $) can be applied and a workload index W for each segment calculated. Due to the nature of the logistic function, this workload index is in the range of 0 to 1. Segments with a workload index $W \le 0.2$ are classified as low load segments, with $0.2 < W < 0.8$ as moderate load segments, and with $W \ge 0.8$ as high load segments. Hence, we obtain for each person and task three percentage values for the portion of the segments of each sector (LLS: low load segments, MLS: moderate load segments, HLS: high load segments).

Cardiovascular Parameters. Blood pressure was recorded continuously by the FMS Finometer Pro device. A finger cuff was placed around the subject’s finger and systolic and diastolic blood pressure as well as the heart rate were detected automatically. The recorded data was processed in the time domain.

2.6 Performance

We concentrated on the analysis of the individual accuracy rates for all four tasks. For AOSPAN, correct responses include the number of sets in which the letters are recalled in correct serial order and correct math problem solving.

2.7 Statistical Analysis

Six ANOVAs were carried out utilizing repeated measures design, one within-subject factor ($\theta $, $\alpha $, systolic BP, HR, accuracy rate or NASA-TLX), six levels (the four tasks and the two rest measurements) for the factors $\theta $, $\alpha $, systolic BP, and HR, or four levels (the four tasks) for the factors accuracy rates and NASA-TLX. Differences between the levels were examined and tested with a post-hoc test (Bonferroni).

3 Initial Results

The results computed over 54 subjects, the four tasks, and the rest measurements will be presented in the following section. They comprise the obtained subjective ratings and task performance as well as the mental workload indexed segments from the EEG, the systolic BP and HR.

3.1 Subjective Ratings and Performance

Subjective Ratings. Figure 5(a) shows the average workload index for the selected tasks switch-PAR, switch-NUM, switch-XXX, and AOSPAN as representatives of two low, a moderate and a high workload tasks. Workload means changed significantly during the experiment (Greenhouse-Geisser F(5.96; 316.01) = 65.023, p$<$0.001). Post-hoc analysis revealed significant changes of the subjectively rated mean workload index between the tasks apart from the two easy tasks among each other.

Furthermore, the analysis of the NASA-TLX sub-scales indicates the predominant role of mental demands at the implemented task battery. Hence, the induced workload originates from information processing and should be reflected in the EEG.

Performance. Figure 5(b) shows the average accuracy rates for the selected tasks switch-PAR, switch-NUM, switch-XXX, and AOSPAN. Accuracy rate means changed significantly during the experiment (Greenhouse-Geisser F(3.71; 196.67) = 173.256, p$<$0.001). Post-hoc analysis revealed significant changes of the mean accuracy rates between all tasks.

3.2 Physiological Measures

EEG. Analysis of the classified EEG segments demonstrates a proportion increase of the high load segments and a proportion decrease of the low load segments with increasing task difficulty level. Means of LLS and HLS changed significantly during the experiment (Greenhouse-Geisser F(6.47; 0.28) = 20.89, p$<$0.001; Greenhouse-Geisser F(5.36; 289.16) = 23.24, p = 0.001). Results obtained from the assessment of the EEG segments are presented in Fig. 6.

Post-hoc analysis of the proportion of HLS showed that the means were significantly larger during the AOSPAN task than all other measurements. Significant differences were identified also between the switch-XXX task and the switch-NUM task as well as between the switch-XXX and the rest measurement at the end. The later showed significant changes to the switch-PAR task, too.

The proportion of LLS revealed significant changes between AOSPAN and all other measurements. Similar behavior was observed for the switch-XXX task. Furthermore, LLS’s proportion of the rest measurement at the end was significantly larger then switch-PAR and the rest measurement at the beginning. No significant differences could be found between the easiest tasks switch-PAR and switch-NUM, neither among themselves nor to the rest measurement at the beginning.

Cardiovascular Parameters. Both systolic BP and HR differed between the measurements significantly (Greenhouse-Geisser F(4.45; 235.65) = 17.62, p$<$0.001; Greenhouse-Geisser F(5.89; 312.26) = 20.92, p$<$0.01).

HR during the rest measurement at the end was, according to post-hoc analysis, lower than during all four tasks. HR during the rest measurement at the beginning was significantly lower then switch-NUM, switch-XXX, and AOSPAN. Furthermore, significant changes in HR could be found between the tasks except for switch-XXX and AOSPAN.

Systolic BP means were significantly larger during the AOSPAN task than in switch-PAR, switch-NUM, and the rest measurements. Additionally, they were significantly larger during switch-XXX than in the two easier switch tasks and the rest measurements. No significant changes could be found between the easy switch tasks switch-PAR and switch-NUM. Furthermore, there were no significant changes between the rest measurements at the beginning and at the end, the rest measurement at the end and the two easier switch tasks, the rest measurement at the beginning and switch-PAR.

Results of systolic BP and HR are presented in Fig. 7(a) and (b).

4 Discussion

The registration of mental workload by means of the EEG is the central issue addressed by this paper. We induced different levels of mental workload on the basis of a task battery but for the sake of convenience, we concentrated here on the switch-PAR, switch-NUM, switch-XXX and AOSPAN tasks. Cognitive requirements of the first two tasks are quite low and the tasks can be assumed to be representative of an easy task. The switch-XXX task is more demanding due to higher requirements on the working memory and rule switching. It can be classified as a moderate to difficult task but not as challenging as the AOSPAN task. The AOSPAN task demands memory control while dealing with distraction due to the math problem solving. It is a dual-task with high workload requirements.

Subjective ratings derived from the NASA-TLX questionnaire demonstrate significant workload differences between the more demanding tasks (switch-XXX and AOSPAN) and the easy tasks. No significant difference could be identified among the subjects in respect of their experienced workload between the two easy tasks switch-PAR and switch-NUM. Accuracy rates show significant differences between all tasks but remarkably larger breaks between the difficult AOSPAN task and all others but also between the moderate task and the two easy tasks. Although there is a significant difference between switch-PAR and switch-NUM, it is pretty clear that the two tasks are located in the low workload level compared to the other two tasks. However, we notice the switch-PAR task to be slightly more difficult than the switch-NUM task.

Cardiovascular parameter indicate significant differences between the more demanding tasks (switch-XXX and AOSPAN) and the two easy tasks. They also show significant differences between both demanding tasks and the rest measurements at the beginning and the end of the experiment. What is more, HR indicates small differences between both easy tasks and also between the rest measurement at the end and all other tasks. Surprisingly, no significant difference can be observed between the more difficult tasks. Here we have to ask, if the cardiovascular parameters are not able to finely distinguish between more demanding tasks, maybe due to a ceiling effect.

The EEG as a direct signal of brain activity and the frequently observed variability of the $\theta $- and $\alpha $-band according to attention, fatigue and mental workload, constitute the theoretical background for the implementation of the LFM method for neuronal mental state monitoring. Proportion analysis results of the HLS and LLS are in concordance with the expected results due to difficulty levels resulting from the requirements of the tasks on the executive functions. The moderate switch-XXX task contains significantly less LLS than the easy tasks and the rest measurements. The more demanding AOSPAN task includes even less LLS. This differences between AOSPAN and all other conducted measurements were found to be significant.

In respect of the HLS, the AOSPAN task again shows substantially higher values than all other measurements. Considering also its small proportion of LLS, AOSPAN is a high mental workload task. Switch-XXX includes significantly higher proportions of HLS than switch-NUM and the rest measurement at the end. However, no significant differences in respect of the HLS could be found between it and the switch-PAR as well as the rest measurement at the beginning. As a side note, this fits well to the assumption that the switch-PAR task is a bit more demanding then the switch-NUM task. If one additionally considers switch-XXX’s proportion of LLS, we can assume that it ranges between the difficult and the easy task. Hence, it can be considered as a moderate workload task.

The easy tasks indicate no significant differences to the rest measurement at the beginning. Neither in respect to their proportion of LLS nor to their proportion of HLS. Interestingly, there is a significant difference of HLS’s as well as LLS’s proportion from the rest measurement at the end, but only for switch-PAR. This fact is solidly in line with the accuracy rates indicating the switch-PAR task as slightly more difficult than the switch-NUM task.

To sum up, our study concurs with the expectations for an increase in the HLS and a decrease in the LLS. Based on these findings of neuronal brain states an optimal task sharing between human and machine could be defined and a moderate mental workload could be achieved. The prevention of negative impacts due to sustained over- or underload on the mental health and cognitive capacity of the working population would be the next step to take. To accomplish this, a consolidated study of over- and underload conditions has to be conducted and measured by means of continuous mental workload registration.

Finally, brain state monitoring can contribute to the modulation of workload, protect and advise against overload and underload, and can be used for ergonomic evaluation and improvement of human-machine systems and information intensive occupations.

References

Allport, D.A., Styles, E.A., Hsieh, S.: Shifting intentional set: exploring the dynamic control of tasks. In: Umilta, C., Moscovitch, M. (eds.) Attention and Performance XV, pp. 421–452. MIT Press, Cambridge (1994)
Google Scholar
Debitz, U., Gruber, H., Richter, G.: Psychische Gesundheit am Arbeitsplatz. Teil 2: Erkennen, Beurteilen und Verhüten von Fehlbeanspruchungen, 3rd edn., InfoMediaVerlag (2003)
Google Scholar
Hacker, W., Richter, P.: Psychische Fehlbeanspruchung. Psychische Ermüdung, Monotonie, Sättigung und Stress (Spezielle Arbeits- und Ingenieurpsychologie in Einzeldarstellungen), 2nd edn., Springer, Berlin (1984)
Google Scholar
Hart, S.G., Staveland, L.E.: Development of the NASA TLX: results of empirical and theoretical research. In: Hancock, P.A., Meshkati, N. (eds.) Human Mental Workload, pp. 139–183. North Holland, Amsterdam (1988)
Chapter Google Scholar
Kompier, M.A.J., Kristensen, T.S.: Organisational work stress interventions in a theoretical, methodological and practical context. In: Dunham, J. (ed.) Stress in the Workplace: Past, Present and Future, pp. 164–190. Whurr Publishers, London (2001)
Google Scholar
Landsbergis, P.A., Cahill, J., Schnall, P.: The changing organisation of work and the safety and health of working people: a commentary. J. Occup. Environ. Med. 45(1), 61–72 (2003)
Article Google Scholar
Lei, S.: Driver mental states monitoring based on brain signals. Ph.D. thesis, TU Berlin, Germany (2011)
Google Scholar
May, J.F., Baldwin, C.L.: Driver fatigue: the importance of identifying causal factors of fatigue when considering detection and countermeasure technologies. Transp. Res. Part F 12, 218–224 (2008)
Article Google Scholar
NIOSH - NORA Organization of work team members . The changing organization of work and the safety and health of working people. Cincinnati: NIOSH-Publications Dissemination, April 2002
Google Scholar
Parasuraman, R., Molloy, R., Singh, I.L.: Performance consequences of automation induced complacency. Int. J. Aviat. Psychol. 3(1), 1–23 (1993)
Article Google Scholar
Parasuraman, R., Mouloua, M., Molloy, R.: Monitoring automation failures in human machine systems. In: Mouloua, M., Parasuraman, R. (eds.) Human Performance In Automated Systems: Current Research Trends, pp. 45–49. Earlbaum, Hilsdale, NJ (1994)
Google Scholar
Sträter, O.: Warum passieren menschliche Fehler und was kann man dagegen tun? Forum Prävention. AUVA - Allgemeine Unfallversicherungsanstalt, Wien (2001)
Google Scholar
Unsworth, N., Heitz, R.P., Schrock, J.C., Engle, R.W.: An automated version of the operation span task. Behav. Res. Methods 37, 498–505 (2005)
Article Google Scholar

Download references

Acknowledgments

We would like to thank Dr. Sergei Schapkin, Dr. Patrick Gajewski, and Prof. Michael Falkenstein for selection of the battery’s tasks. We would like to thank Mr. Ludger Blanke for technical support during the timing tests for the tasks. In addition, we would like to thank Ms. Xenija Weißbecker-Klaus, Mr. Robert Sonnenberg, Dr. Sergei Schapkin, and Ms. Marion Exner for general task testing and for conducting the laboratory experiments. Furthermore, we would like to thank Ms. Marion Exner for daily operational support and our student assistant Jon Scouten for proofreading. More information about the project where our EEG data were acquired can be found under the following link: http://www.baua.de/de/Forschung/Forschungsprojekte/f2312.html?nn=2799254.

Author information

Authors and Affiliations

Federal Institute for Occupational Safety and Health, Mental Health and Cognitive Capacity, Nöldnerstr. 40/42, 10317, Berlin, Germany
Thea Radüntz & Gabriele Freude

Authors

Thea Radüntz
View author publications
You can also search for this author in PubMed Google Scholar
Gabriele Freude
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Thea Radüntz .

Editor information

Editors and Affiliations

Coventry University, Coventry, United Kingdom
Don Harris

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Radüntz, T., Freude, G. (2015). Towards a Continuous Method for Mental Workload Registration. In: Harris, D. (eds) Engineering Psychology and Cognitive Ergonomics. EPCE 2015. Lecture Notes in Computer Science(), vol 9174. Springer, Cham. https://doi.org/10.1007/978-3-319-20373-7_17

Download citation

DOI: https://doi.org/10.1007/978-3-319-20373-7_17
Published: 21 July 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-20372-0
Online ISBN: 978-3-319-20373-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics