
1 Introduction

Automatic pain and emotion recognition systems have typically been developed based on a specific modality, such as video signals, particularly facial expressions [7, 15, 21, 22, 30, 32], or biophysiological signals [1, 3, 5, 10, 12, 16, 17]. More recently, multi-modal systems in which several modalities are combined to improve pain intensity recognition performance have been investigated [2, 8, 9, 13, 14, 19, 33]. The audio modality has also been successfully applied to pain intensity estimation [29, 31], but the most common modalities involved in the assessment of pain intensity remain the video and biophysiological channels.

Nowadays, Artificial Neural Networks (ANNs) are considered to be powerful methods for pattern recognition and data analysis [23, 24]; in particular, the so-called deep neural networks have attracted enormous attention in recent times. However, the design of an ANN architecture for a classification or estimation task is still an open issue, and the success of an ANN architecture typically depends heavily on the experience of the machine learning or ANN engineer. Automatic generation, design and evaluation of ANN architectures would be a useful concept, as in many problems the optimal architecture is not known beforehand. Typically, developers use a trial-and-error method to determine the ANN structure.

Here is a brief list of hyper-parameters that are often varied by hand during ANN hyper-parameter optimisation: learning rate, batch size, number of training epochs, types of layers, number of layers, activation function of the neurons in different layers, and the dropout rate. In our experimental study we focus mainly on the following hyper-parameters: the number of layers and their types, the number of neurons per layer and their activation functions.

In this paper, we propose to use Evolutionary Algorithms (EAs) for ANN structure adaptation. We use two types of ANNs: the Feedforward Neural Network (FNN) and the Recurrent Neural Network (RNN). The FNN and RNN structures are optimised using a Self-Configuring Genetic Algorithm (SelfCGA) and Self-Configuring Genetic Programming (SelfCGP), respectively, borrowed from [25, 26]. The Self-Configuring modifications allow us to overcome the problem of selecting settings for the Genetic Algorithm (GA) and Genetic Programming (GP).

We use the Keras framework [4] in Python to train and build ANNs.

The remainder of this work is organised as follows. Section 2 describes the dataset. Section 3 gives a short description of the Self-Configuring technique for the EAs used for ANN structure adaptation. Sections 4 and 5 describe the RNN and FNN structure optimisation using SelfCGP and SelfCGA, respectively. In Sect. 6 a description of Particle Swarm Optimisation with parasitic behaviour (PSOPB) for feature selection is provided. Experiments as well as the corresponding results are presented in Sect. 7, followed by the discussion and conclusion in Sect. 8.

2 Dataset Description

The data utilized in the present work was recently collected with the goal of generating a multimodal corpus designed specifically for research in the domain of emotion and pain recognition. It consists of 40 participants (20 male, 20 female), each subjected to two sessions of experiments of about 40 min each, during which several pain and emotion stimuli were triggered and the demeanour of each participant was recorded using audio, video and biophysiological sensors.

The pain stimuli were elicited through heat generated by a Medoc Pathway thermal stimulator. The experiment was repeated twice for each participant, each time with the ATS thermode attached to a different forearm (left and right). Before the data was recorded, each participant's pain threshold temperature and pain tolerance temperature were determined. Based on both temperatures, an intermediate heat stimulation temperature was computed such that the range between the threshold and tolerance temperatures was divided into two equally spaced intervals.

Fig. 1. Pain stimulation. \(T_{0}\): baseline temperature (\(32\,^{\circ }\)C); \(T_{1}\): pain threshold temperature; \(T_{2}\): intermediate temperature; \(T_{3}\): pain tolerance temperature.

A specific emotional elicitation was triggered simultaneously with each pain elicitation in the form of pictures and video clips. These were carefully selected with the purpose of triggering specific emotional responses. This allowed a categorisation of the emotion stimuli in a two-dimensional valence-arousal space into the following groups: positive (positive valence, high arousal); negative (negative valence, low arousal); neutral (neutral valence, neutral arousal).

Each heat temperature (pain stimulation) was triggered randomly 30 times with a randomised pause lasting between 8 and 12 s between consecutive stimuli. The randomised and simultaneous emotion stimuli were distributed for each heat temperature (pain stimulation) as well as the baseline temperature (no pain stimulation) as follows: 10 positive, 10 negative and 10 neutral emotion elicitations. Each stimulation consisted of a 2-s onset during which the temperature was gradually elevated starting from the baseline temperature until the specific heat temperature was reached. Following this, the attained temperature was maintained for 4 s before being gradually decreased until the baseline temperature was reached. A recovery phase of 8–12 s followed before the next pain stimulation was elicited (see Fig. 1 for more details).

Therefore, each participant is represented by two sets of data, each one representing the experiments conducted on each forearm (left and right). Each dataset consists of 120 pain stimuli with 30 stimuli per temperature (\(T_{0}\): baseline, \(T_{1}\): threshold, \(T_{2}\): intermediate, \(T_{3}\): tolerance), and 120 emotion stimuli with 40 stimuli per emotion category (positive, negative, neutral).

The synchronous data recorded from the experiments consists of 3 high-resolution video streams from 3 different perspectives, 2 audio lines recorded respectively from a headset and a directional microphone, and 4 physiological channels, namely the electromyographic activity of the trapezius muscle (EMG), the galvanic skin response (GSR), the electrocardiogram (ECG) and the respiration (RSP). Furthermore, an additional video and audio stream were recorded using the Microsoft Kinect sensor.

The focus of the present work is the investigation of the relevance of the video and biophysiological channels regarding the task of pain intensity recognition. Thus, the recognition of the different categories of emotion and the impact of the emotion stimuli on pain recognition are not investigated.

In this paper, we concentrate on a binary classification of pain level, distinguishing between the \(T_{0}\) and \(T_{3}\) temperatures. All approaches have been tested on the two parts of the dataset corresponding to the left and right forearms.

3 Evolutionary Algorithms

Evolutionary algorithms (EAs) are common population-based methods used for global optimisation problems [6]. In this investigation, we consider two EAs: the Genetic Algorithm, which represents solutions as binary strings, and the Genetic Programming algorithm, where solutions are encoded as binary trees. The main advantage of EAs, in contrast to gradient-based methods, lies in their "creativity": by recombining pieces of solutions from the population, unexpectedly effective results can arise that would otherwise be difficult to predict. At the same time, the main disadvantage is the large amount of computation required when the settings are poorly selected. Indeed, in the course of evolution, many bad hypotheses have to be tested. To address this problem, modified algorithms should be used. For instance, effective combinations of evolutionary operators allow an optimal solution to be found with fewer objective function evaluations.

3.1 Self-configuration of EAs

Both GA and GP have many adjustable parameters. Since the number of parameter combinations is large, a brute-force search is not always possible. To overcome this problem, we use the operator-based Self-Configuration technique in this study. The main idea is that this technique should choose the most useful combinations of operators from all those available in GA and GP. At the beginning, all operators have the same probability of being chosen for generating new offspring. During the run, Self-Configuration changes these probabilities based on the fitness improvement of the offspring generated by a certain operator. The evolutionary operators concerned are selection, crossover and the level of mutation.

We have used the following operator options for Self-Configuration in this study. There are three types of selection: proportional, rank-based, and tournament with three tournament sizes (2, 5 and 9). Three types of crossover (one-point, two-point and uniform) for GA and two types (standard and one-point) for GP are included. Three levels of mutation are also included: weak \(\frac{1}{5n}\), medium \(\frac{1}{n}\) and strong \(\frac{5}{n}\), where n is the actual depth of the tree in GP and the length of the binary string in GA.

The general procedure of SelfCGA and SelfCGP is as follows (a code sketch of the probability handling in steps 1 and 9 is given after the list):

1. Set equal probabilities for all possible options of each operator type (each operator type has its own probability distribution).
2. Initialise the first population.
3. Select the types of selection, recombination and mutation.
4. Identify parents using the selected selection operator.
5. Cross the parents using the selected crossover type.
6. Mutate the offspring with the selected mutation probability level.
7. Estimate the fitness of the new offspring.
8. Repeat steps 3–7 until the new generation is formed.
9. Recalculate the operator option probabilities using the average fitness of the offspring obtained with each option.
10. Check the stop condition; if it is not reached, go to step 3, otherwise stop the search and take the offspring with the best fitness as the final solution.
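As an illustration of the probability handling in steps 1 and 9, the following minimal Python sketch shows one possible update rule; the function names and the exact redistribution scheme are our assumptions, the precise scheme is defined in [25, 26].

def init_probabilities(options):
    # Step 1: every option of an operator type starts with equal probability.
    return {name: 1.0 / len(options) for name in options}

def update_probabilities(probs, avg_offspring_fitness, min_prob=0.05, step=0.02):
    # Step 9 (illustrative assumption): shift probability mass towards the
    # option whose offspring had the best average fitness, while never
    # letting any option drop below min_prob.
    best = max(avg_offspring_fitness, key=avg_offspring_fitness.get)
    for name in probs:
        if name == best or probs[name] <= min_prob:
            continue
        transfer = min(step, probs[name] - min_prob)
        probs[name] -= transfer
        probs[best] += transfer
    return probs

selection_probs = init_probabilities(
    ["proportional", "rank", "tournament_2", "tournament_5", "tournament_9"])
# avg_offspring_fitness would be the mean fitness of offspring produced with each option
selection_probs = update_probabilities(
    selection_probs,
    {"proportional": 0.41, "rank": 0.44, "tournament_2": 0.52,
     "tournament_5": 0.47, "tournament_9": 0.39})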

3.2 Fitness Function

The fitness function is an indicator of the success of a solution in EAs. It ranges from 0 to 1 (a perfect solution would have a fitness of 1) and has to be defined for each particular problem. In this research, we have to evaluate the effectiveness of different ANN structures. Usually, in the course of ANN training, cross-entropy is used as the loss function for backpropagation. The cross-entropy is non-negative (a perfect model would have a cross-entropy loss of 0) and allows the ANN to be trained most effectively. Therefore, we use the cross-entropy (in the case of binary classification problems, its binary version) to calculate the fitness function as follows:

$$\begin{aligned} fitness=\frac{1}{1+mean\_loss} \end{aligned}$$
(1)

where \(mean\_loss\) is the average loss of a participant-independent leave-one-participant-out cross-validation with m participants from the dataset:

$$\begin{aligned} mean\_loss=\frac{\sum _{i=1}^{m}binary\_crossentropy_i}{m} \end{aligned}$$
(2)

Within the SelfCGP and SelfCGA runs, m is equal to 5. After the final structures are found, they are tested on the whole dataset with m equal to 40.
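A compact sketch of the fitness computation from Eqs. (1) and (2) is given below; it assumes a build_model function that returns a freshly compiled Keras model with binary cross-entropy loss, and arrays X, y and participant_ids describing the samples. It is not the authors' original code.

import numpy as np

def mean_loss(build_model, X, y, participant_ids, epochs=3):
    # Eq. (2): participant-independent leave-one-participant-out cross-validation,
    # averaging the binary cross-entropy over the held-out participants.
    losses = []
    for p in np.unique(participant_ids):
        train, test = participant_ids != p, participant_ids == p
        model = build_model()                        # fresh, untrained model
        model.fit(X[train], y[train], epochs=epochs, verbose=0)
        result = model.evaluate(X[test], y[test], verbose=0)
        losses.append(result[0] if isinstance(result, list) else result)
    return float(np.mean(losses))

def fitness(build_model, X, y, participant_ids):
    # Eq. (1): map the mean loss into (0, 1]; higher is better.
    return 1.0 / (1.0 + mean_loss(build_model, X, y, participant_ids))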

4 RNN Structure Optimisation Using SelfCGP

As mentioned above, we apply SelfCGP for the RNN structure adaptation. SelfCGP has already been used successfully to design FNNs for solving various data analysis problems [27].

Keras provides several kinds of recurrent layers. The following layer types have been used: SimpleRNN (a fully-connected RNN where the output is fed back to the input); LSTM (a long short-term memory RNN); GRU (a gated recurrent unit RNN); Dense (a regular densely-connected NN layer). In addition, we use Dropout layers (which randomly set a fraction of the input units to 0 at each update during training, helping to prevent overfitting) [28]. It is very important to give SelfCGP the opportunity to design RNNs with Dropout layers. It is worth noting that ANNs cannot consist only of Dropout layers, so we include them in the functional set, but not in the terminal one. Next, we need to define several different Dropout coefficients, for instance, \(0.1, 0.2,\ldots ,0.9\).

4.1 Terminal and Functional Sets

The terminal set can contain many variations of layers with the different parameters described above. In this study, we have included the following layer types in the terminal set: SimpleRNN, LSTM, GRU and Dense. In addition, all the activation functions available in Keras have been included in the terminal set: softmax, elu, selu, softplus, softsign, relu, tanh, sigmoid, hard_sigmoid and linear. The range of the available number of neurons per layer has been set from 1 to 40.

Therefore, the terminal set contains 400 elements (all possible combinations of layers, activation functions and numbers of neurons).

The functional set should include possible operations on elements from the terminal set. We have defined two operations to be included in the functional set:

1. Sequential union ("+")
2. Sequential union with an additional Dropout layer ("+" Dropout with coefficients \(\{0.1, 0.2,\ldots ,0.9\}\)).

4.2 The Structure Encoding Description

GP uses binary trees for encoding all structures (unlike GA, which uses binary strings). In this study, we propose the following method of encoding ANN structures into trees. For instance, Fig. 2 shows a structure encoded as a tree.

Fig. 2. Example of encoding a neural network structure into a tree for GP

The code below represents the decoded structure from Fig. 2 in a form suitable for Keras. All leaves belong to the terminal set, while the internal nodes, each with two children, belong to the functional set.

figure a
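The original listing is not reproduced here. Purely as an illustration of the decoding (not the exact structure from Fig. 2), a tree with one plain sequential-union node and one "+ Dropout" node could decode to the following Keras model; the concrete layer sizes and activations are our assumptions.

from keras.models import Sequential
from keras.layers import LSTM, SimpleRNN, Dense, Dropout

timesteps, n_features = 3, 77   # repeated GSR feature vector (see Sect. 4.3)

# Hypothetical tree: ("+" Dropout 0.2)( "+"(LSTM(20, tanh), SimpleRNN(10, relu)), Dense(2, softmax) )
model = Sequential()
model.add(LSTM(20, activation="tanh", return_sequences=True,
               input_shape=(timesteps, n_features)))   # left leaf of the inner "+" node
model.add(SimpleRNN(10, activation="relu"))            # right leaf, joined sequentially
model.add(Dropout(0.2))                                # Dropout inserted by the "+ Dropout" node
model.add(Dense(2, activation="softmax"))              # output leaf
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])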

This kind of encoding allows various types of ANN structures with an unlimited number of layers to be encoded. We prevent the design of trees containing only Dense layers by requiring the presence of at least one recurrent layer.

4.3 Experiment Description

As a baseline, we take an FNN with one hidden layer containing 40 neurons, calculated by the following function:

$$\begin{aligned} N_{neurons}=\frac{n_{inputs}+n_{outputs}}{2}+1 \end{aligned}$$
(3)

When using GSR features, \(n_{inputs}=77\) and \(n_{outputs}=2\), so \(N_{neurons}=40\).
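As a reference point, a minimal Keras sketch of this baseline could look as follows; the hidden-layer activation and the training settings are our assumptions.

from keras.models import Sequential
from keras.layers import Dense

baseline = Sequential()
baseline.add(Dense(40, activation="sigmoid", input_dim=77))  # hidden layer, Eq. (3)
baseline.add(Dense(2, activation="softmax"))                 # two output classes (T0 vs T3)
baseline.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])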

The final best structure found by SelfCGP is tested on all 40 participants using the cross-validation described above. After that, we apply the Student's t-test to determine whether there are statistically significant differences among the results.

The problem we solve has no time dependence and, at first glance, it would appear that the use of RNNs will be ineffective. However, according to [11], RNNs can surpass the effectiveness of FNNs even for problems without time dependency. Since an RNN requires the presence of a time factor for learning, but the problem is static, in this paper we duplicate the input feature vector in time; thus, a constant signal is fed to the network. We repeat the input vector 3 and 5 times (\(SelfCGP_{3st}\) and \(SelfCGP_{5st}\)) in the tests, and we also compare training for 1 and 3 epochs. The main parameters of SelfCGP are: a population size of 100 individuals, 100 generations, a maximum tree depth of 3, and the full growth method at the initialisation step.
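The duplication of the static feature vector along the time axis can be sketched as follows; the array names are placeholders.

import numpy as np

X = np.random.rand(100, 77)   # placeholder: 100 samples, 77 static GSR features

def repeat_in_time(X, timesteps):
    # Insert a time axis and repeat the static vector, giving a constant
    # signal of shape (n_samples, timesteps, n_features) for the RNN.
    return np.repeat(X[:, np.newaxis, :], timesteps, axis=1)

X_3st = repeat_in_time(X, 3)   # input for SelfCGP_3st
X_5st = repeat_in_time(X, 5)   # input for SelfCGP_5st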

5 FNN Structure Optimisation Using SelfCGA

We have tested two different methods of FNN encoding. The first uses only Dense layers, while the second uses Dense and Dropout layers. In this case, SelfCGA is used to find the optimal number of neurons, the layer activation functions and the total number of layers. The binary substring that describes one part of the FNN (a Dense layer plus a Dropout layer) is divided into four fields. The first field represents the type of activation function of the Dense layer, the second the number of neurons in the Dense layer, the third the presence or absence of a Dropout layer after the Dense layer, and the fourth the fraction of input units to drop. The genotype representation of an FNN is shown in Figs. 3 and 4: after the second and fourth Dense layers no Dropout layers follow, and the next part of the network begins immediately. The architecture of each network is coded into the chromosome of SelfCGA, where each chromosome is composed of \((n - m)*4\) genes; n is the maximal number of layers (parts of the network: Dense + Dropout), which must be selected before running the program, and m is the number of inactive layers (parts of the network which contain 0 neurons in the Dense layer and are not expressed in the phenotype). If we use only one type of layer, we can remove the part which describes the Dense layer, and if we use more types of layers, we can add more parts to the string.

Fig. 3. Example of the NN architecture

The FNN weights are optimised by the Adam algorithm.
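A minimal sketch of how the four-field gene groups described above could be decoded into a Keras FNN is given below; the mapping from binary genes to the integer and real values in each field is omitted, and everything beyond the field layout itself is our assumption.

from keras.models import Sequential
from keras.layers import Dense, Dropout

ACTIVATIONS = ["softmax", "elu", "selu", "softplus", "softsign",
               "relu", "tanh", "sigmoid", "hard_sigmoid", "linear"]

def decode_fnn(gene_groups, n_inputs=77, n_outputs=2):
    # Each gene group is (activation_index, n_neurons, use_dropout, dropout_rate);
    # a group with n_neurons == 0 is inactive and is not expressed in the phenotype.
    model = Sequential()
    first = True
    for act_idx, n_neurons, use_dropout, rate in gene_groups:
        if n_neurons == 0:
            continue
        kwargs = {"input_dim": n_inputs} if first else {}
        model.add(Dense(n_neurons, activation=ACTIVATIONS[act_idx], **kwargs))
        first = False
        if use_dropout:
            model.add(Dropout(rate))
    model.add(Dense(n_outputs, activation="softmax"))
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    return model

# Example: two active Dense layers, the first one followed by Dropout(0.3).
model = decode_fnn([(5, 30, 1, 0.3), (7, 12, 0, 0.0), (0, 0, 0, 0.0)])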

6 Feature Selection Using PSOPB

Within this research, the dimension of the feature space is reduced by Particle Swarm Optimisation with parasitic behaviour (PSOPB) [20]. The dataset is high-dimensional, which makes the classification problem a complicated task for some algorithms. For example, in an artificial neural network the number of neurons in the first layer is equal to the dimensionality of the feature space, and the number of weights that a training algorithm needs to find can be even greater. Thus, a large amount of resources, both in time and memory, is consumed. A possible solution is to reduce the feature space. Dimensionality reduction of the feature space has already been performed with classical methods such as PCA, so another approach was chosen to address this problem here. Each attribute is assigned a binary flag indicating whether it is selected. Thus, the string of input classification parameters becomes a vector of binary values for the optimisation algorithm (see Fig. 5) [18].

Fig. 4. Example of a genotype. The genotype defines the NN architecture from Fig. 3. In this case, the second and fourth Dropout layers are inactive.

Fig. 5. Example of converting an attribute vector into a "particle" vector

The fitness of a "particle" is the result of a classification algorithm (accuracy, F1-measure or another quality measure). Before the classifier is evaluated, the attributes whose flag is zero are removed (see Fig. 6).

Fig. 6. Example of fitness value calculation

An RNN is chosen as the fitness function of PSOPB. Its structure contains one hidden layer with 40 neurons of the LSTM type, n input neurons (depending on the number of selected features) and two output neurons.

figure b

The accuracy is chosen as the fitness value of the particle.
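The evaluation of a single particle can be sketched as follows; the array names, the number of time steps and the training settings are our assumptions.

import numpy as np
from keras.models import Sequential
from keras.layers import LSTM, Dense

def particle_fitness(mask, X_train, y_train, X_test, y_test, timesteps=3):
    # mask is the binary particle vector over the GSR features; features with a
    # zero flag are removed, the remaining static vectors are repeated in time,
    # and the accuracy of the LSTM classifier is returned as the fitness.
    cols = np.flatnonzero(mask)

    def to_rnn(X):
        return np.repeat(X[:, cols][:, np.newaxis, :], timesteps, axis=1)

    model = Sequential()
    model.add(LSTM(40, input_shape=(timesteps, len(cols))))  # one hidden LSTM layer, 40 neurons
    model.add(Dense(2, activation="softmax"))                 # two output neurons
    model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
    model.fit(to_rnn(X_train), y_train, epochs=3, verbose=0)
    return model.evaluate(to_rnn(X_test), y_test, verbose=0)[1]   # accuracy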

7 Experiments

7.1 Feature Selection Results

Experiments are conducted using the GSR features. The dataset has two parts, right and left forearm, so the result of the PSOPB run is two binary vectors of feature selection coefficients.

PSOPB with 10 generations and 30 individuals in the population found one binary vector for each problem ("left" and "right" forearm). The strings below are the final results of PSOPB. They show which GSR features take part in the design of the classifier and which do not.

For the left forearm:

$$\begin{aligned}&X=[0,0,1,1,1,0,1,0,0,0,0,0,0,1,0,0,0,0,1,0,1,0,0,1,0,1,\\&\qquad \qquad 1,1,1,0,1,1,1,0,1,0,0,0,0,1,1,1,1,1,0,0,1,1,1,1,0,1,\\&\qquad \qquad \qquad \qquad 1,1,0,0,1,1,0,1,0,1,1,1,1,1,1,0,1,1,1,0,1,1,1,0,1] \end{aligned}$$

For the right forearm:

$$\begin{aligned}&X=[0,1,0,0,0,1,1,0,0,0,0,1,1,1,1,0,1,0,1,1,0,0,1,1,0,\\&\qquad \qquad 1,0,1,0,1,0,0,0,1,1,1,0,0,0,1,1,0,0,0,1,1,1,1,0,0,\\&\qquad \qquad \qquad \qquad 0,0,1,0,1,1,1,0,0,1,1,1,0,0,0,1,1,0,0,1,1,1,1,0,1,1,0] \end{aligned}$$

Therefore, we have 44 active features for the “left” part and 39 active features for the “right” part.

Table 1. RNN accuracy on original data (77 features) and reduced by PSOPB (44 for Left Forearm and 39 for Right Forearm)

The results of the two configurations (original and reduced feature sets) are compared with each other using the Student's t-test. Experiments are conducted under different conditions to find the relation between the RNN structure or settings and the result of PSOPB (Table 1). The mean accuracy was obtained by conducting 40 runs, each with a different participant used for testing: 39 participants are used for training the RNN and one participant for testing.
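The statistical comparison can be sketched with SciPy; the accuracy arrays below are placeholders, and the use of the paired variant of the test is our assumption.

import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
# Placeholder per-participant accuracies from the 40 leave-one-participant-out
# runs; in practice these come from the evaluations summarised in Table 1.
acc_original = rng.uniform(0.6, 0.9, size=40)   # all 77 GSR features
acc_reduced = rng.uniform(0.6, 0.9, size=40)    # features selected by PSOPB

# Paired Student's t-test: the same held-out participant underlies each pair of runs.
t_stat, p_value = stats.ttest_rel(acc_original, acc_reduced)
print("t = %.3f, p = %.3f" % (t_stat, p_value))  # p >= 0.05: no significant difference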

7.2 RNN Optimisation Results

The following structures are found by SelfCGP. For the left forearm problem:

figure c

As can be seen, the structure contains 4 layers, with only GRU and SimpleRNN layer types.

For the right forearm problem:

figure d

In this case, there are only 3 SimpleRNN layers, with "tanh" as the activation function of each layer.

Table 2 shows the average participant-independent leave-one-participant-out cross-validation performance.

Table 2. SelfCGP without reduction

As can be seen from Table 2, the best average classification accuracy is achieved by training for 3 epochs on the structure obtained with the help of SelfCGP, with 3 time steps for the left forearm problem and 5 time steps for the right forearm problem (bold values). There are no statistically significant differences among the results according to the Student's t-test.

PSOPB reduced the dimensionality from 77 to 44 for the left forearm problem. The best structure found by SelfCGP is:

figure e
Table 3. SelfCGP with reduction

For the right forearm problem PSOPB allowed the dimensionality to be reduced from 77 to 39. SelfCGP found the following structure:

figure f

Table 3 shows the average participant-independent leave-one-participant-out cross-validation performance of the RNNs on the dataset with reduced dimensionality.

Here, too, there are no statistically significant differences among the results according to the Student's t-test.

7.3 FNN Optimisation Results

Below are the two example neural network topologies found by SelfCGA that showed the best results.

The structure found by SelfCGA for the GSR feature group on the right forearm data is:

figure g

The structure found by SelfCGA for the video feature group on the left forearm data is:

figure h

These architectures outperform the other models found. Still, their performance is not significantly better.

Tables 4 and 5 show the mean accuracy for the different neural network architectures obtained by cross-validation. Table 4 includes the results obtained using GSR features, and Table 5 includes the results for video features. The first encoding type means that SelfCGA used only Dense layers in the neural network architecture construction process, and the second encoding type means that SelfCGA could build neural networks using both Dense and Dropout layers.

Table 4. SelfCGA for GSR features
Table 5. SelfCGA for video features

8 Conclusion

In this paper, we have presented two EAs for the design of ANN classifiers. With the obtained results, we can state that SelfCGP allows the structure of RNNs to be optimised, but the results of the Student's t-test do not allow us to assert that the obtained improvements are statistically significant for these problems. Reducing the dimensionality of the feature space by PSOPB did not change the accuracy of the RNN in statistical terms, while the feature space was reduced by about half. This makes it less expensive to train big and complicated models, such as RNNs with structures optimised by SelfCGA or SelfCGP, on this dataset. Therefore, we can conclude that this method of reducing the feature space can be used when working with the SenseEmotion dataset.