Classification of ADHD with fMRI data and multi-objective optimization

doi:10.1016/j.cmpb.2020.105676

Computer Methods and Programs in Biomedicine

Volume 196, November 2020, 105676

https://doi.org/10.1016/j.cmpb.2020.105676 Get rights and content

Highlights

•
A novel multi-objective optimization classification scheme is proposed.
•
The scheme uses a cost sensitive three objective model to handle the class imbalance problem.
•
A preferred subset of pareto optimal classifiers can be obtained based on the decision maker's preference.
•
Results show that the proposed scheme performs considerably better than some traditional methods.

Abstract

Background and objective

Dataset imbalance is an important problem in neuroimaging. Imbalanced datasets would cause the performance degradation of a classifier by utilizing imbalanced learning, which tends to overfocus on the majority class. In this paper, we consider an imbalanced neuroimaging classification problem, namely, classification of attention deficit hyperactivity disorder (ADHD) using resting-state functional magnetic resonance imaging.

Methods

We propose a multi-objective classification scheme based on support vector machine (SVM). Our scheme addresses the imbalanced dataset problem by using a three objective SVM model with the positive and negative empirical errors being handled explicitly and separately. Moreover, an interactive multi-objective method incorporating the decision maker's preference is adopted, thus a preferred subset of pareto optimal classifiers for decision making can be obtained.

Results

The proposed scheme is assessed on five datasets from the ADHD- 200 consortium. Numerical results show that the proposed multi-objective scheme considerably outperforms some traditional classification methods in the literature.

Conclusion

The proposed multi-objective classification scheme avoids hyper-parameter selection, it effectively addresses dataset imbalanced problem from algorithm level. The scheme can not only be used in the diagnosis of ADHD but also in the diagnosis of other diseases, such as Alzheimer and Autism etc.

Graphical abstract

Introduction

Attention deficit hyperactivity disorder (ADHD) is a common childhood disorders which lasts to adulthood in most cases. It is defined as a neurodevelopmental disorder in the fifth edition of diagnostic and statistical manual of mental disorders, mainly characterized by attention deficits, excessive activity and behavioral impulses [1]. The worldwide pooled prevalence of ADHD is reported to be 3.4% in children and adolescents [2]. So far, the etiology and pathogenesis of ADHD is still not clear [3], and the diagnosis of ADHD currently relies mainly on the subjective experience of doctors. Therefore objective diagnosis and effective treatment of ADHD is one of the significant topics in the field of neuroscience.

Medical imaging technologies such as electroencephalography, functional near-infrared spectroscopy, magnetic resonance imaging (MRI), and functional magnetic resonance imaging (fMRI) have been used for computer-aided diagnosis of ADHD. see, e.g., [4], [5], [6], [7], [8], [9]. As a method of fMRI, resting-state fMRI (rs-fMRI) has shown prominent advantages in the pathological analysis of psychiatric diseases. Various feature extraction, selection and classification methods have been used in rs-fMRI based disease diagnosis. Castellanos et al. [10] found functional connectivity (FC) information of fMRI can be a prominent feature for ADHD diagnosis. Du et al. [11] used graph kernel principal components analysis (PCA) to extract features and proposed a discriminative subnetwork to classify ADHD. Miao and Zhang [12] discussed the classification of ADHD with fMRI data, and used relief algorithm to obtain a subset of fractional amplitude of low-frequency fluctuation features. Itani et al. [13] proposed a multi-level approach based on decision trees for ADHD classification. Qureshi et al. [14] computed the global connectivity maps of fMRI and used hierarchical extreme learning machine to classify ADHD. Riaz et al. [15] integrated non-imaging and imaging data and used a machine learning framework to study alterations of functional connectivity between ADHD and normal control (NC) subjects. Considering the data imbalanced property, they generated synthetic minority class samples using synthetic minority over-sampling technique (SMOTE) [16]. It needs to be noted that recently multi-objective evolutionary computation algorithms, such as multi-objective particle swarm algorithm [17] and multi-objective self-adaptive particle swarm algorithms [18] etc. have shown some success in feature selection (see [19] for a survey) and can also be used for rs-fMRI feature selection.

Most of the classification algorithms mentioned above make the assumption of well-balanced training datasets and equal misclassification costs. However, dataset imbalance is a critical problem in ADHD rs-fMRI datasets. Due to imbalanced learning, imbalanced datasets may lead to overfocus on the majority class, hence degrade the performance of a classifier. There are many approaches proposed to handle the imbalanced dataset problem (also called the class imbalance problem), see [20], [21], [22], [23]. These approaches basically fall into two major categories: data level approaches and algorithm level approaches. The main idea of data level approaches to handle the imbalance is to resample the training set, whereas algorithm level approaches normally adopt the idea of introducing unequal misclassification costs in decision making process. In ADHD classification, data level approaches such as SMOTE have been used to handle the dataset imbalance. However, by performing random oversampling the minority or under-sampling the majority class, these strategies for creating balanced training datasets may lead to suboptimal performance [24].

Considering the multi-objective nature of classification problems, Shao et al. [25] have proposed a bi-objective classification method for the classification of ADHD. However, dataset imbalance is not considered. In this work, incorporating a decision maker's preference, we propose a novel reference point based three objective optimization classification scheme. The main contributions of our work are as follows.

•
We propose using a cost sensitive three objective classification model which is based on SVM to handle the ADHD dataset imbalance problem.
•
From a practical viewpoint, an interactive multi-objective optimization method incorporating decision maker's preference information is proposed. A preferred subset of pareto optimal classifiers can be obtained, thus a classifier with the best performance can be selected.

The rest of the paper is organized as follows. In Section 2, we first introduce the acquisition and preprocessing of data. Then the three objective classification scheme for ADHD diagnosis is proposed. As a main part of the scheme, a three objective classification model based on L₁-norm SVM is introduced and a reference point based multi-objective optimization method is proposed. Section 3 shows some computational experiments. An interactive multi-objective decision making example and some comparison results of the three objective classification scheme and some other methods are given. In Section 4, we give some further discussions about the results. Finally we draw the conclusion in Section 5.

Section snippets

Data and preprocessing

In this study, the rs-fMRI datasets are downloaded from the Neuro Bureau ADHD-200 consortium (http://fcon1000projectsnitrc.org/) [26]. Datasets are from three sites, they are Kennedy Krieger Institute (KKI), New York University Medical Center (NYU) and Peking University (Peking), respectively. Five datasets, namely, KKI, NYU and Peking-1, Peking-2 and Peking-joint are used in our experiment, where Peking-joint consists of three datasets Peking-1, Peking-2 and Peking-3. There are four kinds of

Results

We use our proposed reference point based multi-objective classification scheme to classify the five datasets from ADHD-200 consortium. Each dataset was randomly stratified into three datasets namely training set, validation set, and testing set. The ratio 6:2:2 is used, i.e., 60% of each dataset is used to train the model, 20% is used as the validation set to select the classifier and 20% is used for testing.

We take Peking-1 dataset as an example to describe the classification process in

Discussion

In our experiment, KKI and Peking-1 datasets in the ADHD-200 consortium are small and highly imbalanced. For KKI, the total number of samples is 83, and only 22 samples are ADHD samples; while for Peking-1 dataset, there are 85 samples in total, and only 24 samples are positive.

Among the four traditional machine learning methods RF, ELM, L₁SVM and L₂SVM, RF method obtained higher average accuracy values than ELM, L₁SVM and L₂SVM methods on the KKI, NYU, Peking-1, and Peking-joint datasets.

Conclusion

In this paper we have proposed a reference point based multi-objective classification scheme to classify ADHD. Our scheme uses a three objective SVM formulation based on L₁-norm. It considers the empirical errors for positive and negative samples separately, thus the class imbalance problem can be handled effectively from algorithm level. Furthermore, considering a decision making process, normally a decision maker has her/his own preferences. Therefore, we adopted an interactive

Declaration of Competing Interest

The authors do not have financial and personal relationships with other people or organizations that could inappropriately influence (bias) their work.

Acknowledgments

This work was supported by the Scientific and Technological Innovation Foundation of Shunde Graduate School, University of Science and Technology Beijing (No. BK19CE017), and the National Environmental Corrosion Platform of China. We also would like to thank the associate editor and the anonymous reviewers for their insightful and detailed comments, which have greatly improved the quality of our article.

References (40)

S. Dey et al.
Attributed graph distance measure for automatic detection of attention deficit hyperactive disordered subjects
Front. Neural Circuits
(2014)
J.L. Marcano et al.
Classification of ADHD and non-ADHD subjects using a universal background model
Biomed. Signal Process. Control
(2018)
F.X. Castellanos et al.
Cingulate-precuneus interactions: a new locus of dysfunction in adult attention-deficit/hyperactivity disorder
Biol. Psychiatry
(2008)
J. Du et al.
Network-based classification of ADHD patients using discriminative subnetwork selection and graph kernel PCA
Comput. Med. Imaging Graph.
(2016)
S. Itani et al.
A multi-level classification frame-work for multi-site medical data: application to the ADHD-200 collection
Exp. Syst. Appl.
(2018)
A. Riaz et al.
Fusion of fMRI and non-imaging data for ADHD classification
Comput. Med. Imaging Graph.
(2018)
S. Cui et al.
An improved sup- port vector machine-based diabetic readmission prediction
Comput. Methods Progr. Biomed.
(2018)
Z. Wang et al.
Feature rearrangement based deep learning system for predicting heart failure mortality
Comput. Methods Progr. Biomed.
(2020)
L. Shao et al.
Classification of ADHD with bi-objective optimization
J. Biomed. Inform.
(2018)
P. Bellec et al.
The neuro bureau ADHD-200 preprocessed repository
Neuroimage
(2017)

N. Tzourio-Mazoyer et al.

Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain

Neuroimage

(2002)

A. Lasisi et al.

Principal components analysis and track quality index: a machine learning approach

Transp. Res. C: Emerg. Technol.

(2018)

H. Aytug et al.

Exploring the trade-off between generalization and empirical errors in a one-norm SVM

Eur. J. Oper. Res.

(2012)

L. Shao et al.

Discrete representation of non-dominated sets in multi-objective linear programming

Eur. J. Oper. Res.

(2016)

Diagnostic and Statistical Manual of Mental Disorders: DSM- 5

(2013)

G.V. Polanczyk et al.

Annual research review: a meta-analysis of the worldwide prevalence of mental disorders in children and adolescents

J. Child Psychol. Psychiatry

(2015)

J.F. Saad et al.

Is the Theta/Beta EEG marker for ADHD inherently flawed?

J. Attent. Disord.

(2015)

C.W. Chang et al.

ADHD classification by a texture analysis of anatomical brain MRI data

Front. Syst. Neuro- Sci.

(2012)

X. Peng et al.

Extreme learning machine-based classification of ADHD using brain structural MRI data

PLOS ONE

(2013)

Z. Liang et al.

Design of multichannel functional near-infrared spectroscopy system with application to propofol and sevoflurane anesthesia monitoring

Neurophotonics

(2016)

Cited by (16)

Automated detection of ADHD: Current trends and future perspective
2022, Computers in Biology and Medicine
Attention deficit hyperactivity disorder (ADHD) is a heterogenous disorder that has a detrimental impact on the neurodevelopment of the brain. ADHD patients exhibit combinations of inattention, impulsiveness, and hyperactivity. With early treatment and diagnosis, there is potential to modify neuronal connections and improve symptoms. However, the heterogeneous nature of ADHD, combined with its comorbidities and a global shortage of diagnostic clinicians, means diagnosis of ADHD is often delayed. Hence, it is important to consider other pathways to improve the efficiency of early diagnosis, including the role of artificial intelligence. In this study, we reviewed the current literature on machine learning and deep learning studies on ADHD diagnosis and identified the various diagnostic tools used. Subsequently, we categorized these studies according to their diagnostic tool as brain magnetic resonance imaging (MRI), physiological signals, questionnaires, game simulator and performance test, and motion data. We identified research gaps include the paucity of publicly available database for all modalities in ADHD assessment other than MRI, as well as a lack of focus on using data from wearable devices for ADHD diagnosis, such as ECG, PPG, and motion data. We hope that this review will inspire future work to create more publicly available datasets and conduct research for other modes of ADHD diagnosis and monitoring. Ultimately, we hope that artificial intelligence can be extended to multiple ADHD diagnostic tools, allowing for the development of a powerful clinical decision support pathway that can be used both in and out of the hospital.
A review of ADHD detection studies with machine learning methods using rsfMRI data
2024, NMR in Biomedicine
Different cortical connectivities in human females and males relate to differences in strength and body composition, reward and emotional systems, and memory
2024, Brain Structure and Function
Diagnosis of Autism Spectrum Disorder (ASD) Using Recursive Feature Elimination–Graph Neural Network (RFE–GNN) and Phenotypic Feature Extractor (PFE)
2023, Sensors
Sampling inequalities affect generalization of neuroimaging-based diagnostic classifiers in psychiatry
2023, BMC Medicine
Multiple measurement analysis of resting-state fMRI for ADHD classification in adolescent brain from the ABCD study
2023, Translational Psychiatry

View all citing articles on Scopus

View full text

Classification of ADHD with fMRI data and multi-objective optimization

Highlights

Abstract

Background and objective

Methods

Results

Conclusion

Graphical abstract

Introduction

Section snippets

Data and preprocessing

Results

Discussion

Conclusion

Declaration of Competing Interest

Acknowledgments

Front. Neural Circuits

Biomed. Signal Process. Control

Biol. Psychiatry

Comput. Med. Imaging Graph.

Exp. Syst. Appl.

Comput. Med. Imaging Graph.

Comput. Methods Progr. Biomed.

Comput. Methods Progr. Biomed.

J. Biomed. Inform.

Neuroimage

Neuroimage

Transp. Res. C: Emerg. Technol.

Eur. J. Oper. Res.

Eur. J. Oper. Res.

Diagnostic and Statistical Manual of Mental Disorders: DSM- 5

Annual research review: a meta-analysis of the worldwide prevalence of mental disorders in children and adolescents

J. Child Psychol. Psychiatry

Is the Theta/Beta EEG marker for ADHD inherently flawed?

J. Attent. Disord.

ADHD classification by a texture analysis of anatomical brain MRI data

Front. Syst. Neuro- Sci.

Extreme learning machine-based classification of ADHD using brain structural MRI data

PLOS ONE

Design of multichannel functional near-infrared spectroscopy system with application to propofol and sevoflurane anesthesia monitoring

Neurophotonics