Cervical cancer classification using convolutional neural networks and extreme learning machines

https://doi.org/10.1016/j.future.2019.09.015Get rights and content

Highlights

  • CNN-based cervical cancer detection and classification system is proposed.

  • Extreme learning machine (ELM)-based classifier is used after the CNN.

  • 99.7% accuracy is obtained in the 2-class problem using the Herlev database.

  • 97.2% accuracy is obtained in the 7-class problem.

Abstract

Cervical cancer is one of the main reasons of death from cancer in women. The complication of this cancer can be limited if it is diagnosed and treated at an early stage. In this paper, we propose a cervical cancer cell detection and classification system based on convolutional neural networks (CNNs). The cell images are fed into a CNNs model to extract deep-learned features. Then, an extreme learning machine (ELM)-based classifier classifies the input images. CNNs model is used via transfer learning and fine tuning. Alternatives to the ELM, multi-layer perceptron (MLP) and autoencoder (AE)-based classifiers are also investigated. Experiments are performed using the Herlev database. The proposed CNN-ELM-based system achieved 99.5% accuracy in the detection problem (2-class) and 91.2% in the classification problem (7-class).

Introduction

The cervix of a human is covered by a thin layer of tissues consisting of cells. If a cell is changed into a malignant cell that can grow and divide rapidly and becomes a tumor, we call this situation the cervical cancer. This cancer can be treated if it is detected at an early stage. The diagnosis is normally done by a screening process and a biopsy. Image processing techniques can be applied to find the spread of the cancer. Cervical cancer is the fourth-most common cause of death from cancer in women [1].

Medical image processing and intelligent systems play a role in the analysis of the malignant cells. With the development of new techniques, they become cost-effective and less time-consuming. They are now becoming popular over conventional methods such as Pap Smear, Colposcopy, and Cervicography. These techniques are unbiased to human experience; however, we want to stress that they cannot replace the subjective (expert doctor) evaluation, but can assist them to a great degree.

State-of-the-art machine learning techniques and wireless communication technologies have enabled us to develop a complete medical diagnosis system that can operate in real-time, accurately, and without human interaction. Yet, there are many issues that need to be solved, for example, packet loss during transmission, high bandwidth requirement for medical video data transfer, and a robust algorithm to deal with many variations in data. To address some of this issues, edge-based cloud computing was proposed for voice pathology detection [2], [3], Internet of Things (IoTs) and cloud-based framework was realized in [4], deep learning for emotion recognition was proposed in [5], edge-based communication was introduced in [6], and a disease monitoring system was proposed in [7].

Computer-aided systems in cancerous cell detection have been used in the literature for quite a few times. In breast cancer recognition, different feature extraction methods such as local binary pattern, histogram of gradient orientation, and Laplacian Gaussian filter were used [8], [9], [10]. Local texture analysis was used to diagnose pulmonary nodules in [11]. To analyze dermoscopy images for skin cancer, directional filters and color component features were used in [12], [13]. A method for voice pathology detection using different input modalities were proposed in [14], [15].

Recently, deep learning has brought a big improvement in accuracy in many applications. Due to its high accuracies in many areas, it has become the state-of-the-art machine learning technique. A good survey on various cancerous cells detection using deep learning can be found in [16]. Deep learning was successfully used in EEG pathology detection [17], [18], environment classification [19], lung nodule detection [20], breast cancer detection [21], skin cancer detection [22], medical image analysis [23], audio–visual emotion recognition [24] and diseases prediction [25], [26]. With the increase in many types of sensors, processing of Big Data has added an extra dimension to the deep learning. Big Data has successfully handled in several medical related applications [27], [28].

Due to the success of deep learning in many medical applications, in this paper, we propose a deep learning-based system to detect and classify cervical cancerous cells. In particular, we use convolutional neural networks (CNNs) followed by an extreme learning machine (ELM)-based classifier in the system. We investigate different models of CNNs via transfer learning. We also investigate various classifiers such as multi-layer perceptron (MLP), ELM, and autoencoder (AE). A public database, the Herlev database [29], has been used in the experiments. The contributions of the work are (i) introducing CNNs in cervical cancer cell detection and classification, (ii) introducing ELM-based classifier, and (iii) introducing AE-based classifier. The CNN extracts deep-learned features from the raw input. These features are fed into an ELM-based classifier or an AE-based classifier. To the best of our knowledge, this is the first attempt to investigate the use of the ELM and the AE after the CNN in the cervical cancer detection system.

The paper is organized as follows. Some related previous works are briefed in Section 2. The proposed system is described in Section 3. Experimental results and discussion are provided in Section 4. The conclusion of the work and some future directions are given in Section 5.

Section snippets

Previous works

There are several works to classify and detect cervical cancer in the literature. In 2000’s, most of the works involved hand-crafted features and traditional classifiers such as support vector machine (SVM), artificial neural networks (ANNs), and k-means neural networks (K-NNs). We describe some of the related previous works in the following paragraphs.

Zhang et al. proposed an SVM-based cervical cancer classification system in [30]. They used a subset of Herlev database [29], and the subset

Proposed system

The CNN models have been successful in many image processing applications including medical image analysis. Inspired by this, we propose a CNNs-based cervical cancer detection and classification system. The CNNs-based systems need huge data for training, and it is very difficult to get a large database of medical images. Therefore, transfer learning and fine tuning are popular when the database size is small. A deep CNN model can be trained using a large amount of data, and the trained model

Experiments

We used the Herlev database in our experiments. The database was developed in Herlev University Hospital (Denmark), and is available at http://fuzzy.iau.dtu.dk/download/smear2005. There are total 917 cells and seven classes. Three classes belong to normal and four classes belong to abnormal. The number of images per class is shown in Table 2. There are 242 images for normal and 675 images for abnormal.

We used a 5-fold cross validation approach. For fine tuning, we used 80% of the data in

Conclusion

A cervical cancer detection and classification system using CNN was proposed. The ELM-based classifier or the AE-based classifier after the CNN model was integrated into the system. Both shallow CNN model and deep CNN models were investigated. Using the Herlev database, the proposed system with the ELM-based classifier achieved 99.7% accuracy in the 2-class problem and 97.2% accuracy in the 7-class problem. These accuracies were better than any reported previous accuracies.

In a future work, the

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgment

The authors acknowledge funding from the Research and Development (R&D) Program (Research Pooling Initiative), Ministry of Education, Riyadh, Saudi Arabia , (RPI-KSU).

Ahmed Ghoneim received his M.Sc. degree in software modeling from University of Menoufia, Egypt, and the Ph.D. degree from the University of Magdeburg (Germany) in the area of software engineering, in 1999 and 2007 respectively. He is currently an associate professor at the department of software engineering, College of Computer Science and Information Sciences, king Saud University. His research activities address software evolution; service oriented engineering, software development

References (50)

  • WangPin et al.

    Utomatic cell nuclei segmentation and classification of cervical pap smear images

    Biomed. Signal Process. Control

    (2019)
  • MarinakisY. et al.

    Pap smear diagnosis using a hybrid intelligent scheme focusing on genetic algorithm based feature selection and nearest neighbor classification

    Comput. Biol. Med.

    (2009)
  • HossainM.S. et al.

    Applying deep learning for epilepsy seizure detection and brain mapping visualization

    ACM Trans. Multimedia Comput. Commun. Appl. (ACM TOMM)

    (2019)
  • HuangGuang-Bin et al.

    Extreme learning machine: Theory and applications

    Neurocomputing

    (2006)
  • AmoryA. et al.

    Deep convolutional tree networks

    Future Gener. Comput. Syst.

    (2019)
  • World Cancer Report 2014. World Health Organization. 2014. pp. Chapter 5.12. ISBN...
  • MuhammadG. et al.

    Smart health solution integrating IoT and cloud: a case study of voice pathology monitoring

    IEEE Commun. Mag.

    (2017)
  • ChenM.

    Edge-CoCaCo: Toward Joint Optimization of Computation, Caching, and Communication on Edge Cloud

    IEEE Wireless Commun.

    (2018)
  • HossainM.S. et al.

    Cloud-based collaborative media service framework for HealthCare

    Int. J. Distrib. Sens. Netw.

    (2014)
  • WeiJ. et al.

    Computer-aided detection of breast masses on full field digital mammograms

    Med. Phys.

    (2005)
  • EltonsyN.H. et al.

    A concentric morphology model for the detection of masses in mammography

    IEEE Trans. Med. Imaging

    (2007)
  • GhoneimA.

    Medical image forgery detection for smart healthcare

    IEEE Commun. Mag.

    (2018)
  • HanF. et al.

    Texture feature analysis for computer-aided diagnosis on pulmonary nodules

    J. Digit. Imaging

    (2015)
  • BarataC. et al.

    A system for the detection of pigment network in dermoscopy images using directional filters

    IEEE Trans. Biomed. Eng.

    (2012)
  • SadeghiM. et al.

    Detection and analysis of irregular streaks in dermoscopic images of skin lesions

    IEEE Trans. Med. Imaging

    (2013)
  • Cited by (182)

    View all citing articles on Scopus

    Ahmed Ghoneim received his M.Sc. degree in software modeling from University of Menoufia, Egypt, and the Ph.D. degree from the University of Magdeburg (Germany) in the area of software engineering, in 1999 and 2007 respectively. He is currently an associate professor at the department of software engineering, College of Computer Science and Information Sciences, king Saud University. His research activities address software evolution; service oriented engineering, software development methodologies, Quality of Services, Net-Centric Computing, and Human Computer Interaction (HCI). He is a member of the IEEE.

    Ghulam Muhammad is a profess or in the Department of Computer Engineering, College of Computer and Information Sciences at King Saud University (KSU), Riyadh, Saudi Arabia. Prof. Ghulam received his Ph.D. degree in Electrical and Computer Engineering from Toyohashi University and Technology, Japan in 2006. He was a recipient of the Japan Society for Promotion and Science (JSPS) fellowship from the Ministry of Education, Culture, Sports, Science and Technology, Japan. His research interests include image and speech processing, cloud and multimedia for healthcare, biometrics, and security. Prof. Ghulam has authored and co-authored more than 200 publications including IEEE / ACM / Springer / Elsevier journals, and flagship conference papers. He has a U.S. patent on audio processing. He received the best faculty award of Computer Engineering department at KSU during 2014–2015. He has supervised more than 12 Ph.D. and Master Theses. Prof. Ghulam is involved in many research projects as a principal investigator and a co-principal investigator.

    M. Shamim Hossain is a Professor at the Department of Software Engineering, College of Computer and Information Sciences, King Saud University, Riyadh, Saudi Arabia. He is also an adjunct professor at the School of Electrical Engineering and Computer Science, University of Ottawa, Canada. He received his Ph.D. in Electrical and Computer Engineering from the University of Ottawa, Canada. His research interests include cloud networking, smart environment (smart city, smart health), social media, IoT, edge computing and multimedia for health care, deep learning approach to multimedia processing, and multimedia big data. He has authored and coauthored approximately 220 publications including refereed journals, conference papers, books, and book chapters. He has served as cochair, general chair, workshop chair, publication chair, and TPC for over 12 IEEE and ACM conferences and workshops. Currently, he is the cochair of the 2nd IEEE ICME workshop on Multimedia Services and Tools for smart-health (MUST-SH 2019). He is a recipient of a number of awards, including the Best Conference Paper Award and the 2016 ACM Transactions on Multimedia Computing, Communications and Applications (TOMM) Nicolas D. Georganas Best Paper Award, the 2019 King Saud University Scientific Excellence Award (Research Quality), and the Research in Excellence Award from the College of Computer and Information Sciences (CCIS), King Saud University. He is on the editorial board of the IEEE transactions on Multimedia, IEEE Network, IEEE Multimedia, IEEE Wireless Communications, IEEE Access, Journal of Network and Computer Applications (Elsevier), and International Journal of Multimedia Tools and Applications (Springer). He also presently serves as a lead guest editor of IEEE Network, and IEEE Access. Previously, he served as a guest editor of IEEE Communications Magazine, IEEE Transactions on Information Technology in Biomedicine (currently JBHI), IEEE Transactions on Cloud Computing, International Journal of Multimedia Tools and Applications (Springer), Cluster Computing (Springer), Future Generation Computer Systems (Elsevier), Computers and Electrical Engineering (Elsevier), Sensors (MDPI), and International Journal of Distributed Sensor Networks. He is a senior member of both the IEEE, and ACM.

    View full text