Abstract
Whole slide imaging in histopathology is one of the most important aspects of computational pathology. Nucleus identification and extraction can play a critical part in digital microscopic examination. This work is an extension of our previous published work on childhood medulloblastoma biopsy machine learning classification where the classifier was based on ground truth annotated data. However complete automation would entail automatic segmentation of the cells. The paper explores various segmentation techniques for cell identification from biopsy tissue samples of childhood medulloblastoma microscopic images based on conventional machine learning methods. The study is based on indigenous patient data collected from medical centers of the region. The performance of the segmentation algorithms was compared using Jaccard and Dice coefficient metric.
You have full access to this open access chapter, Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introduction
Identification of the nucleus characteristics is the most important study of a pathologist in the identification of any disease. The problem in the manual assessment of histopathology images includes inter-observer variability for challenging tissue samples with tenuous visual features. Moreover due to lack of adequate patient to doctor ratio the whole slide scanning period for an individual clinical expert is time-consuming. There are a significant number of nucleus present in a slide and inspecting them carefully is a tedious job for the clinicians where patient throughput is tremendously high. Computational pathology [1] with computer vision methods has gained momentum in the recent past to alleviate the conventional approaches [2,3,4,5]. The prime focus in digital pathology applications is to closely identify the nucleus area and its morphological characteristics. Therefore a principal component in computational pathology is segmentation [4, 6,7,8] of the nucleus eliminating the background from the whole slide images. Thus extracting the nucleus more accurately contributes significantly to medical software. The goal of this work is to segment nuclei relating to childhood medulloblastoma microscopic images and contribute towards computational pathology research. We have collected images from real-time patient data and annotated the ground truth, under the guidance of a clinical expert. We studied the result of various conventional segmentation method on our data. Most of the work on medulloblastoma are texture based and no work on medulloblastoma has been carried out till date on the identification of the nuclei from the tissue samples, which is a vital part of the diagnosis. The segmentation of tissue images is a lot more challenging than cytological images due to the high presence of debris, hemorrhages and nuclei diffusion. Also, hyperchromasia and overlapped nuclei are always seen. Machine learning in the past has shown conceivable results in challenging datasets [9,10,11]. The serious problem in medical research is the availability of data. We did not have a benchmark medulloblastoma data set available and so we address the problem for future researchers by creating our own data set with painstakingly marked ground truth nuclei in the samples.
2 Background
Literature reveals that work on medulloblastoma is very recent and the World Health Organization [12] (W.H.O) has characterized it into grade IV malignant tumors only in 2002. Computer-aided study for such tumor has been seen only from 2012 to the best of our knowledge. All the research [13,14,15,16] works are performed by St. Jude’s Children Hospital, USA. However, no work on cell segmentation has been attempted in the past years. Further, the studies carried out were based on the textural feature with the help of which it was attempted to categorize medulloblastoma into anaplastic and non-anaplastic subtypes. But, according to W.H.O, it can be classified into four different subgroups, based on its severity and cell characteristics. Prognosis and other health management depend largely on the subtype the tumour is classified as. Hence, for a cell-based study, the most vital role is that of automated cell segmentation from the tissue samples, which we plan to address in this paper. Previously classification of childhood medulloblastoma into its W.H.O. subtypes have been achieved [17] based on manually segmented cells. Complete automation would be achieved by automated cell extraction.
3 Methods and Materials
3.1 Study Region
The study region is North East India, Guwahati city in particular.
3.2 Data Collection
The tissue blocks were collected from Guwahati Medical College and Hospital (GMCH), Guwahati from the Neurosurgery Department. Next, the blocks were stained with H&E at Ayursundra Pvt. Ltd and finally images were captured under clinical supervision at Guwahati Neurological Research Center (GNRC), Dispur. The images were captured at 100× microscopic resolution and stored in jpeg format. A total of 94 images were collected from 15 slides. 18 normal cell images were also captured from the slides, where present in the sample. Ground truth, for identifying the cells in the tissue samples, were meticulously marked by us in red using MS paint and was later verified under expert supervision. For the particular study, we have marked 37 images which include both properly and poorly stained whole slides. From these images, a total of 1272 cells were identified and marked. Few images are shown in Fig. 1. The ground truth includes both normal and abnormal cells including the various subtypes. The marked ground truth was later segmented using Kmeans color segmentation method for evaluation purpose.
3.3 Ethical Statement
This study was a part of a joint project undertaken by the Institute of Advanced Study in Science and Technology (IASST) and GMCH. Permission for the same was granted from ethical bodies of both the institutions [IASST: Registration number ECR/248/Indt/AS/2015 of Rule 122DD, Drugs and Cosmetics Rule, 1945 of India; GMCH:MC/190/2007/pt-1/E-C/32 dated 30.5.2017].
3.4 Inclusion/Exclusion Criteria
The samples were taken from children less than the age of 15 years and only cases confirmed by the biopsy were included for the study. The cases that do not fall under MB after the clinical findings, summation of history and pathologic features were not considered in the study.
3.5 Overview of the Work
In our work, we studied 8 segmentation techniques that include color based segmentation, region-based segmentation, segmentation based on clustering technique, local and global segmentation methods. The model is depicted in Fig. 2. The algorithms for this purpose were Adaptive segmentation [18, 19], HSV color segmentation [20], YCbCr color segmentation, Ostu segmentation [21], Fuzzy [22, 23] modelling based segmentation, Watershed segmentation [24, 25], and Kmeans [26] clustering and entropy [27, 28] filter segmentation. The HSV and YCbCr were color-based segmentation method where the images were first converted to the respective color channels and then global thresholding was applied on the pixel intensity values. The thresholding value for color-based segmentation was static and therefore proper smooth segmentation of the cells were not possible as different images had different color intensity histograms. We then used the adaptive and Ostu segmentation which was based on finding an iterative best threshold value for cell segmentation. This method gave a better result than our previous attempt but had a high segmentation of overlapped nuclei. We then tried the traditional watershed segmentation for minimizing the segmentation output of overlapped cells, but the watershed segmentation underperformed due to inefficiency to choose an appropriate local minima and maxima regions in the image. Finally, Kmeans and Fuzzy clustering were used for segmentation, which performed higher than the previous attempts. The Kmeans segmentation grouped the whole image into clusters of different colors based on a similarity value. The initial cluster value used was 3. Before applying the Kmeans clustering method the RGB images were first converted to their respective LUV color channel. All the segmentation methods applied is based on hard partition, where each point belong to any one cluster, while the fuzzy method is a soft partition method, where a single data point can belong to two different clusters based on its membership value and the total sum of all membership points is 1. The fuzzy clustering method gave us a better output than most of the other methods. However the most appropriate was entropy-based segmentation for our images. The segmentation module for all algorithms was followed by same morphological operations of opening using a disk structuring element of radius 10 pixels in length for noise removal. Next, the objects that had less than 80 pixels were empirically identified as noise and were removed from consideration. Later, the performance evaluation among the segmentation modules and the ground truth were done using Dice and Jaccard coefficient metric. Mathematically, it is given as
The Jaccard metrics is given by
where TP = True positive values where both the segmented image and the ground truth image has pixel intensity value 1, FP = False positive where the segmented image has pixel intensity value 1 but ground truth has pixel intensity value 0 for the same region, FN = False negative where segmented image has intensity value 0 but ground truth has pixel intensity value 1.
4 Results
This section presents the result of our experimentation with various segmentation techniques with the ground truth segmented data. An image of annotated and ground-truth data is displayed in Fig. 3. Next, various segmentation techniques, as detailed above, are depicted in Fig. 4. A total of 37 images were tested for all the segmentation methods and Dice (using Eq. 1) and Jaccard coefficient (using Eq. 2) for each image was calculated based on the ground truth information. The individual image scores for 25 samples out of 37 are given in Tables 1 and 2. The average scores of the various segmentation algorithms is shown using bar graph in Fig. 5(a) and (b). Boxplot (Fig. 6) is an efficient and simple technique to represent graphically a set of data which belong to a same variable. It displays not only the location of the data values but also variation and are especially useful for showing comparisons. The performance of the segmentation methods were in the order HSV<Watershed<YCbCr<Kmeans<Adaptive<Ostu<Fuzzy<Entropy. The highest Dice and Jaccard coefficient that we have got is 79.2% (Table 1) and 66.1% (Table 2) using entropy-based segmentation method. From Fig. 6 it is clear that Entropy, Fuzzy and Ostu have the highest median, almost equal to each other. Further among these three Entropy median is located more in the middle of the 1st and 3rd quartile indicating that the scores are more evenly distributed. It also has the shortest distance between the 1st and 3rd quartile indicating higher consistency of scores and smaller whiskers indicating lower variation in the scores.
5 Discussions
It is seen that segmentation of histopathological tissue samples is a lot more difficult [29] then segmentation of cytological images. High cellularity, presence of high debris, nuclear diffusion, overlapping nucleus are some of the reasons for such difficulty. Moreover the computational pathology for medical data is highly data dependent and lack of benchmark data is a great hindrance for such work to carry out. Data variation is seen in particular due to difference in preparation of slides. For our dataset we have obtained a better performance for entropy segmentation followed by fuzzy modelling, Ostu thresholding, Adaptive segmentation, Kmeans color segmentation, YCbCr color segmentation, watershed and HSV. This study can be seen as a pathway to the introduction of digital MB cell analysis. The module could be integrated with medical image analysis for digital report generation and also may be extended for other tissue samples.
References
Gurcan, M., et al.: Histopathological image analysis: a review. IEEE Rev. Biomed. Eng. 2, 147–171 (2009)
Petushi, S., et al.: Large-scale computations on histology images reveal grade-differentiating parameters for breast cancer. BMC Med. Imaging 6(1), 1–14 (2006)
Naik, S., et al.: Gland segmentation and computerized Gleason grading of prostate histology by integrating low-, high-level and domain-specific information. MI- AAB Workshop, vol. 34, pp. 1–8. Springer, Australas (2007). Australas Phys. Eng. Sci. Med.
Lu, C., et al.: A robust automatic nuclei segmentation technique for quantitative histopathological image analysis. Anal. Quant. Cytol. Histol. 34, 296–308 (2012)
Su, H., Xing, F., Yang, L.: Robust cell detection of histopathological brain tumor images using sparse reconstruction and adaptive dictionary selection. IEEE Trans. Med. Imaging 35(6), 1575–1586 (2016)
Chang, H., et al.: Invariant delineation of nuclear architecture in glioblastoma multi-forme for clinical and molecular association. IEEE Trans. Med. Imaging 32(4), 670–682 (2013)
Filipczuk, P., et al.: Computer-aided breast cancer diagnosis based on the analysis of cytological images of fine needle biopsies. IEEE Trans. Med. Imaging 32(12), 2169–2178 (2013)
Sethi, A., et al.: Computational pathology for predicting prostate cancer recurrence. In: AACR s106th Annual Meeting, vol. 75, no. 15, pp. 18–22. American Association for Cancer Research, Philadelphia (2015). https://doi.org/10.1158/1538-7445.AM2015-LB-285
Szegedy, C., et al.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol. 1, pp. 1–9. IEEE, Boston (2015). https://doi.org/10.1109/CVPR.2015.7298594
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. CoRR, vol. abs/1409.1556, pp. 730–734. IEEE, Kuala Lumpur (2014)
Socher, R., et al.: Parsing natural scenes and natural language with recursive neural networks. In: ICML, pp. 129–136 (2011)
WHO classification of CNS tumours. https://radiopaedia.org/articles/who-classification-of-CNS-tumours-1. Accessed 28 Jan 2016
Galaro, J., et al.: A method for medulloblastoma tumor differentiation based on convolutional neural networks and transfer learning. In: SPIE 9681, 11th International Symposium on Medical Information Processing and Analysis, IEEE, Boston (2016). https://doi.org/10.1117/12.2208825
Lai, Y., et al.: A texture-based classifier to discriminate anaplastic from non-anaplastic medulloblastoma. In: IEEE 37th Annual Northeast on Bioengineering Conference (NEBEC), pp. 141–171. IEEE, Troy (2011)
Roa, A.C., et al.: An integrated texton and bag of words classifier for identifying anaplastic medulloblastomas. In: 11th International Symposium on Medical Information Processing and Analysis, pp. 3443–3446. IEEE, Cuenca (2015)
Tchikindas, L., et al.: Segmentation of nodular medulloblastoma using random walker and hierarchical normalized cuts. In: IEEE 37th Annual Northeast Bioengineering Conference, pp. 4–5. IEEE, Troy (2011)
Das, D., et al.: Study on contribution of biological interpretable and computer- aided features towards the classification of childhood medulloblastoma cells. J. Med. Syst. 42(151), 5–12 (2018)
Pachai, C., et al.: Unsupervised and adaptive segmentation of multispectral 3D magnetic resonance images of human brain: a generic approach. In: Niessen, W.J., Viergever, M.A. (eds.) MICCAI 2001. LNCS, vol. 2208, pp. 1067–1074. Springer, Heidelberg (2001). https://doi.org/10.1007/3-540-45468-3_127
Lee, L.K., Liew, S.C., Thong, W.J.: A review of image segmentation methodologies in medical image. In: Sulaiman, H.A., Othman, M.A., Othman, M.F.I., Rahim, Y.A., Pee, N.C. (eds.) Advanced Computer and Communication Engineering Technology. LNEE, vol. 315, pp. 1069–1080. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-07674-4_99
Priya, T., Kalavathi, P.: HSV based histogram thresholding technique for MRI brain tissue segmentation. In: Thampi, S.M., Marques, O., Krishnan, S., Li, K.-C., Ciuonzo, D., Kolekar, M.H. (eds.) SIRS 2018. CCIS, vol. 968, pp. 322–333. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-5758-9_27
Satapathy, C.S., et al.: Multi-level image thresholding using Otsu and chaotic bat algorithm. Neural Comput. Appl. 29(12), 1285–1307 (2018)
Srinivas, B., Sasibhushana Rao, G.: Performance evaluation of fuzzy c means segmentation and support vector machine classification for MRI brain tumor. In: Bansal, J.C., Das, K.N., Nagar, A., Deep, K., Ojha, A.K. (eds.) Soft Computing for Problem Solving. AISC, vol. 817, pp. 355–367. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-1595-4_29
Zhou, H., Schaefer, G., Shi, C.: Fuzzy c-means techniques for medical image segmentation. In: Jin, Y., Wang, L. (eds.) Fuzzy Systems in Bioinformatics and Computational Biology. STUDFUZZ, vol. 242, pp. 257–271. Springer, Berlin (2009). https://doi.org/10.1007/978-3-540-89968-6_13
Lim, P.U., Lee, Y., Jung, Y., Cho, J.H., Kim, M.N.: Liver extraction in the abdominal CT image by watershed segmentation algorithm. In: Magjarevic, R., Nagel, J.H. (eds.) World Congress on Medical Physics and Biomedical Engineering 2006. IFMBE, vol. 14, pp. 2563–2566. Springer, Berlin (2007). https://doi.org/10.1007/978-3-540-36841-0_646
Singh, S.K., Goyal, A.: A novel approach to segment nucleus of uterine cervix pap smear cells using watershed segmentation. In: Singh, D., Raman, B., Luhach, A.K., Lingras, P. (eds.) Advanced Informatics for Computing Research. CCIS, vol. 712, pp. 164–174. Springer, Singapore (2017). https://doi.org/10.1007/978-981-10-5780-9_15
Chapman, C., Feit, E.M.: Segmentation: clustering and classification. R For Marketing Research and Analytics. UR, pp. 299–340. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-14316-9_11
Melloul, M., Joskowicz, L.: Segmentation of microcalcification in X-ray mammograms using entropy thresholding. In: Lemke, H.U., Inamura, K., Doi, K., Vannier, M.W., Farman, A.G., Reiber, J.H.C. (eds.) CARS 2002 Computer Assisted Radiology and Surgery, pp. 671–676. Springer, Berlin (2002). https://doi.org/10.1007/978-3-642-56168-9_112
Manjunath, A.P., Rachana, C.S., Ranjini, S.: Retinal vessel segmentation using local entropy thresholding. In: Sridhar, V., Sheshadri, H., Padma, M. (eds.) Emerging Research in Electronics, Computer Science and Technology. LNEE, vol. 248, pp. 1–8. Springer, New Delhi (2014). https://doi.org/10.1007/978-81-322-1157-0_1
Belsare, A.D., Mushrif, M.M.: Histopathological image analysis using image processing techniques: an overview. Int. J. Signal Image Process. 3, 23–36 (2012)
Acknowledgment
We thank IASST, GMCH, GNRC and Ayursundra Healthcare Ltd. for giving us the platform to carry our work.
Conflict of Interest
The Authors declare no conflict(s) of interest.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Das, D., Mahanta, L.B. (2019). On the Study of Childhood Medulloblastoma Auto Cell Segmentation from Histopathological Tissue Samples. In: Deka, B., Maji, P., Mitra, S., Bhattacharyya, D., Bora, P., Pal, S. (eds) Pattern Recognition and Machine Intelligence. PReMI 2019. Lecture Notes in Computer Science(), vol 11942. Springer, Cham. https://doi.org/10.1007/978-3-030-34872-4_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-34872-4_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34871-7
Online ISBN: 978-3-030-34872-4
eBook Packages: Computer ScienceComputer Science (R0)