Elsevier

Neurocomputing

Volume 149, Part A, 3 February 2015, Pages 39-47
Neurocomputing

Accurate segmentation of touching cells in multi-channel microscopy images with geodesic distance based clustering

https://doi.org/10.1016/j.neucom.2014.01.061Get rights and content

Abstract

Multi-channel microscopy images have been widely used for drug and target discovery in biomedical studies by investigating morphological changes of individual cells. However, it is still challenging to segment densely touching individual cells in such images accurately and automatically. Herein, we propose a geodesic distance based clustering approach to efficiently segmenting densely touching cells in multi-channel microscopy images. Specifically, an adaptive learning scheme is introduced to iteratively adjust the clustering centers which can significantly improve the segmentation accuracy of cell boundaries. Moreover, a novel seed selection procedure based on nuclei segmentation is suggested to determine the true number of cells in an image. To validate this proposed method, we applied it to segment the touching Madin-Darby Canine Kidney (MDCK) epithelial cells in multi-channel images for measuring the distinct N-Ras protein expression patterns inside individual cells. The experimental results demonstrated its advantages on accurately segmenting massive touching cells, as well as the robustness to the low signal-to-noise ratio and varying intensity contrasts in multi-channel microscopy images. Moreover, the quantitative comparison showed its superiority over the typical existing cell segmentation methods.

Introduction

High content screening (HCS) has been widely used for discovering novel drugs and targets by investigating the morphological changes of interested proteins inside individual cells [1], [2], [3]. Along with the advance of automated image acquisition equipments, large scale of cell images with multiple fluorescent markers is being generated in experiments. Whereas, great challenges have been posed on the automated quantification of individual cell morphology due to their complex appearances, uneven intensity, low signal-to-noise ratio (SNR), and cell touching [4], [5], [6], [7]. Automated detection and accurate segmentation of touching cells are crucial for biological studies such as cell morphological analysis, cell tracking and cell phase identification [8], [9], [10]. However, it remains being an open problem due to the aforementioned challenges.

A series of approaches of automated cell segmentation in different kinds of images have been reported [11], [12]. In general, the widely used segmentation approaches, for example, watershed method [9], [13] and deformable models like the snake model and level set methods [14], [15], are sensitive to initializations (e.g., seeds selection), and intensity variation inside cells. Specifically, the watershed method is often involved in the over-segmentation problem, and also sensitive to the intensity noise. Active contour or level set based deformable models are very sensitive to boundary initializations. In addition, the computational cost is heavy for these methods. Therefore, they are unsuitable for processing a large scale image dataset in high content screening studies. Clustering methods and statistical approaches have also been employed for cell segmentation [16], [17], [18]. These methods are efficient but usually provide incomplete segmentations because they operate on the single pixel level and neglect the fact that cells are continuous regions. Especially, these methods have limited ability in delineating boundaries of touching cells in cell membrane segmentation. Moreover, large intensity variation on cell membrane and inside cells has made it a much more challenging problem to find the dividing boundaries of cells, which usually lead to biased segmentation results.

In order to segment touching cells more accurately, several new techniques have been investigated in recent years, including certain adaptive learning approaches to separating overlapped nuclei or cells. In fact, Jung et al. [19] utilized an unsupervised Bayesian classification scheme based on parametric EM algorithm for separating overlapped nuclei. Moreover, the single-path voting followed by a mean-shift clustering process was employed to locate the cell centers by Qi et al. [20]. Furthermore, the graph-cuts-based algorithm was successfully applied by Yousef et al. [21] to develop a nuclei segmentation method as well as a powerful toolkit software named ”FARSIGHT”. On the other hand, some classical conventional methods have also been carried forward, such as the adaptive thresholding watershed algorithm [9] and the adaptive active physical deformable model [22]. To solve the cell segmentation problem more effectively via the advanced mathematical model, Jones et al. [23] proposed a new distance metric defined on image manifolds which combined image gradients and inter-pixel distance together. This metric demonstrated its effectiveness, but the segmentation results strongly depended on the initial selection of seeds. Although the centers of nuclei could be adopted as the seeds in CellProfiler [24], the segmentation results became worse as cell nuclei are touched together. Besides, CellProfiler accomplished the pixel clustering after only one time of assignment, the boundaries were obtained without optimization. Along the direction of new distance metric, we recently proposed a novel clustering method with the geodesic distance for gray scale images in [25] with little a priori knowledge. Although it is robust to the initialization of seeds, the determination of number of clusters is still a bottleneck in improving the segmentation performance further. Over-segmentation occurs when the number of initial seeds is more than the true number of cells in an image, which is a well known inherent disadvantage of K-means clustering methods.

Aiming for accurate segmentation of the touching Madin-Darby Canine Kidney (MDCK) epithelial cells in multi-channel (color) images, in this paper, we propose a nuclei segmentation based K-means clustering method using the geodesic distance. The main contribution of this paper is that an adaptive learning scheme is established to adjust the clustering centers with the geodesic distance for improving the segmentation accuracy. Technically, color information is taken into consideration to select the number of reasonable initial seeds for clusters through a well-designed nuclei segmentation procedure. As a result, the over-segmentation problem is effectively overcome, more accurate dividing boundaries are delineated, and the convergence of the improved K-means clustering method is also speeded up.

The rest of this paper is organized as follows. We briefly introduce the geodesic distance based clustering method in Section 2. Section 3 presents the details of our adaptive cell segmentation procedure. The experimental results on MDCK images are demonstrated in Section 4. Finally, a brief conclusion is provided in Section 5.

Section snippets

Riemannian metric

We firstly introduce the assumed Riemannian metric defined at each pixel in an image I [23]. Specifically, G(·) is defined asG(·)=g(·)gT(·)+λE1+λ,where E is the 2×2 identity matrix, λ0 is a regularization parameter and ∇ is a gradient operator. The function g is introduced to reduce the effect of noise by weighted averaging the pixels within a certain neighborhood. The infinitesimal distance at each pixel is then calculated bydxG(·)2dxTG(·)dx=(dxTg(·))2+λdxTdx1+λ.The item (dxTg(·))2

Cell segmentation procedure

In the implementation of the improved clustering method for cell segmentation, it is important to determine the number of clusters and to set the initial seeds of the clusters. Clearly, the number of clusters should correspond to that of cells in the image and it is reasonable and efficient to set the initial seeds as the centers of the nuclei. In order to solve these problems, we can execute a nuclei segmentation procedure independently. In fact, accurate segmentation of the nuclei regions is

Experimental results

We implemented the proposed geodesic distance based K-means clustering method on the MDCK epithelial cell images from dog kidney that were used to study the effects of drug compounds on regulating Ras protein levels, which were determined by measuring the fluorescence intensity of a GFP-tagged N-Ras reporter. Thus, the accurate segmentation of cells is crucial for measuring the GFP-tagged Ras signal changes between the control and drug-treated cells. But the obtained cell images have high

Conclusion

We have proposed a geodesic distance based K-means clustering method that can well segment touching Madin-Darby Canine Kidney (MDCK) epithelial cells for discovering novel drugs regulating the spatial patterns of the N-Ras protein inside individual cells in multi-channel microscopy images. Actually, an adaptive learning scheme is introduced to adjust the clustering centers iteratively and efficiently. Moreover, a novel nuclei segmentation procedure is established to determine the number of

Acknowledgments

This work was supported by the Natural Science Foundation of China for Grants 61171138, NIH U54 CA149196-01, R01LM009161 and John S Dunn Research Foundation.

Xu Chen received his B.S. and M.S. degrees in Applied Mathematics from Peking University, in 2009 and 2013, respectively. He is now a Research Assistant of a government research institute. His main research interests include neural computation, statistical learning, and intelligent information processing.

References (30)

  • E. Ficarra et al.

    Automated segmentation of cells with IHC membrane staining

    IEEE Trans. Biomed. Eng.

    (2011)
  • X. Zhou et al.

    A novel cell segmentation method and cell phase identification using Markov model

    IEEE Trans. Inf. Technol. Biomed.

    (2009)
  • F. Li, X. Zhou, S. Wong, Optimal live cell tracking for cell cycle study using time-lapse fluorescent microscopy...
  • E. Meijering

    Cell segmentation50 years down the road

    IEEE Signal Process. Mag.

    (2012)
  • F. Li et al.

    Chapter 17bioimage informatics for systems pharmacology

    PLoS Comput. Biol.

    (2013)
  • Cited by (9)

    • SEENS: Nuclei segmentation in Pap smear images with selective edge enhancement

      2021, Future Generation Computer Systems
      Citation Excerpt :

      Second, segmentation methods using cell or nucleus edge information utilize the discontinuity of gray information to segment the image; they mainly include differential operator methods and active contour models [23–27]. Third, cell or nucleus segmentation methods are mainly based on related theories [28,29], such as wavelet analysis [30,31], mathematical morphology [32–34], genetic algorithm [35] and neural networks [36–38]. In addition, many state-of-the-art cell or nuclei segmentation algorithms are no longer based on a single algorithm but multiple algorithms together, namely, fusion algorithms, which aim to make up for the shortage of a single algorithm and achieve better segmentation results [39,40].

    • Image segmentation and bias correction using local inhomogeneous iNtensity clustering (LINC): A region-based level set method

      2017, Neurocomputing
      Citation Excerpt :

      In particular, many methods have been proposed in the field of neural computing in recent years [7,8]. The existing methods can be generally classified into model based methods [9,10], clustering methods [11,12], machine learning methods [13,14], graph-cut based methods [15], and active contour and level set methods [16–20]. However, it is still challenging to accurately extract objects from a complex image especially if the image is ambiguous due to the existing of noise and intensity inhomogeneity (also known as bias field) [20–22].

    • Curvelet initialized level set cell segmentation for touching cells in low contrast images

      2016, Computerized Medical Imaging and Graphics
      Citation Excerpt :

      As ellipse-fitting algorithm was also used to refine the results but the algorithm failed to accurately detect the cells of other shapes. Chen et al. [18] proposed a geodesic distance based clustering algorithm to segment densely touching cells which gives improved segmentation accuracy of the cell boundaries. But the algorithm incorporates an adaptive learning scheme to adjust the clustering centers.

    View all citing articles on Scopus

    Xu Chen received his B.S. and M.S. degrees in Applied Mathematics from Peking University, in 2009 and 2013, respectively. He is now a Research Assistant of a government research institute. His main research interests include neural computation, statistical learning, and intelligent information processing.

    Yanqiao Zhu received his B.S. degree in Mathematics and Applied Mathematics from Dalian University of Technology, in 2008 and Ph.D. degree in Applied Mathematics from Peking University, in 2013. His research interests include statistical learning and intelligent information processing.

    Fuhai Li received his Ph.D. degree in Applied Mathematics from Peking University, in 2007 and is now an Instructor in Department of Systems Medicine and Bioengineering, Houston Methodist Hospital Research Institute. His research interests are biomedical image analysis, bioinformatics, and computational systems biology.

    Ze-Yi Zeng received his B.Sc. degree in Biological Sciences and Technology from Zhejiang University (China), in 2001, and Ph.D. degree in Biomedical Sciences from the National University of Singapore (Singapore), in 2006. Since 2007, he is a Susan G. Komen Foundation Postdoctoral Fellow at the Lester & Sue Smith Breast Center of the Baylor College of Medicine in Houston, TX. His research interests include Ras oncogene and breast cancer biology.

    Eric C. Chang is an Associate Professor in Baylor College of Medicine (Houston TX, USA). His lab is interested in a class of proteins that belong to the Ras GTPase superfamily which functions as binary switches to govern cell growth and differentiation. Abnormalities in these proteins are frequently found in human cancers. Ras proteins must localize to the plasma membrane to mediate the signal of growth from growth factors. Thus one of the major areas of research in the Chang lab centers on defining how plasma membrane localization of Ras proteins is controlled, such that this process may be targeted for drug development to treat cancers that are driven by oncogenic Ras proteins.

    Jinwen Ma received his M.S. degree in Applied Mathematics from Xi׳an Jiaotong University, in 1988 and the Ph.D. degree in Probability Theory and Statistics from Nankai University, in 1992. From July 1992 to November 1999, he was a Lecturer or Associate Professor at Department of Mathematics, Shantou University. From December 1999, he became a Full Professor at Institute of Mathematic, Shantou University. From September 2001, he has joined the Department of Information Science at the School of Mathematical Sciences, Peking University, where he is currently a Full Professor and Ph.D. tutor. During 1995 and 2003, he visited several times at the Department of Computer Science and Engineering, the Chinese University of Hong Kong as a Research Associate or Fellow. From September 2005 to August 2006, he worked as Research Scientist at Amari Research Unit, RIKEN Brain Science Institute, Japan. From September 2011 to February 2012, he visited as a Scientist at the Department of Systems Medicine and Bioengineering, Houston Methodist Hospital Research Institute. He has published over 100 academic papers on neural networks, pattern recognition, bioinformatics, and information theory.

    Stephen T.C. Wong, Ph.D., PE, is John S. Dunn Distinguished Endowed Chair in Biomedical Engineering and Chief of Medical Physics, Houston Methodist Hospital, Founding Department Chair of Department of Systems Medicine and Bioengineering, Houston Methodist Research Institute at Texas Medical Center and Professor of Radiology, Neurosciences, Pathology and Laboratory Medicine of Weill Cornell Medical College of Cornell.

    View full text