False positive reduction in urinary particle recognition

doi:10.1016/j.eswa.2009.03.049

Expert Systems with Applications

Volume 36, Issue 9, November 2009, Pages 11429-11438

https://doi.org/10.1016/j.eswa.2009.03.049 Get rights and content

Abstract

The identification of non-cell objects in biological images is not a trivial task largely due to the difficulty in describing their characteristics in recognition systems. In order to better reduce the false positive rate caused by the presence of non-cell particles, we propose a novel approach using a local jet context features scheme combined with a two-tier object classification system. The newly proposed feature scheme, namely local jet context feature, integrates part of global features with the “local jet” features. The scheme aims to effectively describe the particle characteristics that are invariant to shift and rotation, and hence help to retain the critical shape information. The proposed two-tier particle classification strategy consists of a pre-recognition stage first and later a further filtering phase. Using the local jet context features coupled with a multi-class SVM classifier, the pre-recognition stage intends to assign the particles to their corresponding classes as many as possible. To further reduce the false positive particles, next a decision tree classifier based on shape-centered features is applied. Our experimental study shows that through the proposed two-tier classification strategy, we are able to achieve 85% of identification accuracy and 80% of F₁ value in urinary particle recognition. The experiment results demonstrate that the proposed local jet context features are capable to discriminate particles in terms of shape and texture characteristics. Overall, the two-tier classification stage is found to be effective in reducing the false positive rate caused by non-cell particles.

Introduction

The urinary sediment analysis of biological particles in microscopic images has been a subject of research for many years in pathology (Lamchiagdhase et al., 2005, Linko et al., 2006). A correct urinalysis result offers a direct indication of the patient’s renal and genitourinary system and helps to monitor other body systems (Chien et al., 2007). Urinary sediment analysis is usually performed by trained technicians who count the number of each kind of particles using a microscope focusing on the ground slide with urinary sample or through a monitor screen displaying the biological particles that are digitally captured from the urinary sample. Although the procedures for manual analysis have already been standardized, the traditional microscopy approach in urinary sediment analysis is laborious and tedious which often produces subjective and imprecise results (Lamchiagdhase et al., 2005). Hence, there is an increasing demand for an automated system that can detect and count the suspicious objects in the urinary images and further classify them into the correct classes of interests.

In the typical procedures of automated urinalysis, detection and classification are both challenging tasks. Basically, this is due to the complicated characteristics of urinary images:

•
The image backgrounds can be noisy and non-uniform in illumination.
•
The objects of interest exist in diversified forms. Even for the items belong to the same class, they can vary greatly in shape and texture.
•
The presence of spots caused by multilayer urinal sample slides makes the problem more complex. Some particles can be presented in poor resolution and their boundaries are not sharp enough to be readily extracted.
•
The existence of non-cell particles in different texture and shape will surely jeopardize the recognition results.

As a result, some non-cell particles will be inevitably labeled as cells, e.g. red blood cell (RBC) or white blood cell (WBC), which directly leads to in a high false positive rate.

Recently, it has witnessed an increasingly attention amongst researchers and practitioners in handling image classification in cell biology (Auer, Peng, & Singh, 2007). Although existing algorithms are highly sensitive in cell classification, they mainly focus on the identification of biological cells only, rather than the process of identifying non-cell particles (such as impurity and poor focused regions). There are two main reasons why non-cell objects exist in the images. One is due to the image quality problem from poor focusing regions which are located in various places of urinary images. The other reason is because of the random occurrence of impurities. In some urinary images, the impurities are rare; while in some other cases, the impurities appear continuously along a series of images. Inevitably, some non-cell particles are likely to be assigned to one or the other cell group. This will increase the number count in different cell groups of interest. As a result, technicians have to manually examine the specific cases with relatively a large number of cells from certain classes. Therefore, it is highly desirable to lower the false positive rate caused by non-cell particles as much as possible, while retaining the true positives, i.e. RBCs and WBCs.

In literature, several strategies tackling cell image recognition problem have been reported. Here, we only focus on the approaches adopted in feature extraction scheme, which is well recognized as one of the most important stages in pattern recognition, and summarize them based on their original intents of how to describe the object characteristics.

The first prevailing approach is established based on global features. Global features are intended to describe the regions of interest in a macroscopical way. Such features can refer to the statistics of pixel intensities, e.g. mean, standard deviation and correlation between adjacent pixels, or various shape metrics, e.g. area, perimeter and elongation, or edge information, e.g. skewness and curvature, or colorimetric features, etc. Various feature selection techniques will be applied in order to generate an effective feature set usually in a lower dimension (Liyan, Senmiao, Guangyuan, Lingyan, & Yongli, 2007). Using the reduced feature set as variable inputs, classifiers are induced based on different machine learning algorithms, e.g. k-nearest neighbor, artificial neural network, Bayesian classifier and support vector machines, and labeled learning examples to predict any unseen objects. Often, the identification and screening of these features are domain based and highly rely on the knowledge of a specific application, therefore, rule based methods are also proposed (Chen et al., 2006, Lin et al., 2005, Zhen et al., 2000). However, the limitation of this knowledge based rule approach is well understood since it is often intended for certain types of cells. As for other types of particles, the feature discrimination procedure would not be valid (Dahmen et al., 2000, Liyan et al., 2007). This also jeopardizes its robustness in different image transformations (e.g. orientation, scale, translation and symmetrical illumination) that appear in many applications.

The second major effort emphasizes on local appearance models. It has provided another class of methods in terms of the source of features. From the microcosmic point of view, local features are mainly extracted based on the appearance of particularly interesting points or regions. The local descriptors propose extensive local features which are invariant to various transformations by encoding the numeric information in their neighborhood (Krueger and Phillips, 1989, Mikolajczyk and Schmid, 2005, Schmid and Mohr, 1997). Due to the well-known challenge of high dimensionality in feature space with respect to various machine learning algorithms, feature reduction is conducted, e.g. principal component analysis and linear discriminant analysis. Kolsch et al. introduced squared sub-images with a high local variance of gray values as local features (Kolsch, Keysers, Ney, & Paredes, 2004). Distance metrics and probability models were also proposed in image classification. The experimental study stresses that the decomposition of the training images into sets of local features with respect to each class of interest is the most important aspect in the framework of image classification. Maree & Wehenkel proposed a generic approach for image classification without segmentation (Maree, Geurts, & Wehenkel, 2007). A large set of patches were randomly extracted and their raw pixel values were converted to high-dimensional feature vectors. Next, these features were adopted in the extremely randomized trees to build a subwindow classification model. Although the results were however not as good as the best results for one specific dataset, they illustrated the potential of their generic image classification method. Ranzato et al. presented an automatic system to classify 12 object categories in urinary images (Ranzato et al., 2007). Without any segmentation, the differential invariants of brightness at multiple scales were described by an average operator. With these invariant features, a classifier obtained from a mixture-of-Gaussian generative model achieved a 93.2% accuracy. It has come to our attention if the focusing objects from different classes in various shapes bear similar textures, these methods may not be able to perform well. This is highly because the average operator and the subwindows features do not provide sufficient information about size, shape and density distribution of the particles.

The combination of global and local features has emerged as another stream in feature description. Both global and local features have their own merits in representing different kinds of cells. The utilization of a joint feature set may be more effective in classifying certain types of cells than others. How to integrate these two classes of features is indeed a research problem. In literature, studies have been reported to generate feature vectors by joining global and local features together before supplying to learning algorithms. Wang et al. built feature vectors with 145 features including shape, size and Gabor transformed features (Wang, Zhou, King, & Wong, 2007). Lucas et al. derived features from the sub-images and texture-related features. Feature selection techniques are applied to filter out the less significant features (Lucas, Raul, Bruno, Anne, & Marcilio, 2007). A notable problem in their approach is that the normalization operator may severely jeopardize the contribution of certain features in classification, largely due to their different orders of magnitude when compared to each other. One possible solution is to arrange these features in a hierarchical manner using different learners. Zhen et al. built artificial neural networks to classify normal substance, and then fuzzy rules were generated according to the global features chosen (Zhen et al., 2000). However, their experiments have been limited to two substances only, the white blood cells and the red blood cells. Further exploration and validation are needed.

In our study, the non-cell object class possesses many more samples than the classes of biological cells. Our purpose is to reduce the false positives, e.g. the impurity being regarded as a red blood cell or other type of cells, caused by non-cell objects as many as possible, meanwhile not jeopardizing, if not improving, the true positive rate. In this paper, we mainly focus on feature extraction scheme which is often regarded as one of the most important stages in particle classification problem as well as the strategy concerned about how to combine the global and local features.

Fig. 1 shows an overview of the proposed approach. First, we employ a global thresholding approach of local difference to detect the particles. Next, a two-step procedure for particle segmentation has been adopted. Once the individual particle is segmented, feature extraction is carried out. Local descriptors are a powerful technique for small object representation, such as SIFT, steerable filters and local jet. Based on the local descriptor “local jet”, we present a new feature descriptor namely the local jet context (LJC) features to describe the characteristics of the detected regions. Some useful shape-based features are also introduced according to the biological knowledge. After this, a two-tier object classification system is introduced which consists of a pre-recognition stage first and later a further filtering stage. Using the local jet context features coupled with multi-class SVM classifiers, the pre-recognition stage aims to assign the particles to their corresponding classes as many as possible. To further remove the false positive particles, next a decision tree based on shape-centered feature is applied. Finally, the system outputs the identification results.

The rest of this paper is organized as follows. We introduce a two-step procedure for particle segmentation in Section 2. Section 3 mainly focuses on feature extraction especially our proposed feature scheme. The experiment setup is explained in Section 4. Section 5 reports our experimental results and analysis. Section 6 concludes with some future work.

Section snippets

Particle segmentation

The typical particles existed in urinary images include red blood cells, white blood cells, yeasts, epithelial cells and impurities. An accurate and effective object extraction approach forms the foundation of a particle recognition system. In order not to miss any biological particles and also to reduce the distortion of their shapes, we adopt a two-step process. The first step, namely the location step, aims to segment meaningful regions from the nonuniform background. The second step, namely

Local jet context (LJC) feature scheme

At this point, we have extracted individual objects of interest from urinary images. For each particle, we need to extract relevant features for object representation. These features should be invariant with respect to shift and rotation, and should be robust against small changes in scale while retaining its overall shape information as well. Local descriptors are powerful techniques for small object representation, such as SIFT, steerable filter and local jet (Fang et al., 2003, Mikolajczyk

Experiment setup

The image data tested in our experiments were directly obtained using a digital camera through a microscope which focuses on a vessel carrying human urine. These biomedical images were captured as 8-b gray-level 659 × 493 bmp files.

Follow our particle segmentation approach, we set α = 0.1, the objects of interest were detected after the step of contour tuning. The particles found in microscopic urinalysis consist of five classes: red blood cells (RBC), white blood cells (WBC), yeasts (YSTS),

Overall performance

Fig. 10 shows the overall performance of two feature schemes tested over D1 and D2 in terms of P_F values.

Our first observation is that using LJC features the system performs much better than the one based on LJN features. By applying LJC, the overall performance (F₁ value) of SVM is improved by about 8% to 9% compared to those using LJN on both D1 and D2. However, when analyzing the results between D1 and D2, we note that both F₁ values drop about 17% and 15% for the system using LJN and LJC

Conclusion and future work

Particle recognition in biological images is often regarded as a non-trivial task. In addition, the different characteristics of non-cell objects have made the task more difficult. In order to better reduce the false positive rate caused by the presence of non-cell particles, we have proposed a new local feature extraction scheme and a two-tier classification strategy, i.e. a pre-recognition stage plus a further filtering phase. The new local feature extraction scheme, namely local jet context,

Acknowledgements

The work described in this paper was supported by the Program for New Century Excellent Talents of Education Ministry of China (NCET-06-0762), Natural Science Foundation of China (60873092) and Natural Science Foundation of Chongqing (CSTC2007BA2003).

References (22)

T.-I. Chien et al.
Comparison of three automated urinalysis systems – Bayer Clinitek Atlas, Roche Urisys 2400 and Arkray Aution Max for testing urine chemistry and detection of bacteriuria
Clinica Chimica Acta
(2007)
P. Lamchiagdhase et al.
Urine sediment examination: A comparison between the manual method and the iQ200 automated urine microscopy analyzer
Clinica Chimica Acta
(2005)
S. Linko et al.
Analytical performance of the Iris iQ200 automated urine microscopy analyzer
Clinica Chimica Acta
(2006)
D.-T. Lin et al.
Autonomous detection of pulmonary nodules on CT images with a neural network-based fuzzy system
Computerized Medical Imaging and Graphics
(2005)
M. Ranzato et al.
Automatic recognition of biological particles in microscopic images
Pattern Recognition Letters
(2007)
M. Auer et al.
Development of multiscale biological image data analysis: Review of 2006 international workshop on multiscale biological imaging, data mining and informatics, Santa Barbara, USA (BII06)
Bmc Cell Biology
(2007)
C.C. Chang et al.
Training nu-support vector regression: Theory and algorithms
Neural Computation
(2002)
X.W. Chen et al.
Automated segmentation, classification, and tracking of cancer cell nuclei in time-lapse microscopy
IEEE Transactions on Biomedical Engineering
(2006)
J. Dahmen et al.
Automatic classification of red blood cells using gaussian mixture densities
Bildverarbeitung fur die Medizin
(2000)
B. Fang et al.
On the accurate counting of tumor cells
IEEE Transactions on Nanobioscience
(2003)

J.J. Koenderink et al.

Representation of local geometry in the visual-system

Biological Cybernetics

(1987)

Cited by (13)

Microscopic urinary particle detection by different YOLOv5 models with evolutionary genetic algorithm based hyperparameter optimization
2024, Computers in Biology and Medicine
The diagnosis of kidney disease often involves analysing urine sediment particles. Traditionally, urinalysis was performed manually by collecting urine samples and using a centrifuge, which was prone to manual errors and relied on labour-intensive processes. Automated urine sediment microscopy, based on machine learning models, requires segmentation and feature extraction, which can hinder model performance due to intrinsic characteristics of microscopic images. Deep learning models based on convolutional neural networks (CNNs) often rely on a large number of manually annotated data, making the system computationally complex. This study propose an advanced deep learning model based on YOLOv5, which offers faster performance and requires comparatively less data. The proposed model used five variants of the YOLOv5 model (YOLOv5n, YOLOv5s, YOLOv5m, YOLOv5l, and YOLOv5x) to detect six categories of urine particles (erythrocyte, leukocyte, crystals, cast, mycete, epithelial cells) from microscopic urine sediment images. The dataset involved 5376 images of urine sediments with 6 particles. There are 30 sets of hyperparamreteres are employed in the YOLOv5 model. To optimize the hyperparameters and fine-tune with the urine sediment dataset and for training each model, the system employed a genetic algorithm (GA) based on evolutionary principles named as Evolutionary Genetic Algorithm (EGA). Among the six categories of detected particles mycete achieved maximum performance with a mAP of 97.6 % and crystals achieved minimum performance with a mAP of 81.7 % with YOLOv5x model compared to other particles. To optimize the hyperparameters for training each model, the system employed a genetic algorithm (GA) based on evolutionary principles named as Evolutionary Genetic Algorithm (EGA). Among all the models, YOLOv5l and YOLOv5x performed the best. YOLOv5l achieved a mean average precision (mAP) of 85.8 % while YOLOv5x achieved a mAP of 85.4 % at an IoU threshold of 0.5. The detection speed per image was 23.4 ms for YOLOv5l and 28.4 ms for YOLOv5x. The proposed method developed a faster and better automated microscopic model using advanced deep learning techniques to detect urinary particles from microscopic urine sediment images for kidney disease identification. The method demonstrated strong performance in urinalysis.
A new super resolution Faster R-CNN model based detection and classification of urine sediments
2023, Biocybernetics and Biomedical Engineering
Citation Excerpt :
Furthermore, the number of analyses to be performed by a single specialist is often limited. These problems led to the emergence of various approaches for automatic urine sediment analysis [4–7]. In this respect, the present study proposes a deep learning-based automatic urine sediment analysis approach.
The diagnosis of urinary tract infections and kidney diseases using urine microscopy images has gained significant attention of medical community in recent years. These images are usually created by physicians’ own rule of thumb manually. However, this manual urine sediment analysis is usually labor-intensive and time-consuming. In addition, even when physicians carefully examine an image, an erroneous cell recognition may occur due to some optical illusions. In order to achieve cell recognition in low-resolution urine microscopy images with a higher level of accuracy, a new super resolution Faster Region-based Convolutional Neural Network (Faster R-CNN) method is proposed. It aims to increase resolution in low-resolution urine microscopy images using self-similarity based single image super resolution which was used during the pre-processing. De-noising based Wiener filter and Discrete Wavelet Transform (DWT) are used to de-noise high resolution images, respectively, to increase the level of accuracy for image recognition. Finally, for the feature extraction and classification stages, AlexNet, VGFG16 and VGG19 based Faster R-CNN models are used for the recognition and detection of multi-class cells. The model yielded accuracy rates are 98.6%, 96.4% and 96.2% respectively.
A review on various methods for recognition of urine particles using digital microscopic images of urine sediments
2021, Biomedical Signal Processing and Control
Citation Excerpt :
They found that the use of exponential radial basis function as SVM kernel gives the better result. Liang et al. [18] segmented urine images using the location step and tuning step. 8-bit gray-level urine images having a size of 659 × 493 Bitmap (bmp) were directly obtained using a digital microscopic camera.
Urine sediment examination is important for any patient with renal disease. Urinalysis may be physical, chemical or microscopic examinations. Microscopic examination determines the parameters such as Red Blood Cells (RBC), White Blood Cells (WBC), Epithelial Cells, Crystals, Bacteria, and Casts. Results from this test identify various kidney-related diseases such as Hematuria, Kidney Stones, etc. This review compares various automated methods used for urinalysis. The traditional method for microscopic examination of urine sediment performed manually from centrifuged urine samples. It is a time-consuming process and there is possibility of manual errors. This work describes the classification of microscopic images of urine sediments by conventional automated microscopic techniques and by using different types of convolutional neural networks (CNN). The problem with the conventional automated models is that the segmentation and feature extraction to be carefully designed. The characteristics of microscopic urine images make it a formidable task. The convolutional neural network classifies the images without feature extraction and segmentation. Various convolutional neural networks proposed in the literature are different types of RCNN, SSD and its variants and LeNet-5 neural network.
Automated Classification of Microscopic Urinary Particles by Using Deep Learning Methods Based on Efficientnet Algorithm
2024, SSRN
An Efficient Particle YOLO Detector for Urine Sediment Detection
2023, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Inspection of visible components in urine based on deep learning
2020, Medical Physics

View all citing articles on Scopus

View full text

False positive reduction in urinary particle recognition

Abstract

Introduction

Section snippets

Particle segmentation

Local jet context (LJC) feature scheme

Experiment setup

Overall performance

Conclusion and future work

Acknowledgements

Clinica Chimica Acta

Clinica Chimica Acta

Clinica Chimica Acta

Computerized Medical Imaging and Graphics

Pattern Recognition Letters

Development of multiscale biological image data analysis: Review of 2006 international workshop on multiscale biological imaging, data mining and informatics, Santa Barbara, USA (BII06)

Bmc Cell Biology

Training nu-support vector regression: Theory and algorithms

Neural Computation

Automated segmentation, classification, and tracking of cancer cell nuclei in time-lapse microscopy

IEEE Transactions on Biomedical Engineering

Automatic classification of red blood cells using gaussian mixture densities

Bildverarbeitung fur die Medizin

On the accurate counting of tumor cells

IEEE Transactions on Nanobioscience

Representation of local geometry in the visual-system

Biological Cybernetics