Using support vector machines in diagnoses of urological dysfunctions

doi:10.1016/j.eswa.2009.12.055

Expert Systems with Applications

Volume 37, Issue 6, June 2010, Pages 4713-4718

https://doi.org/10.1016/j.eswa.2009.12.055 Get rights and content

Abstract

Urinary incontinence is one of the largest diseases affecting between 10% and 30% of the adult population and an increase is expected in the next decade with rising treatment costs as a consequence. There are many types of urological dysfunctions causing urinary incontinence, which makes cheap and accurate diagnosing an important issue. This paper proposes a support vector machine (SVM) based method for diagnosing urological dysfunctions. 381 registers collected from patients suffering from a variety of urological dysfunctions have been used to ensure the (generalization) performance of the decision support system. Moreover, the robustness of the proposed system is examined by fivefold cross-validation and the results show that the SVM-based method can achieve an average classification accuracy at 84.25%.

Introduction

Presently urinary incontinence affects between 10% and 30% of the adult population and it is expected to increase in the next decade with accelerating treatment costs as a consequence (Cortes and Kelleher, 2005, Wein, 2004). This rise in incidence is similar for the male and the female parts of the adult population (Irwin et al., 2006) (see Table 1).

The use of classifier systems in medical diagnosis is increasing gradually. There is no doubt that evaluation of data taken from patients and decisions of experts are the most important factors in diagnosis. However, expert systems and different artificial intelligence techniques for classification have the potential of being good supportive tools for the expert. Classification systems can help in increasing accuracy and reliability of diagnoses and minimizing possible errors, as well as making the diagnoses more time efficient (Akay, 2008).

Some of the related work in the field of the urological diagnosis has been developed basically by means of artificial neural networks (ANNs) (Gil et al., 2007, Gil et al., 2008). To increase the accuracy and the generalization ability we propose the use of a Support Vector Machine (SVM) based system combined with techniques for dimensionality reduction. In addition to ANNs, the SVM (Cortes & Vapnik, 1995) has also emerged as a powerful tool for classification. SVMs were proposed by Vapnik (1995) and is based on the structured risk minimization (SRM) principle. Hence it tries to minimize an upper bound of the generalization error instead of the empirical error as in the artificial neural networks. Therefore a particular advantage of SVMs over other classifiers is that they can achieve better performance when applied to real world problems (He, Hu, Harrison, Tai, & Pan, 2006). Some classifiers, such as ANNs suffer from the overfitting problem. In the case of the SVM overfitting is unlikely to occur. Overfitting is caused by too much flexibility in the decision boundary.

SVMs are global representatives of the whole set of training points, and there are usually few of them, which gives little flexibility. Thus overfitting is unlikely to occur (Witten & Frank, 2005). SVMs have been successfully applied to a wide variety of applications, e.g. including pattern recognition, biology and financial domains (Hearst et al., 1998, Hua and Sun, 2001, Huang and Wu, 2006, Shin et al., 2005, Wu et al., 2008, Yan et al., 2008).

The remaining part of the paper is organized as follows: first, we give a brief description of some basic SVM concepts. Next we describe the design of our proposal of the SVM-based decision support system with dimensionality reduction and the training of the SVM by the available data. Then we describe our testing of the system and analyze the results. Finally we draw relevant conclusions and suggest future lines of research.

Section snippets

Support vector machines

In this section, the basic concept of SVM will be briefly described. More thorough descriptions can be found in Burges, 1998, Theodoridis and Koutroumbas, 2003, Hsu et al., 2003. A typical two class problem as Fig. 1 shows is similar to the problem of diagnosing urological patients as either ill or healthy.

For a classification problem, it is necessary to first try to estimate a function $f : R^{N} \to {\pm 1}$ using training data, which are l N-dimensional patterns $x_{i}$ and class labels $y_{i}$ , where $(x_{1}, y_{1}), \dots, (x_{l}, y$

Urological data

The input data in the system starts when a patient reports to a physician. Then, a large number of information to be considered during the diagnosis will be saved in a database. In this study, an exhaustive urological exploration with 20 different measurements has been carried out by using 381 patients with dysfunctions in the lower urinary tract (LUT). The 20 input variables (Table 2) that are essential to the diagnosis of the LUT diseases of interest are extracted from the urological

Conclusions and future work

In this paper we have evaluated the performance of a classifier constructed by means of the SVM method when applied to the diagnosis of urological dysfunctions. The SVM were trained with data from a database with registers of patients with urological dysfunctions. The experiment starts with a preprocessing of the urodynamical measures from every patient. This preprocessing includes missing data treatment and normalization process. After that, data are provided to the SVM which determines

Acknowledgments

We want to express our acknowledgements to the urologists of the Hospital of San Juan (Alicante-Spain), who have made it possible to reach a better understanding of the different types of urological dysfunctions. Moreover, the data used in the development of this system is the result of several years of this collaboration.

References (40)

E. Cortes et al.
Costs of female urinary incontinence
Women’s Health Medicine
(2005)
J. He et al.
Transmembrane segments prediction and understanding using support vector machine and decision tree
Expert Systems with Applications
(2006)
S. Hua et al.
A novel method of protein secondary structure prediction with high segment overlap measure: Support vector machine approach
Journal of Molecular Biology
(2001)
D.E. Irwin et al.
Population-based survey of urinary incontinence, overactive bladder, and other lower urinary tract symptoms in five countries: Results of the EPIC study
European Urology
(2006)
R. Kohavi et al.
Wrappers for feature subset selection
Artificial Intelligence
(1997)
S.H. Min et al.
Hybrid genetic algorithms and support vector machines for bankruptcy prediction
Expert Systems with Applications
(2006)
K.S. Shin et al.
An application of support vector machines in bankruptcy prediction model
Expert Systems with Applications
(2005)
Ö. Uncu et al.
A novel feature selection approach: Combining feature wrappers and filters
Information Sciences
(2007)
A.J. Wein
Costs of urinary incontinence and overactive bladder in the United States: A comparative study
The Journal of Urology
(2004)
T.K. Wu et al.
Evaluation of ANN and SVM classifiers as predictors to the diagnosis of students with learning disabilities
Expert Systems with Applications
(2008)

Akay, M. F. (2008). Support vector machines combined with feature selection for breast cancer diagnosis. Expert systems...

K.P. Bennett et al.

Robust linear programming discrimination of two linearly inseparable sets

Optimization Methods and Software

(1992)

C.J.C. Burges

A tutorial on support vector machines for pattern recognition

Data Mining and Knowledge Discovery

(1998)

C. Cortes et al.

Support-vector networks

Machine Learning

(1995)

R. Courant et al.

(1953)

Das, S. (2001). Filters, wrappers and a boosting-based hybrid for feature selection. In Machine learning-international...

W. Duch

Filter methods

Studies in fuzziness and soft computing

(2006)

X. Fu et al.

Data dimensionality reduction with application to simplifying RBF network structure and improving classification performance

IEEE Transactions on Systems, Man and Cybernetics, Part B

(2003)

Gil, D., Soriano, A., Ruiz, D., & Montejo, C. A. (2007). Embedded system for diagnosing dysfunctions in the lower...

D. Gil et al.

Application of artificial neural networks in the diagnosis of urological dysfunctions

Expert Systems with Applications

(2008)

Cited by (21)

Machine learning reveals salivary glycopatterns as potential biomarkers for the diagnosis and prognosis of papillary thyroid cancer
2022, International Journal of Biological Macromolecules
Citation Excerpt :
SVM diagnosis was performed on 381 patients with urinary system diseases. The results showed that the method based on Support Vector Machine can reach 84.25 % classification accuracy [42]. If the SVM data mining method is combined with the expertise of various research fields, SVM can be extended to different fields.
The diagnosis of thyroid cancer, especially papillary thyroid cancer (PTC), is increasing rapidly worldwide. In this study, we aimed to study the glycosylation of salivary proteins associated with PTC and assess the likelihood that salivary glycopatterns may be a potential biomarker of PTC diagnosis. Firstly, 22 benign thyroid nodule (BTN) samples, 27 PTC samples, and 30 healthy volunteers (HV) samples were collected to probe the difference of salivary glycopatterns associated with PTC using lectin microarrays. Then, five machine learning models including K-Nearest Neighbor (KNN), Multilayer Perceptron (MLP), Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM) were established to distinguish HV, BTN and PTC based on the changes of salivary glycopatterns. As a result, SVM had the best diagnostic effect with an accuracy rate of 92 % in testing set. Besides, lectin microarrays were used to explore the differences in salivary glycopatterns of 26 paired salivary samples of PTC patients before and after operation in order to probe into salivary glycopatterns as potential biomarkers for prognosis of PTC patients. The results showed that the levels of salivary glycopatterns recognized by 6 different lectins in patients after the operation almost convergenced with HVs. This study could help to screen and assess patients with PTC and their prognosis based on precise changes of salivary glycopatterns.
Application of total X-Ray fluorescence to gunshot residue determination
2019, Applied Radiation and Isotopes
Citation Excerpt :
Quadratic discriminant analysis (QDA) is the same as the former method but employs a quadratic equation which is usually more efficient (Esteki et al., 2018; Rodionova et al., 2016). Supporting vector machines (SVM) finds the best hyper-plane that discriminate two different classes of samples (Zendehboudi et al., 2018; Gil and Johnsson, 2010). In K-nearest neighbors (KNN) algorithm, the distance from the unknown sample to the samples in the training data set is calculated, then the class of the unknown sample is decided according to the class of majority of the k (k is a natural number) closest samples (Chirici et al., 2016; Devroye and Wagner, 1982).
Currently the great majority of the criminal acts have involved the use of firearms, for these reasons the evidences generate from these are one of the fundamental pillars of a forensic investigation.
The firearm leaves evidence known as gunshot residue (GSR), which is principally composed of burnt and unburnt particles from the detonation, as well as fragments of the bullet, cartridge case, and the firearm.
Gunshot residue (GSR) is produced when a firearm is discharged and large quantities of it can be transferred to an individual who has fired.
SEM-EDX is the common technique used in the forensic laboratories, the analysis consists in detecting the particles and its elements.
In this work we propose the use of X-ray Spectrometry by Total Reflection (TXRF) for the analysis of metals present in related samples in ballistic cases. The analysis was focused in the relationship of three elements present in GSR. A series of experiments with different persons firing gun of 9 mm was performed in a shooting range.
Analytical XRFS signals corresponding to K line of Copper and L lines Barium and Lead were employed as the best discriminating variables. Machine Learning techniques, such as discriminant analysis, supported vector machines and partial least squares – discriminant analysis, enable the correct classification of all samples analyzed.
A hundred samples were analyzed so far, this method has demonstrated a very high classification performance for detecting gunpowder residues in the skin.
Identifying central and peripheral nerve fibres with an artificial intelligence approach
2018, Applied Soft Computing Journal
Citation Excerpt :
Then, it could be identified in the most suitable group, either in the optic or the cochlear nerve. The authors have experience in the medical field by applying several AI techniques in classification and prediction tasks with very good results [50,51,48]. Additionally, our architecture, characterized by complexity, generalization, and flexibility, can be extended to other biological control systems.
Distinguishing axons from central or peripheral nervous systems (CNS or PNS, respectively) is often a complicated task. The main objective of this work was to facilitate and support the process of automatically distinguishing the different types of nerve fibres by analysing their morphological characteristics. Our approach was based on a multi-level hierarchical classifier architecture that can handle the complexity of directly identifying nerve-fibre groups belonging to either the CNS or the PNS. The approach adopted comprises supervised methods (multilayer perceptron and decision trees), which are responsible for distinguishing the origin of the axons (CNS or PNS), whereas the unsupervised method (K-means clustering) performs nerve fibre clustering based on similar characteristics for both the CNS and PNS. Our experiments produced results with an accuracy higher than 88%. Our findings suggest that the development and implementation of a multi-level system improves automation capabilities and increases accuracy in the classification of nerves. Furthermore, our architecture allows for generalisation and flexibility, which can subsequently be extended to other biological control systems.
A novel acoustic emission detection module for leakage recognition in a gas pipeline valve
2017, Process Safety and Environmental Protection
Citation Excerpt :
As a pattern recognition method, the support vector machine (SVM) has been extensively employed to solve classification and regression problems (Cortes and Vapnik, 1995). The SVM has been successfully used in many fields, including valve leakage (Yang et al., 2005), mechanical fault diagnosis (Oh and Sohn, 2009), image and video processing (Ding et al., 2008), medical engineering (Gil and Johnsson, 2010), and chemical engineering (Kulkarni et al., 2005). To improve the accuracy of classification, several feature parameters were calculated in the time and frequency domains.
Internal valve leakage in a natural gas pipeline seriously impairs the safe operation on pipelines, and the recognition of leakages has therefore been a major concern of the industry. In this study, a novel leakage detection scheme based on kernel principal component analysis (kernel PCA) and the support vector machine (SVM) classifier for the recognition of the leakage level is constructed. Using this approach, the acoustic signal of the leakage is obtained as the feature source using an acoustic emission (AE) sensor. The kernel PCA is used to reduce the dimensionality of the features and extract the optimal features for the classification process, and the SVM is applied to perform the recognition of the leakage levels. The performance of the classification process based on kernel PCA and the classifier are evaluated in terms of the accuracy, Cohen’s kappa number and training time. The experimental results demonstrate that the intelligent recognition model based on kernel PCA and SVM classifier is very effective for recognizing the leakage level of a valve in a natural gas pipeline.
Predicting seminal quality with artificial intelligence methods
2012, Expert Systems with Applications
Citation Excerpt :
These AI methods can also help to make decision in the next steps of the infertility treatment, focusing or not on the male partner and avoiding painful and expensive examinations on the female. The authors have experience in the field of urology by applying several AI techniques in classification tasks with very good results (Gil & Johnsson, 2010b; Gil et al., 2009, Gil, Johnsson, Garcia Chamizo, Paya, & Fernandez, 2011). This makes it possible to adapt the artificial intelligence to meet the needs of the data analyzed in this study.
Fertility rates have dramatically decreased in the last two decades, especially in men. It has been described that environmental factors, as well as life habits, may affect semen quality. Artificial intelligence techniques are now an emerging methodology as decision support systems in medicine.
In this paper we compare three artificial intelligence techniques, decision trees, Multilayer Perceptron and Support Vector Machines, in order to evaluate their performance in the prediction of the seminal quality from the data of the environmental factors and lifestyle.
To do that we collect data by a normalized questionnaire from young healthy volunteers and then, we use the results of a semen analysis to asses the accuracy in the prediction of the three classification methods mentioned above.
The results show that Multilayer Perceptron and Support Vector Machines show the highest accuracy, with prediction accuracy values of 86% for some of the seminal parameters. In contrast decision trees provide a visual and illustrative approach that can compensate the slightly lower accuracy obtained.
In conclusion artificial intelligence methods are a useful tool in order to predict the seminal profile of an individual from the environmental factors and life habits. From the studied methods, Multilayer Perceptron and Support Vector Machines are the most accurate in the prediction. Therefore these tools, together with the visual help that decision trees offer, are the suggested methods to be included in the evaluation of the infertile patient.
Hybrid principal component analysis and support vector machine model for predicting the cost performance of commercial building projects using pre-project planning variables
2012, Automation in Construction
Citation Excerpt :
Details of the two stages of the proposed hybrid model are provided in the following subsections. Input feature vectors include irrelevant dimensions and redundant dimensions, both of which decrease the prediction accuracy of the resulting model while increasing the computational cost of employing the data-mining techniques needed to construct the prediction model [30–32]. To minimize such costs, in this study the PCA approach proposed by Dunteman [33] was adopted to allow for more efficient construction of the prediction model by reducing the dimensionality of the problem before the data mining took place.
An accurate prediction of project performance in the pre-project planning stage – especially prediction of cost performance – is paramount to project stakeholders. The aim of this study is to propose and validate a hybrid predictive model for cost performance of commercial building projects using 64 variables related to the levels of definition in the pre-project planning stage. The proposed model integrates a support vector regression (SVR) model with principal component analysis (PCA). The proposed method was analyzed and validated based on 84 sets of data from an equal number of commercial building projects. Additionally, the result obtained using the proposed PCA–SVR model was compared with four other data-mining techniques. Experimental results revealed that the proposed PCA–SVR model is able to predict with high accuracy the cost performance of commercial building projects in the pre-project planning stage and is more efficient than the other four models.

View all citing articles on Scopus

View full text

ReviewUsing support vector machines in diagnoses of urological dysfunctions

Abstract

Introduction

Section snippets

Support vector machines

Urological data

Conclusions and future work

Acknowledgments

Women’s Health Medicine

Expert Systems with Applications

Journal of Molecular Biology

European Urology

Artificial Intelligence

Expert Systems with Applications

Expert Systems with Applications

Information Sciences

The Journal of Urology

Expert Systems with Applications

Robust linear programming discrimination of two linearly inseparable sets

Optimization Methods and Software

A tutorial on support vector machines for pattern recognition

Data Mining and Knowledge Discovery

Support-vector networks

Machine Learning

Filter methods

Studies in fuzziness and soft computing

Data dimensionality reduction with application to simplifying RBF network structure and improving classification performance

IEEE Transactions on Systems, Man and Cybernetics, Part B

Application of artificial neural networks in the diagnosis of urological dysfunctions

Expert Systems with Applications

Review
Using support vector machines in diagnoses of urological dysfunctions