Knowledge-based approach to septic shock patient data using a neural network with trapezoidal activation functions

https://doi.org/10.1016/S0933-3657(03)00057-5Get rights and content

Abstract

In this contribution we present an application of a knowledge-based neural network technique in the domain of medical research. We consider the crucial problem of intensive care patients developing a septic shock during their stay at the intensive care unit. Septic shock is of prime importance in intensive care medicine due to its high mortality rate. Our analysis of the patient data is embedded in a medical data analysis cycle, including preprocessing, classification, rule generation and interpretation. For classification and rule generation we chose an improved architecture based on a growing trapezoidal basis function network for our metric variables. Our results extend those of a black box classification and give a deeper insight in our patient data. We evaluate our results with classification and rule performance measures. For feature selection we introduce a new importance measure.

Introduction

During the last years many scientists have published medical applications of neural networks for classification analysis, e.g. [2], [22], [34], [43]. We have learned that supervised neural networks usually adapt better to data with highly overlapping or nonlinear class borders than statistical regression does [29], [37].

Standard statistical regression and standard neural network techniques like backpropagation do not explain their classification results by rules. Particularly physicians as a main interest group are interested in such rules to get insight in the classification process, e.g. to draw conclusions for therapy. Thus, scientists have developed methods that allow the generation of rules within the classification process or the extraction of rules after the completed classification process. In Section 2.1, we give a short overview of such alternatives. The subsequent layout of the paper is described below.

Since our main goal is the application of a knowledge based method to septic shock patient data, we choose one of the algorithms that are introduced in Section 2.1 for this task, discussing our choice. The main ideas of the chosen algorithm are described in Section 2.2. During the experimental phase we realized that the algorithm could be improved, regarding the overlapping behavior of the neurons and the shrink-mechanism [28], see Section 2.3.

We repeat all the experiments with random partitions of the data into training and test data to get meaningful, statistically reasonable results. We evaluate our classification results with standard performance measures, e.g. classification error on training and test datasets. Each one of the rules is evaluated with a frequency and confidence measure (Section 3.1). Another question concerning rules is the global importance of each variable. We propose an importance measure for this feature selection task in Section 3.2.

Before applying the improved network to our septic shock patient data, we present some results on well-known benchmark datasets in Section 4 to point to our improvements and to clarify the general usefulness of our new measure “importance.”

Septic shock is one of the most common reasons of death in intensive care units (ICUs). Of course, this is reason enough to explore the causes in detail. Our analysis is restricted to abdominal intensive care patients who developed a septic shock during their stay at the ICU. The abdominal septic shock has a high mortality rate in the ICU of up to 50%. Some more details are described in Section 5.1. Our analysis is retrospectively based on a medical database. Thus, preprocessing steps for data quality improvement are necessary (Section 5.2). We present all the results concerning our septic shock patient data in Section 5.3. Some interesting insights are found compared to a mere classification procedure. Finally, we discuss our results in Section 6. We find out that it is not possible to clearly reduce the same dimensions for all the rules if we use septic shock data sampled from the entire time series. But we can generate several performant rules with less dimensions that give insight in the data.

Section snippets

Rule generation for metric data

In principle there are two kinds of medical data: metric (numerical) data (blood pressure, heart frequency, doses of medicaments, etc.) and categorical (symbolic) data (operations, diagnoses, therapies) including binary data (yes/no) or medical codes such as ICD10 or OP301 [42]. Metric data could further be divided into biosignal data (e.g. EEG, MEG), sampled with an adequate sampling rate and measurement data from patient records, recorded by physicians irregularly whenever they considered it

Performance measures

To evaluate the performance of our classification and rule generation results, we reproduce relevant performance measures that are commonly used. A new method to rate the global importance of variables for feature selection (dimension reduction) is introduced. Our aim is to take into account the specific rule structure of the neuro-fuzzy rule set. We do not consider more general or more arbitrary, ad hoc defined rule interestingness measures [15], [46].

An important aspect concerning neural

Application to benchmark datasets

Before applying the algorithm (MGT) to our septic shock patient data, we test it on benchmark datasets. The spiral data serve as a good (two-dimensional) example of nonlinear data to visualize the generated rules by (ALG) and (MGT). “Cancer1” [36] serves as a medical benchmark dataset to evaluate our importance measure.

Application to septic shock patient data

In this section, we review shortly the septic shock problem in intensive care medicine. Then, we describe our preprocessing steps for preparing the data for analysis. In fact, preprocessing of multivariate time series with missing values—the usual case in medical databases—is very time consuming although very important [40]. Finally, we present our results.

Discussion and conclusion

We have presented our data analysis approach for the important medical problem septic shock with an emphasis on rule generation for metric data. The results are a major extension of preliminary work (preprocessing, classification) [11], [33], now providing us with understandable knowledge for classification.

We have reviewed and improved the algorithm [17] with regard to the overlapping behavior and the shrinking mechanism to generate more performant rules. Our results on benchmark data

Acknowledgements

The work was done within the project MEDAN http://www.medan.de (Ref. no. HA 1456/7-2), supported by the German Research Foundation (DFG). The author thanks all the participants of the MEDAN working group especially Dr. Brause and Prof. Hanisch for supporting my work.

References (47)

  • Castellano G, Fanelli AM. Modeling fuzzy classification systems with compact rule base. In: Mohammadian M, editor....
  • Fein AM, et al., editors. Sepsis and multiorgan failure. Baltimore: Lippincott Williams & Wilkins;...
  • Fritzke B. Incremental neuro-fuzzy systems. In: Bosacchi B, Bezdek JC, Fogel DB, editors. Proceedings of the...
  • Hamker F, Paetz J, Thöne S, Brause R, Hanisch E. Erkennung kritischer Zustände von Patienten mit der Diagnose...
  • Hanisch E, Encke A. Intensive care management in abdominal surgical patients with septic complications. In: Faist E,...
  • R.M. Hardaway

    A review of septic shock

    Am Surg

    (2000)
  • Haykin S. Neural networks: a comprehensive foundation. 2nd ed. Upper Saddle River: Prentice-Hall;...
  • Hilderman RJ, Hamilton H. Knowledge discovery and interestingness measures: a survey. Technical Report CS 99-04....
  • Huber K-P. Datenbasierte Metamodellierung mit automatisch erzeugten Fuzzy-Regeln. VDI-Verlag, Doctoral thesis,...
  • Huber K-P, Berthold MR. Building precise classifiers with automatic rule extraction. In: Proceedings of the IEEE...
  • J.-S.R. Jang

    ANFIS: adaptive-network-based fuzzy inference system

    IEEE Trans Syst—Man and Cybernet

    (1993)
  • J.-S.R. Jang et al.

    Functional equivalence between radial basis function networks and fuzzy inference systems

    IEEE Trans Neur Networks

    (1993)
  • Jolliffe IT, Principal component analysis. New York: Springer;...
  • Cited by (38)

    • Septic shock prediction for ICU patients via coupled HMM walking on sequential contrast patterns

      2017, Journal of Biomedical Informatics
      Citation Excerpt :

      Decision trees were also employed by Gwadry-Sridhar et al. [50] for 20 clinical variables, achieving nearly 100% predictive accuracy. Among the soft computing techniques, numerous wrapper-based feature selection and preprocessing methods, namely Zero-Order-Hold, and missing-value imputation techniques have been employed along with particle swarm optimization, fuzzy models, and neural networks, to improve septic shock classification performance [51,49,55,56,58]. Selecting appropriate clinical features turns out to an important concern for predicting cases of septic shock.

    • Missing data in medical databases: Impute, delete or classify?

      2013, Artificial Intelligence in Medicine
      Citation Excerpt :

      This information is new knowledge and we believe will substantively inform study designs of this type. In the final stage following classification, missing segments that are classified as sampling related are imputed by averaging the existing samples on each side of the data gap of the same variable, as in [13]. As previously detailed, our aim is to determine if the true missing data are recoverable or not-recoverable and for this purpose we use fuzzy models.

    • Modified binary PSO for feature selection using SVM applied to mortality prediction of septic patients

      2013, Applied Soft Computing Journal
      Citation Excerpt :

      A comparison was made between the studied wrapper methods and other FS methods already applied to the problem of sepsis outcome prediction. These wrapper approaches are: neuro-fuzzy modeling using only the 12 feature dataset [47], bottom-up (BU) using fuzzy modeling and neural networks [48] and ant feature selection (AFS) using fuzzy modeling and neural networks [48]. The results are depicted in Tables 6 and 7 for the 12 and 28 features datasets respectively.

    View all citing articles on Scopus
    View full text