Abstract
A multiple classifier system can only improve the performance when the members in the system are diverse from each other. Combining some methodologically different techniques is considered a constructive way to expand the diversity. This paper investigates the diversity between the two different data mining techniques, neural networks and automatically induced decision trees. Input decimation through salient feature selection is also explored in the paper in the hope of acquiring further diversity. Among various diversities defined, the coincident failure diversity (CFD) appears to be an effective measure of useful diversity among classifiers in a multiple classifier system when the majority voting decision strategy is applied. A real-world medical classification problem is presented as an application of the techniques. The constructed multiple classifier systems are evaluated with a number of statistical measures in terms of reliability and generalisation. The results indicate that combined MCSs of the nets and trees trained with the selected features have higher diversity and produce better classification results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Eckhardt, D. et al. (1985): A theoretical basis for the analysis of multiversion software subject to coincident errors. IEEE Trans. Software Eng. SE-11, pp1511–1517.
Freund, Y. & Schapire R.E. (1996): Experiments with a new boosting algorithm, in L. Saitta, ed., Machine Learning: Proceedings of the 13th national conference, Morgan Kaufmann. pp148–156.
Gedeon, T. (1997): Data mining of inputs: analysis magnitude and functional measures. Int. J. of Neural Networks. 8, pp209–218.
Hansen, L. et al. (1990): Neural network ensembles. IEEE Trans. Patterns Analysis and Machine Intelligence, vol. 12 pp993–1001.
Krogh, K. & Vedelsby, J. (1995): Neural network ensembles, cross-validation and active learning. In G. Tesauro, D.S Touretzky, and T.K. Leen, eds, Advances in Neural Information Processing System. vol.7. pp231–238, MIT press.
Littlewood, B. & Miller, D. (1989): Conceptual modelling of coincident failures in multiversion software. IEEE Trans. Software Eng. vol. 15, no. 12, pp1596–1614.
Partridge, D. et al. (1996): Engineering multiversion neural-net systems. Neural Computation, vol. 8, pp869–893.
Partridge, D. & Krzanowski, W. (1997): Software diversity: practical statistics for its measurement and exploitation. Information and Software Technology, vol. 39, pp707–717.
Quinlan, J. R. (1986): Induction of decision trees. Machine Learning, 1, pp81–106.
Tchaban, T., Taylor, M J, & Griffin, J.P. (1998): Establishing impacts of the inputs in a feedforward network, Neural Computing & Applications, 7, pp309–317.
Tuner, K. & Oza, N. (1999): Decimated input ensembles for improved generalisation. Proceedings of IJCNN1999, Washington DC, October, 1999.
Wang, W. & Partridge, D. (1998): Multiversion neural network systems. Proceedings of neural networks and their applications, NEURAP’98, Marseilles, France, 1998, p351–357.
Wang, W., Jones, P. & Partridge, D. (1998): Ranking pattern recognition features for neural networks. in S. Singh, ed. Advances in Patterns Recognition, Springer, pp232–241.
Wang, W., Jones, P. & Partridge, D. (1999): Assessing the impact of input features in a feedforward network. Neural Computing and Applications. in press.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, W., Jones, P., Partridge, D. (2000). Diversity between Neural Networks and Decision Trees for Building Multiple Classifier Systems. In: Multiple Classifier Systems. MCS 2000. Lecture Notes in Computer Science, vol 1857. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45014-9_23
Download citation
DOI: https://doi.org/10.1007/3-540-45014-9_23
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67704-8
Online ISBN: 978-3-540-45014-6
eBook Packages: Springer Book Archive