Abstract
The results of knowledge discovery in data bases could vary depending on the data mining method. There are several ways to select the most appropriate data mining method dynamically. One proposed method clusters the whole domain area into “competence areas” of the methods. A metamethod is then used to decide which data mining method should be used with each data base instance. However, when knowledge is extracted from several data bases knowledge discovery may produce conflicting results even if the separate data bases are consistent. At least two types of conflicts may arise. The first type is created by data inconsistency within the area of the intersection of the data bases. The second type of conflicts is created when the metamethod selects different data mining methods with inconsistent competence maps for the objects of the intersected part. We analyze these two types of conflicts and their combinations and suggest ways to handle them.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chan, P.: An Extensible Meta-Learning Approach for Scalable and Accurate Inductive Learning. Ph.D. Thesis, Columbia University (1996)
Chan, P., Stolfo, S.: On the Accuracy of Meta-Learning for Scalable Data Mining. Intelligent Information Systems 8 (1997) 5–28
Dietterich, T.: Machine Learning Research: Four Current Directions. AI Magazine 18 (1997) 97–136
Fayyad, U., et al.: Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press (1997)
Kohavi, R.: A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. In: The Proceedings of IJCAI’95 (1995)
Koppel, M., Engelson, S.: Integrating Multiple Classifiers by Finding their Areas of Expertise. In: AAAI-96 Workshop On Integrating Multiple Learning Models (1996) 53–58
Liu, H., Lu, H., Yao, J.: Identifying Relevant Databases for Multidatabase Mining. In: The Proceedings of the PAKDD’98, Melbourne, Australia, Springer Verlag (1998)
Merz, C.: Dynamical Selection of Learning Algorithms. In: Fisher, D., Lenz, H.-J. (eds.), Learning from Data, Artificial Intelligence and Statistics, Springer Verlag, NY (1996)
Skalak, D.: Combining Nearest Neighbor Classifiers. Ph.D. Thesis, Dept. of Computer Science, University of Massachusetts, Amherst, MA (1997)
Terziyan, V., Tsymbal, A., Puuronen, S.: The Decision Support System for Telemedicine Based on Multiple Expertise. International Journal of Medical Informatics 49 (1998) 217–229
Tsymbal, A., Puuronen, S., Terziyan, V.: Advanced Dynamic Selection of Diagnostic Methods. In: Proceedings of the CBMS’98, IEEE CS Press, Lubbock, Texas (1998) 50–54
Wolpert, D.: Stacked Generalization. Neural Networks 5 (1992) 241–259
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Puuronen, S., Terziyan, V., Logvinovsky, A. (1999). Mining Several Databases with an Ensemble of Classifiers. In: Bench-Capon, T.J., Soda, G., Tjoa, A.M. (eds) Database and Expert Systems Applications. DEXA 1999. Lecture Notes in Computer Science, vol 1677. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48309-8_83
Download citation
DOI: https://doi.org/10.1007/3-540-48309-8_83
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66448-2
Online ISBN: 978-3-540-48309-0
eBook Packages: Springer Book Archive