Abstract
In this paper we propose combining different methods to improve the data analysis process. Specifically, we combine inductive and deductive techniques: inductive techniques generate hypotheses from data, whereas deductive techniques derive knowledge and verify hypotheses. To guide users through the analysis process, we have developed a system that integrates deductive tools, data mining tools (such as classification and feature selection algorithms), visualization tools, and tools for the easy manipulation of data sets. The system is currently used in a large project that aims to integrate information sources containing data on the socio-economic aspects of Calabria and to analyze the integrated data. Several experiments on socio-economic indicators of Calabrian cities have shown that the combined use of different techniques improves both the comprehensibility and the accuracy of models.
Work partially supported by a MURST grant under the projects “Data-X” and “Piano Telematico Calabria”.
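The interplay between the inductive and deductive steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' system: the city records, the attribute names (`unemployment`, `income`, `population`, `deprived`), and the helper functions `induce` and `counterexamples` are all hypothetical, and a one-feature threshold rule stands in for the classification and feature selection algorithms the abstract mentions. The inductive step searches the data for the best single-feature rule; the verification step then checks that rule against known facts and reports any record that contradicts it.

```python
from typing import Dict, List, Optional, Tuple

Record = Dict[str, float]
# A hypothesis is (feature, threshold, positive_if_above).
Hypothesis = Tuple[str, float, bool]

# Hypothetical toy data: made-up indicators for six made-up cities.
CITIES: List[Record] = [
    {"unemployment": 0.22, "income": 11.0, "population": 30.0, "deprived": 1},
    {"unemployment": 0.18, "income": 12.5, "population": 80.0, "deprived": 1},
    {"unemployment": 0.09, "income": 17.0, "population": 25.0, "deprived": 0},
    {"unemployment": 0.07, "income": 19.5, "population": 90.0, "deprived": 0},
    {"unemployment": 0.20, "income": 10.5, "population": 60.0, "deprived": 1},
    {"unemployment": 0.10, "income": 16.0, "population": 45.0, "deprived": 0},
]


def predict(h: Hypothesis, record: Record) -> int:
    """Apply a threshold rule to one record and return the predicted class."""
    feature, threshold, positive_if_above = h
    above = record[feature] > threshold
    return int(above == positive_if_above)


def induce(records: List[Record], features: List[str],
           target: str) -> Tuple[Optional[Hypothesis], float]:
    """Inductive step: pick the single-feature rule with the best accuracy.

    Choosing the best feature doubles as a crude form of feature selection.
    """
    best_h: Optional[Hypothesis] = None
    best_acc = -1.0
    for feature in features:
        values = sorted({r[feature] for r in records})
        # Candidate thresholds are midpoints between consecutive values.
        for lo, hi in zip(values, values[1:]):
            threshold = (lo + hi) / 2
            for positive_if_above in (True, False):
                h = (feature, threshold, positive_if_above)
                hits = sum(predict(h, r) == r[target] for r in records)
                acc = hits / len(records)
                if acc > best_acc:  # strict: the first best rule wins ties
                    best_h, best_acc = h, acc
    return best_h, best_acc


def counterexamples(h: Hypothesis, records: List[Record],
                    target: str) -> List[Record]:
    """Verification step: return the known facts that contradict the rule."""
    return [r for r in records if predict(h, r) != r[target]]


hypothesis, accuracy = induce(
    CITIES, ["unemployment", "income", "population"], "deprived")
violations = counterexamples(hypothesis, CITIES, "deprived")
print(hypothesis, accuracy, violations)
```

In this sketch a hypothesis that survives verification is also easy to read (a single threshold on a single indicator), which loosely mirrors the abstract's claim about comprehensibility. A real deductive component would reason over rules and derived facts rather than simply re-scanning the records.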
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Greco, S., Masciari, E., Pontieri, L. (2001). Combining Different Data Mining Techniques to Improve Data Analysis. In: Larsen, H.L., Andreasen, T., Christiansen, H., Kacprzyk, J., Zadrożny, S. (eds) Flexible Query Answering Systems. Advances in Soft Computing, vol 7. Physica, Heidelberg. https://doi.org/10.1007/978-3-7908-1834-5_42
DOI: https://doi.org/10.1007/978-3-7908-1834-5_42
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-1347-0
Online ISBN: 978-3-7908-1834-5
eBook Packages: Springer Book Archive