On the Accuracy of Meta-learning for Scalable Data Mining

Chan, Philip K.; Stolfo, Salvatore J.

doi:10.1023/A:1008640732416

Published: January 1997

Volume 8, pages 5–28, (1997)

Journal of Intelligent Information Systems

Abstract

In this paper, we describe a general approach to scaling data mining applications that we have come to call meta-learning. Meta-learning refers to a general strategy that seeks to learn how to combine a number of separate learning processes in an intelligent fashion. We desire a meta-learning architecture that exhibits two key behaviors. First, the meta-learning strategy must produce an accurate final classification system. This means that a meta-learning architecture must produce a final outcome that is at least as accurate as a conventional learning algorithm applied to all available data. Second, it must be fast, relative to an individual sequential learning algorithm when applied to massive databases of examples, and operate in a reasonable amount of time. This paper focuses primarily on issues related to the accuracy and efficacy of meta-learning as a general strategy. A number of empirical results are presented demonstrating that meta-learning is technically feasible in wide-area, network computing environments.
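The general idea in the abstract, combining several separately trained classifiers through a further learning step, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not specify base learners, partitioning scheme, or combining rule, so everything here is an assumption made for illustration. The base learner is a nearest-centroid classifier, the training data is split into disjoint partitions, and the hypothetical `train_combiner` learns a lookup table from tuples of base predictions to the most frequent true label on a held-out validation set. The synthetic data and all function names are invented for the example.

```python
import random
from collections import Counter, defaultdict


def train_centroid(examples):
    """Base learner (assumed for illustration): nearest-centroid classifier."""
    sums, counts = {}, Counter()
    for x, y in examples:
        counts[y] += 1
        acc = sums.setdefault(y, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
    centroids = {y: [s / counts[y] for s in acc] for y, acc in sums.items()}

    def predict(x):
        # Return the label whose centroid is closest in squared Euclidean distance.
        return min(
            centroids,
            key=lambda y: sum((a - b) ** 2 for a, b in zip(x, centroids[y])),
        )

    return predict


def train_combiner(base_classifiers, validation):
    """Meta-learner (assumed): map each tuple of base predictions to the
    most common true label observed for that tuple on the validation set."""
    table = defaultdict(Counter)
    for x, y in validation:
        key = tuple(clf(x) for clf in base_classifiers)
        table[key][y] += 1

    def predict(x):
        key = tuple(clf(x) for clf in base_classifiers)
        if key in table:
            return table[key].most_common(1)[0][0]
        # Unseen combination of base predictions: fall back to a majority vote.
        return Counter(key).most_common(1)[0][0]

    return predict


def meta_learn(train, validation, n_partitions):
    """Train one base classifier per disjoint partition, then a combiner."""
    partitions = [train[i::n_partitions] for i in range(n_partitions)]
    base = [train_centroid(part) for part in partitions]
    return train_combiner(base, validation)


def make_data(n, rng):
    """Synthetic two-class data: Gaussian blobs centred at (0, 0) and (3, 3)."""
    data = []
    for _ in range(n):
        label = rng.choice([0, 1])
        centre = 0.0 if label == 0 else 3.0
        data.append(((rng.gauss(centre, 1.0), rng.gauss(centre, 1.0)), label))
    return data


def accuracy(clf, data):
    return sum(clf(x) == y for x, y in data) / len(data)


rng = random.Random(0)
train = make_data(400, rng)
validation = make_data(100, rng)
test = make_data(200, rng)

meta = meta_learn(train, validation, n_partitions=4)
single = train_centroid(train)  # conventional learner applied to all data

meta_acc = accuracy(meta, test)
single_acc = accuracy(single, test)
print(f"meta-learned: {meta_acc:.3f}  single learner: {single_acc:.3f}")
```

Comparing `meta_acc` with `single_acc` mirrors the first criterion stated in the abstract, that the combined system should be at least as accurate as one learner trained on all the data. Because each base classifier sees only its own partition, the base training steps could in principle run in parallel, which relates to the second criterion; this toy example does not measure running time.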

Author information

Authors and Affiliations

Computer Science, Florida Institute of Technology, Melbourne, FL, 32901
Philip K. Chan
Department of Computer Science, Columbia University, New York, NY, 10027
Salvatore J. Stolfo

About this article

Cite this article

Chan, P.K., Stolfo, S.J. On the Accuracy of Meta-learning for Scalable Data Mining. Journal of Intelligent Information Systems 8, 5–28 (1997). https://doi.org/10.1023/A:1008640732416

Issue Date: January 1997
DOI: https://doi.org/10.1023/A:1008640732416
