Abstract
Intelligent data analysis implies the reasoned application of autonomous or semi-autonomous tools to data sets drawn from problem domains. Automation of this process of reasoning about analysis (based on factors such as available computational resources, cost of analysis, risk of failure, lessons learned from past errors, and tentative structural models of problem domains) is highly non-trivial. By casting the problem of reasoning about analysis (MetaReasoning) as yet another data analysis problem domain, we have previously [R. Levinson and J. Wilkinson, in Advances in Intelligent Data Analysis, edited by X. Liu, P. Cohen, and M. Berthold, volume LNCS 1280, Springer-Verlag, Berlin, pp. 89–100, 1997] presented a design framework, MetaReasoning for Data Analysis Tool Allocation (MRDATA). Crucial to this framework is the ability of a Tool Allocator to track resource consumption (i.e. processor time and memory usage) by the Tools it employs, as well as the ability to allocate measured quantities of resources to these Tools. In order to test implementations of the MRDATA design, we now implement a Runtime Environment for Data Analysis Tool Allocation, RE:DATA. Tool Allocators run as processes under RE:DATA, are allotted system resources, and may use these resources to run their Tools as spawned sub-processes. We also present designs of native RE:DATA implementations of analysis tools used by MRDATA: K-Nearest Neighbor Tables, Regression Trees, Interruptible (“Any-Time”) Regression Trees, and “Hierarchy Diffusion” Temporal Difference Learners. Preliminary results are discussed and techniques for integration with non-native tools are explored.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Stuart Russell and Eric Wefald, Do The Right Thing: Studies in Limited Rationality, MIT Press: Cambridge, Massachusetts, 1991.
R.S. Sutton, “Special issue on reinforcement learning,” Machine Learning, 1991.
R.S. Sutton, “Learning to predict by the methods of temporal differences,” Machine Learning, vol. 3,no. 1, pp. 9-44, 1988.
L. Brieman, Classification and Regression Trees, The Wadsworth statistics/probability series, Wadsworth International Group, 1984.
S. Omohundro, Efficient algorithms with neural network behavior. Technical Report UIUCDCS-R-87-1331, University of Illinois, April 1987.
A.V. Balakrishnan, I. Karatzas, and M. Yor (Ed.), Probabilistic Theory of Pattern Recognition, volume 3 of Applications of Mathematics, Springer-Verlag, 1996.
Pat Langley, Elements of Machine Learning, Morgan Kaufman Publishers, 1996.
R.M. Axelrod, The Evolution of Cooperation, Basic Books: New York, 1984.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Wilkinson, J., Levinson, R. Research Environment for Data Analysis Tool Allocators. Applied Intelligence 11, 241–258 (1999). https://doi.org/10.1023/A:1008382825019
Issue Date:
DOI: https://doi.org/10.1023/A:1008382825019