Abstract
This paper introduces the AD+tree, a data structure for quickly counting the number of records that match conjunctive queries in a data set. The structure is useful for machine learning on large data sets. The AD+tree is an adaptation of the Dynamic AD-tree data structure [1].
We analyze the performance of AD+trees and compare them with static AD-trees and Dynamic AD-trees. We show that AD+trees maintain a very compact cache, which lets them answer queries on very large data sets efficiently, even under complex, unstructured query patterns.
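To make the query type concrete, the following is a minimal sketch of counting records that match a conjunctive query, with results memoized in a small cache. It is not the AD+tree, nor any of the AD-tree variants compared above: it answers each new query with a linear scan, which is exactly the cost those structures are designed to avoid. The class name `CachedCounter`, its `count` method, and the toy records are hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Tuple

Record = Dict[str, str]
Query = Tuple[Tuple[str, str], ...]  # sorted (attribute, value) pairs


class CachedCounter:
    """Naive conjunctive-query counter with a memo cache (not an AD+tree)."""

    def __init__(self, records: List[Record]) -> None:
        self.records = records
        self.cache: Dict[Query, int] = {}

    def count(self, **conditions: str) -> int:
        # Canonicalize the query so that attribute order does not matter:
        # color=red AND size=small shares a cache entry with the reverse.
        key: Query = tuple(sorted(conditions.items()))
        if key not in self.cache:
            # Linear scan over all records; assumed here only for simplicity.
            self.cache[key] = sum(
                all(r.get(attr) == val for attr, val in key)
                for r in self.records
            )
        return self.cache[key]


# Hypothetical toy data set with two categorical attributes.
records = [
    {"color": "red", "size": "small"},
    {"color": "red", "size": "large"},
    {"color": "blue", "size": "small"},
    {"color": "red", "size": "small"},
]
counter = CachedCounter(records)
n_red_small = counter.count(color="red", size="small")
n_red = counter.count(color="red")
n_all = counter.count()  # the empty conjunction matches every record
```

In this sketch the cache grows by one entry per distinct query, so its size depends entirely on the query pattern. How compact the cache stays under unstructured queries is the property the abstract claims for AD+trees.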
References
[1] Komarek, P., Moore, A.W.: A dynamic adaptation of AD-trees for efficient machine learning on large data sets. In: Proceedings of the 17th International Conference on Machine Learning (2000)
[2] Moore, A.W., Lee, M.S.: Cached sufficient statistics for efficient machine learning with large datasets. Journal of Artificial Intelligence Research 8 (1998)
[3] Anderson, B., Moore, A.W.: Rich probabilistic models for gene expression. In: Proceedings of the 4th International Conference on Knowledge Discovery and Data Mining, pp. 134–138. AAAI Press, Menlo Park (1998)
[4] Heckerman, D.: A tutorial for learning in Bayesian networks, pp. 301–354. Kluwer Academic Publishers, Dordrecht (1998)
[5] Dzeroski, S., Lavrac, N. (eds.): Relational Data Mining. Springer, Heidelberg (2001)
[6] Gaede, V., Günther, O.: Multidimensional access methods. ACM Computing Surveys 30(2), 170–231 (1998)
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Moraleda, J., Miller, T. (2003). AD+Tree: A Compact Adaptation of Dynamic AD-Trees for Efficient Machine Learning on Large Data Sets. In: Liu, J., Cheung, Ym., Yin, H. (eds) Intelligent Data Engineering and Automated Learning. IDEAL 2003. Lecture Notes in Computer Science, vol 2690. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45080-1_41
DOI: https://doi.org/10.1007/978-3-540-45080-1_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40550-4
Online ISBN: 978-3-540-45080-1
eBook Packages: Springer Book Archive