Abstract
This paper presents preliminary works on an agent-based approach for distributed learning of decision trees. The distributed decision tree approach is applied to intrusion detection domain, the interest of which is recently increasing. In the approach, a network profile is built by applying a distributed data analysis method for the collection of data from distributed hosts. The method integrates inductive generalization and agent-based computing, so that classification rules are learned via tree induction from distributed data to be used as intrusion profiles. Agents, in a collaborative fashion, generate partial trees and communicate the temporary results among them in the form of indices to the data records. Experimental results are presented for military network domain data used for the network intrusion detection in KDD cup 1999. Several experimental results show that the performance of distributed version of decision tree is much better than that of non-distributed version with data collected manually from distributed hosts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Zaidi, S.Z.H., Abidi, S.S.R., Manickam, S.: Distributed data mining from heterogeneous healthcare data repositories: towards an intelligent agent-based framework. In: Proceedings of the 15th IEEE Symposium on Computer-Based Medical Systems (CBMS), pp. 339–342 (2002)
Abidi, S.S.R.: Applying Knowledge Discovery in Healthcare: An Info-Structure for Delivering Knowledge-Driven Strategic Services. In: Medical Informatics Europe 1999, pp. 453–456. IOS Press, Amsterdam (1999)
Krishnaswamy, S., Zaslavsky, A., Loke, S.W.: An architecture to support distributed data mining services in e-commerce environments. In: Proceedings of Second International Workshop on Advanced Issues of E-Commerce and Web-Based Information Systems (WECWIS), pp. 239–246 (2000)
Chan, P.K., Fan, W., Prodromidis, A.L., Stolfo, S.J.: Distributed data mining in credit card fraud detection, Intelligent Systems. IEEE [see also IEEE Expert] 14(6), 67–74 (1999)
Kargupta, H., Park, B.: Collective Data Mining: A New Perspective toward Distributed Data Analysis. In: Kargupta, H., Chan, P. (eds.) Advanced in Distributed and Parallel Knowledge Discovery, pp. 133–184. AAAI/MIT Press (2000)
Cheung, D.W., Ng, V.T., Fu, A.W., Fu, Y.: Efficient mining of association rules in distributed databases. Proceedings of the IEEE Transactions on Knowledge and Data Engineering 8(6), 911–922 (1996)
Yamanish, K.: Distributed cooperative Bayesian learning strategies. In: Proceedings of COLT 1997 (ACM), pp. 250–262 (1997)
Grossman, R.L., Yunhong, G., Hanley, D., Xinwei, H., Levera, J., Mazzucco, M., Lillethun, D., Mambretti, J., Weinberger, J.: Mass Storage Systems and Technologies (MSST). In: Proceedings of the 20th IEEE/11th NASA Goddard Conference, pp. 62–66 (2003)
Wei, D., Agrawal, G.: Developing distributed data mining implementations for a grid Environment. In: Cluster Computing and the Grid 2nd IEEE/ACM International Symposium (CCGRID), pp. 410–411 (2002)
Cannataro, M., Talia, D., Trunfio, P.: Distributed data mining on the grid. Future Generation Computer Systems 18(8), 1101–1112 (2002)
Cannataro, M.: Clusters and Grids for Distributed and Parallel Knowledge Discovery. In: Williams, R., Afsarmanesh, H., Bubak, M., Hertzberger, B. (eds.) HPCN-Europe 2000. LNCS, vol. 1823, pp. 708–715. Springer, Heidelberg (2000)
Zaidi, S.Z.H., Abidi, S.S.R., Manickam, S.: Distributed data mining from heterogeneous healthcare data repositories: towards an intelligent agent-based framework. In: Proceedings of the 15th IEEE Symposium on Computer-Based Medical Systems (CBMS), pp. 339–342 (2002)
Klusch, M., Lodi, S., Moro, G.: Agent-Based Distributed Data Mining: The KDEC Scheme. In: Klusch, M., Bergamaschi, S., Edwards, P., Petta, P. (eds.) Intelligent Information Agents. LNCS (LNAI), vol. 2586, pp. 104–122. Springer, Heidelberg (2003)
Quinlan, J.R., Rivest, R.L.: Inferring Decision Trees Using the Minimum Description Length Principle. Information and Computation 80(3) (1989)
See Web site at http://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Baik, S., Bala, J. (2004). A Decision Tree Algorithm for Distributed Data Mining: Towards Network Intrusion Detection. In: Laganá, A., Gavrilova, M.L., Kumar, V., Mun, Y., Tan, C.J.K., Gervasi, O. (eds) Computational Science and Its Applications – ICCSA 2004. ICCSA 2004. Lecture Notes in Computer Science, vol 3046. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24768-5_22
Download citation
DOI: https://doi.org/10.1007/978-3-540-24768-5_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22060-2
Online ISBN: 978-3-540-24768-5
eBook Packages: Springer Book Archive