Abstract
The rapid growth in data volume, user base, and data diversity render Internet-accessible information increasingly difficult to be used effectively. In this paper we discuss the issues involved with knowledge discovery in knowledge bases, in particular the WWW, by presenting a general architecture and describing how it has been instantiated in a functional system we developed. The system attempts to concurrently maximize and optimize the resource/knowledge discovery, and custimize the information to individual users. A number of machine learning techniques have been employed in the development of the system for comparative reasons — results are presented and discussed.
Preview
Unable to display preview. Download preview PDF.
References
T Berners-Lee, R Caillian, A Luotonen, HF Nielsen, and A Secret. The World-Wide Web. Communications of the ACM, 37(8):76–82, 1994.
Digital Equipment Corp. AltaVista. http://altavista.digital.com/.
Excite Inc. Excite. http://www.excite.com/.
H Berghel. Cyberspace 2000: Dealing with information overload. Communications of the ACM, 40(2):19–25, 1997.
H Chen, C Schuffels, and R Orwig. Internet categorization and search: A self-organizing approach. Journal of Visual Communication and Image Representation, 7(1):88–102, 1996.
C Knoblock and Levy (eds). Agent-based knowledge discovery. AAAI Spring Symposium on Information Gathering, 1995.
B Krulwich. Learning user interests across heterogeneous document databases. AAAI Spring Symposium on Information Gathering, 1995.
W H E Davies and P Edwards. Distributed learning: An agent-based approach to data-mining. In ML95 — workshop on agents that learn from other agents, 1995.
G Piatetsky-Shapiro and W J Frawley. Knowledge Discovery in Databases. MIT press, 1991.
D Bayer. A learning agent for resource discovery on the world wide web. Master’s thesis, University of Aberdeen, 1995.
C L Green and P Edwards. Using Machine Learning to enhance software tools for internet information management. In A Franz and H Kitamo, editors, AAAI-96, Workshop on Internet-Based Information Systems, pages 48–55. AAAI Press, 1996.
G Salton and M J McGill. Introduction to Modern Information Retrieval. McGraw-Hill, 1983.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1998 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Montebello, M. (1998). Optimizing knowledge discovery over the WWW. In: Litwin, W., Morzy, T., Vossen, G. (eds) Advances in Databases and Information Systems. ADBIS 1998. Lecture Notes in Computer Science, vol 1475. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0057739
Download citation
DOI: https://doi.org/10.1007/BFb0057739
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64924-3
Online ISBN: 978-3-540-68309-4
eBook Packages: Springer Book Archive