Abstract
The proliferation of information sources available on the Wide World Web has resulted in a need for database selection tools to locate the potential useful information sources with respect to the user’s information need. Current database selection tools always treat each database independently, ignoring the implicit, useful associations between distributed databases. To overcome this shortcoming, in this paper, we introduce a data-mining approach to assist the process of database selection by extracting potential interesting association rules between web databases from a collection of previous selection results. With a topic hierarchy, we exploit intraclass and interclass associations between distributed databases, and use the discovered knowledge on distributed databases to refine the original selection results. We present experimental results to demonstrate that this technique is useful in improving the effectiveness of database selection.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rles between Sets of Items in Large Databases. In: Proceedings of the 1993 Acm Sigmod International Conference on Management of Data, pp. 26–28 (1993)
Callan, J.P., Lu, Z., Croft, W.B.: Searching Distributed Collections with Inference Networks. In: Proceedings of the 19th Annual International Acm Sigir Conference on Research and Development in Information Retrieval, pp. 21–29 (1995)
Gravano, L., Garcia-Molina, H., Tomasic, A.: Gloss: Text-Source Discovery over the Internet. ACM Transactions on Database Systems 24(2), 229–264 (1999)
Hawking, D., Thistlewaite, P.: Methods for Information Server Selection. ACM Transaction on Information System 17(1), 40–76 (1999)
Kantardzic, M.: Data Mining-Concepts, Models, Methods, and Algorithms. IEEE Press, New Jork (2002)
Yang, H., Zhang, M.: A Language Modeling Approach to Search Distributed Text Databases. In: The Proceedings of 16th Australian Joint Conference on Artificial Intelligence, Perth, Australia, pp. 196–207 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, H., Zhang, M., Shi, Z. (2004). Association-Rule Based Information Source Selection. In: Zhang, C., W. Guesgen, H., Yeap, WK. (eds) PRICAI 2004: Trends in Artificial Intelligence. PRICAI 2004. Lecture Notes in Computer Science(), vol 3157. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28633-2_60
Download citation
DOI: https://doi.org/10.1007/978-3-540-28633-2_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22817-2
Online ISBN: 978-3-540-28633-2
eBook Packages: Springer Book Archive