Abstract
This paper proposes an architecture for the functional composition of Web databases (WebDBs). Unlike a general search engine, which receives keywords and returns a list of URLs, a WebDB receives a complex query and returns a list of records. The complex query specifies a condition on each field of the records. The process of composing WebDBs is described as a script, in which a user chooses the target WebDBs and describes how to connect the output of one WebDB to the input of another and how to generate the final output. The novelty of the proposal is twofold: WebDBs and output formats are treated as components at the same level, and the reuse of new keywords is represented as a connection (CGI links). Once the process is described as a script, the user can use the script for a new WebDB of their own.
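The composition idea in the abstract can be illustrated with a minimal sketch. It is not the paper's script language or implementation: it assumes a WebDB can be modelled as a function from a field-condition query to a list of records, and that a connection simply maps output fields of one WebDB to input fields of the next. All names (`make_table_db`, `compose`, `as_lines`) and the sample data are hypothetical, and the in-memory tables stand in for real CGI-backed databases.

```python
from typing import Callable, Dict, List

Record = Dict[str, str]
Query = Dict[str, str]                      # field name -> required value
WebDB = Callable[[Query], List[Record]]     # assumed model of a WebDB


def make_table_db(rows: List[Record]) -> WebDB:
    """Hypothetical stand-in for a WebDB: an in-memory table that returns
    every row whose fields satisfy all conditions in the query."""
    def search(query: Query) -> List[Record]:
        return [r for r in rows
                if all(r.get(field) == value for field, value in query.items())]
    return search


def compose(first: WebDB, second: WebDB, link: Dict[str, str]) -> WebDB:
    """Connect `first` to `second`. `link` maps an output field of `first`
    to the input field of `second` that should receive its value.
    The result is itself a WebDB, so it can be composed again."""
    def composite(query: Query) -> List[Record]:
        results: List[Record] = []
        for rec in first(query):
            next_query = {dst: rec[src] for src, dst in link.items()}
            for hit in second(next_query):
                results.append({**rec, **hit})   # merge both records
        return results
    return composite


def as_lines(db: WebDB, fields: List[str]) -> Callable[[Query], List[str]]:
    """An output format written as a component wrapped around a WebDB."""
    def formatted(query: Query) -> List[str]:
        return [", ".join(rec.get(f, "") for f in fields) for rec in db(query)]
    return formatted


# Hypothetical sample data, for illustration only.
books = make_table_db([
    {"author": "Knuth", "title": "TAOCP"},
    {"author": "Knuth", "title": "Concrete Mathematics"},
    {"author": "Ullman", "title": "Automata"},
])
holdings = make_table_db([
    {"title": "TAOCP", "library": "Central"},
    {"title": "Automata", "library": "Science"},
])

# Feed each title found in `books` into the `title` field of `holdings`.
books_with_holdings = compose(books, holdings, {"title": "title"})
report = as_lines(books_with_holdings, ["title", "library"])
```

Calling `books_with_holdings({"author": "Knuth"})` queries the first table by author and then looks up each resulting title in the second. Because the composite has the same shape as its parts, it can be passed to `compose` or `as_lines` like any other WebDB in this sketch.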
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mori, M., Nakatoh, T., Hirokawa, S. (2006). Functional Composition of Web Databases. In: Sugimoto, S., Hunter, J., Rauber, A., Morishima, A. (eds) Digital Libraries: Achievements, Challenges and Opportunities. ICADL 2006. Lecture Notes in Computer Science, vol 4312. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11931584_47
DOI: https://doi.org/10.1007/11931584_47
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49375-4
Online ISBN: 978-3-540-49377-8
eBook Packages: Computer Science (R0)