Abstract
We present a system for searching, collecting, and integrating Web-resident data. The system consists of five tools, where each tool provides a specific functionality aimed at solving one aspect of the complex task of using and managing Web data. Each tool can be used in a stand-alone mode, in combination with the other tools, or even in conjunction with other systems. Together, the tools offer a wide range of capabilities that overcome many of the limitations in existing systems for harnessing Web data. The paper describes each tool, possible ways of combining the tools, and the architecture of the combined system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
West university, http://www.west.edu/datamining/publications.html
Abiteboul, S., Cluet, S., Milo, T.: Querying the file. In: Proc. of Intl. Conf. on Very Large Data Bases, Dublin (1993)
Abiteboul, S., Cluet, S., Milo, T.: A database interface for files update. In: Proc. ACM SIGMOD Int. Conf. on Management of Data (May 1995)
Abiteboul, S., Cluet, S., Milo, T.: Correspondence and Translation for Hetero-geneous Data. In: Proc. Int. Conf. on Database Theory (ICDT), pp. 351–363 (1997)
Atzeni, P., Labonia, S., Masci, A., Mecca, G., Merialdo, P., Tabet, E.: The ARANEUS Project, http://poincare.inf.uniroma3.it:8080/Araneus/araneus.html
Bell, G., Parisi, A., Pesce, M.: The Virtual Reality Modeling Language: Version 1 Specification (May 1995), http://www.virtpark.com/theme/vrml/
Buneman, P., Davidson, S., Hart, K., Overton, C., Wong, L.: A data transformation system for biological data sources. In: Proc. Int. Conf. on Very Large Data Bases (VLDB), Zurich, Switzerland, pp. 158–169 (1995)
Buneman, P., Davidson, S., Suciu, D.: Programming constructs for unstructured data (May 1996)
Carey, M.J., et al.: Towards heterogeneous multimedia information systems: The Garlic approach. Technical Report RJ 9911, IBM Almaden Research Center (1994)
Christophides, V., Abiteboul, S., Cluet, S., Scholl, M.: From structured documents to novel query facilities. In: Proc. ACM Sigmod, Minneapolis (1994)
Chang, T.-P., Hull, R.: Using witness generators to support bi-directional up-date between object-based databases. In: Proc. ACM SIGMOD/SIGACT Conf. on Princ. of Database Syst. (PODS), San Jose, California (May 1995)
Consens, M., Milo, T.: Optimizing Queries on Files. In: ACM SIGMOD Int. Conf. on Management of Data, Minneapolis, Minnesota, May 1994, pp. 301–312 (1994)
Das Neves, F.: The aleph: A tool to spatially represent user knowledge about the www docuverse. In: Proc. ACM Hypertext 1997 (1997)
Doemel, P.: WebMap - a graphical hypertext navigation tool. In: Proceedings of the 2nd Int’l World Wide Web Conference, Chicago (October 1994), http://www.ncsa.uiuc.edu/SDG/IT94/Proceedings/Searching/doemel/-www-fall94.html
Excite Inc. Excite: Main page (1996), http://www.excite.com
Feng, A., Wakayama, T.: Simon: A Grammar Based Transformation System of Structured Documents. In: Proc. Int. Conf. Electronic Publishing (1994)
Franchitti, J.C., King, R.: Amalgame: a tool for creating interoperating persistent, heterogeneous components. In: Advanced Database Systems, pp. 313–336 (1993)
Garcia-Molina, H., Quass, D., Papakonstantinou, Y., Rajaraman, A., Sagiv, Y., Ullman, J.D., Widom, J.: The TSIMMIS Approach to Mediation: Data Models and Languages. A special issue of the International Journal of Intelligent Information Systems (to appear)
Hemmje, M.: Lyberworld - a visulalization user interface with full text retrieval. In: Proceedings of SIGIR 1994 (1994)
Kifer, M., Lausen, G., Wu, J.: Logical Foundations of Object-Oriented and Frame-Based Languages. JACM 42(4), 741–843 (1995)
Kirk, T., Levy, A.Y., Sagiv, Y., Srivastava, D.: The Information Manifold. In: AI Spring Symp. (1995)
Konopnicki, D., Shmueli, O.: Information Gathering in the World-Wide Web: The W3QL Query Language and the W3QS system. ACM TODS (to appear)
Konopnicki, D., Shmueli, O.: W3QS: A Query System for the World-Wide Web. In: Proceedings of 1995 VLDB Conference, Zurich, Switzerland (September 1995)
Levy, A., Rajaraman, A., Ordille, J.: The World-Wide Web as a Collection of Views: Query Processing in the Information Manifold. In: Proc. Workshop on Mate- rialized Views: Techniques and Applications, Montreal, Canada, pp. 43–55 (1996)
Levy, A.Y., Mendelzon, A.O., Sagiv, Y., Srivastava, D.: Answering Queries Using Views. In: Proc. 14th PODS (1995)
Mamrak, A., O’Connell, C.: Technical Documentation for the Integrated Chameleon Architecture: ftp.ifi.uio.no /pub/SGML/ICA (1992)
Mendelzon, A., Mihaila, G., Milo, T.: Querying the world wide. In: Proc. of PDIS (1996)
Mogilevski, P.: Integration and Translation of Heterogeneous Data. M.Sc Thesis, Tel-Aviv University (1997)
Mukherjea, S., Foley, J.D.: Visualizing the World Wide Web with the Navigational View Builder. Computer Networks and ISDN Systems 27, 1075–1087 (1995)
Papakonstantinou, Y., Garcia-Molina, H., Ullman, J.: Medmaker: A mediation system based on declarative specifications. Available at db.stanford.edu /pub/papakonstantinou/1995/medmaker.ps
Papakonstantinou, Y., Garcia-Molina, H., Widom, J.: Object exchange across heterogeneous information sources. In: Int’l Conf. on Data Engineering (1995)
Pitkow, J.E., Bharat, K.A.: WebViz: A tools for World Wide Web access log analysis. In: Proc. 1st Int’l World Wide Web Conf., Geneva, Switzerland (May 1994), http://www1.cern.ch/PapersWWW94/pitkow-webvis.ps
Quass, D., Rajaraman, A., Sagiv, Y., Ullman, J.D., Widom, J.: Querying Semi-structured Heterogeneous Information. In: Ling, T.W., Mendelzon, A.O., Vieille, L. (eds.) Proc. 4th Int. Conf., DOOD 1995, December 1995. LNCS, vol. 1013, pp. 319–344. Springer, Heidelberg (1995)
Rajaraman, A., Sagiv, Y., Ullman, J.D.: Answering Queries Using Templates With Binding Patterns. In: Proc. 14th PODS (1995)
Shoens, K., Luniewski, A., Schwartz, P., Stamos, J., Thomas, J.: The rofus system: Information organization for semi-structured data. In: Proc. of the 19th Int. conf. on Very Large Databases, VLDB 1993, pp. 97–107 (1993)
Slonim, N., Tishby, N.: Automatic statistical categorization and segmentation of text, HUJI Technical Report (to appear)
Subrahmanian, V.S., Adali, S., Brink, A., Emery, R., Lu, J., Rajput, A., Rogers, T., Ross, R., Ward, C.: HERMES: A Heterogeneous Reasoning and Mediator System. Tech. Report, U. of Maryland (1995)
Walker, J.: HTML Converters. In (1994), http://www2.ncsu.edu/bae/people/faculty/walker/hotlist/htmlconv.html
Yahoo Inc. Yahoo: Main page (1996), http://www.yahoo.com
W3QS Home Page, http://www.cs.technion.ac.il/~konop/w3qs.html
The W3QS System, http://www.cs.technion.ac.il/~W3QS
PERLCOND Home Page, http://www.cs.technion.ac.il/~konop/perlcond.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Beeri, C. et al. (1999). WebSuite—A Tool Suite for Harnessing Web Data. In: Atzeni, P., Mendelzon, A., Mecca, G. (eds) The World Wide Web and Databases. WebDB 1998. Lecture Notes in Computer Science, vol 1590. Springer, Berlin, Heidelberg. https://doi.org/10.1007/10704656_10
Download citation
DOI: https://doi.org/10.1007/10704656_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65890-0
Online ISBN: 978-3-540-48909-2
eBook Packages: Springer Book Archive