Skip to main content
Log in

Query processing with quality control in the World Wide Web

  • Published:
World Wide Web Aims and scope Submit manuscript

Abstract

Recent research on integrating database and World Wide Web (WWW) technologies has changed the navigation approach to searching information in the Web. People now can issue queries via a simple query interface or a database‐like query language to retrieve information from semistructured WWW data sources. However, the quality of query processing in the WWW is still low due to many factors such as unpredictable response time, irrelevant results, and out‐of‐date data. Such low‐quality query processing is intolerable to either users or service providers. In this paper, we present a quality‐controlled query processing method in the WWW. Quality parameters that users can specify with their queries are introduced. Distance functions that are used to evaluate the goodness of query quality parameters are defined. A query processing model with quality control is introduced. A quality control protocol in query processing is presented. Quality‐controlled query scheduling algorithms including admission scheduling, promotion/demotion scheduling and execution scheduling are proposed. Other relevant issues such as query classification, system parameter estimation, and query queue management are also discussed. Query processing with quality control is a promising way to solve the uncertain and low‐quality query processing problems in the WWW.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Ashish, N. and C. Knoblock (1997), “Wrapper Generation for Semi-Structured Internet Sources,” In The PODS/SIGMODWorkshop on Management of Semistructured Data, Tucson, AZ.

  • Berners-Lee, T. (1996), “WWW: Past, Present, and Future,” IEEE Computer29, 10, 69–77.

    Google Scholar 

  • Berners-Lee, T. and D. Conolly (1995), “Hypertext Markup Language – 2.0,” IETF RFC 1866.

  • Bray, T.(1996), “Measuring the Web,” Computer Networks and ISDN Systems 28, 7/11, 993–1005.

  • Chen, Y., H. Xu, and N. Wang(1998), “WWWDS: Towards Globalization of Distributed Data Sources over WWW,” Chinese Journal of Software 9, 8, 566–573.

    Google Scholar 

  • Chen, Y., B. Xu, and N. Wang (1998), “WebCORD: A Collaborative Resource Discovery System Model in Web,” Chinese Journal ofComputer 21, 4, 381–384.

    Google Scholar 

  • Fiebig, T., J. Weiss, and G. Moerkotte (1997), “RAW: A Relational Algebra for the Web,” In ThePODS/SIGMOD Workshop on Management of Semistructured Data, Tucson, AZ.

  • Gosinski, T. and S. Avila (1997), “Implementing aRegional Traffic Data Management System,” In Proceedings of the Annual Conference of the Urban and Regional Information Systems Association, Washington, DC, pp. 735–743.

  • Hammer, J., H. Garica-Molina, J. Cho, R. Aranha, and A. Crespo (1997), “ExtractingSemistructured Information from the Web,” In The PODS/SIGMOD Workshop on Management of Semistructured Data, Tucson, AZ.

  • Kim, K.,J. Kim, J. Choi, and M. Sung (1995), “Application of GIS to Water Quality Management,” In GIS/LIS '95 Annual Conference and Exposition, Vol. 2, Nashville, TN, pp. 554–562.

    Google Scholar 

  • Konopnicki, D. and O. Shmueli (1995), “W3QS: A Query System for the World-Wide Web,” In Proceedings of the 21st International Conference on Very Large DataBases, Morgan Kaufmann, Zurich, Switzerland, pp. 54–65.

    Google Scholar 

  • Lamm, S.E., D.A. Reed, and W.H. Scullin (1996), “Real-Time Geographic Visualization of World Wide Web Traffic,”Computer Networks and ISDN Systems 28, 7/11, 1457–1468.

    Google Scholar 

  • Levy, A.Y., A. Rajaraman, and J.J. Ordille (1996), “QueryingHeterogeneous Information Sources Using Source Descriptions,” In Proceedings of the 22nd International Conference on Very Large DataBases, Morgan Kaufmann, Mumbai, IN, pp. 251–262.

    Google Scholar 

  • Mendelzon, A.O., G.A. Mihaila, and T. Milo (1996), “Querying the World WideWeb,” In Proceedings of the 4th IEEE International Conference on Parallel and Distributed Information Systems, Miami Beach, FL, pp. 80–91.

  • Vrbsky, S.V. (1994), “A Data Model for Approximate Query Processing of Real-Time Database,” Data & Knowledge Engineering 21,1, 79–102.

    Google Scholar 

  • Winder, D. (1997), “Internet Search Engines,” PC Pro 34, 212–219.

  • Woodruff, A., P.M. Aoki, E. Brewer, P. Gauthier, and L.A. Wowe (1996), “An Investigation of Documents from the World Wide Web,” Computer Networks and ISDN Systems 28, 7/11, 963–980.

    Google Scholar 

  • Yuwono, B. and D.L. Lee (1996), “Searching and Ranking Algorithms for Locating Resources on the World WideWeb,” In Proceedings of the 12nd IEEE International Conference on Data Engineering, New Orleans, LA, pp. 164–171.

  • Zhu, Q. andP.-Å. Larson (1994), “A Query Sampling Method for Estimating Local Cost Parameters in a Multidatabase System,” In Proceedings of the 10th IEEE International Conference on Data Engineering, Houston, TX, pp. 144–153.

  • Zhu, Q. and P.-Å. Larson (1996), “BuildingRegression Cost Models for Multidatabase Systems,” In Proceedings of the 4th IEEE International Conference on Parallel and Distributed Information Systems, Miami Beach, FL, pp. 220–231.

  • Zhu, Q. and P.-Å. Larson (1998), “Solving Local Cost Estimation Problem forGlobal Query Optimization in Multidatabase Systems,” Distributed and Parallel Databases 6, 4, 373–420.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, Y., Zhu, Q. & Wang, N. Query processing with quality control in the World Wide Web. World Wide Web 1, 241–255 (1998). https://doi.org/10.1023/A:1019280102006

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1019280102006

Keywords

Navigation