Abstract
We consider the problem of improving the efficiency of query processing on an XML interface of a relational database, for predefined query workloads. The main contribution of this paper is to show that selective materialization of data as XML views reduces query-execution costs in relatively static databases. Our learning-based approach precomputes and stores (materializes) parts of the answers to the workload queries as clustered XML views. In addition, the data in the materialized XML clusters are periodically incrementally refreshed and rearranged, to respond to the changes in the query workload. Our experiments show that the approach can significantly reduce processing costs for frequent and important queries on relational databases with XML interfaces.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chen, J., Chen, S., Rundensteiner, E.A.: A transactional model for data warehouse maintenance. In: Spaccapietra, S., March, S.T., Kambayashi, Y. (eds.) ER 2002. LNCS, vol. 2503, p. 247. Springer, Heidelberg (2002)
Chen, L., Rundensteiner, E.A., Wang, S.: XCache: A semantic caching system for XML queries. In: Proc. 2002 ACM SIGMOD International Conference on Management of Data (2002)
Claypool, K.T., Rundensteiner, E.A., Zhang, X., Su, H., Kuno, H.A., Lee, W.C., Mitchell, G.: Gangam — a solution to support multiple data models, their mappings and maintenance. In: Proc. 2001 ACM SIGMOD International Conference on Management of Data (2001)
Chen, L., Wang, S., Cash, E., Ryder, B., Hobbs, I., Rundensteiner, E.A.: A fine-grained replacement strategy for XML query cache. In: Proc. Fourth ACM CIKM International Workshop on Web Information and Data Management (WIDM 2002), pp. 76–83 (2002)
Chen, J., Zhang, X., Chen, S., Koeller, A., Rundensteiner, E.A.: DyDa: Data warehouse maintenance in fully concurrent environments. In: Proc. ACM SIGMOD (2001)
Embley, D.W., Mok, W.Y.: Developing XML Documents with Guaranteed “Good” Properties. In: Kunii, H.S., Jajodia, S., Sølvberg, A. (eds.) ER 2001. LNCS, vol. 2224, pp. 426–441. Springer, Heidelberg (2001)
Filha, I.M.R.E., da Silva, A.S., Laender, A.H.F., Embley, D.W.: Using nested tables for representing and querying semistructured web data. In: Pidduck, A.B., Mylopoulos, J., Woo, C.C., Ozsu, M.T. (eds.) CAiSE 2002. LNCS, vol. 2348, p. 719. Springer, Heidelberg (2002)
Fernandez, M., Kadiyska, Y., Suciu, D., Morishima, A., Tan, W.C.: SilkRoute: A framework for publishing relational data in XML. ACM Trans. Database Systems 27(4), 438–493 (2002)
Ioannidis, Y.E.: Query optimization. In: Tucker, A.B. (ed.) The Computer Science and Engineering Handbook, pp. 1038–1057. CRC Press, Boca Raton (1997)
Liu, Z., Li, F., Ng, W.K.: Wiccap data model: Mapping physical websites to logical views. In: Spaccapietra, S., March, S.T., Kambayashi, Y. (eds.) ER 2002. LNCS, vol. 2503, p. 120. Springer, Heidelberg (2002)
Mengchi, L.: A logical foundation for XML. In: Proc. Advanced Information Systems Engineering, 14th International Conference (CAiSE 2002), pp. 568–583 (2002)
Mitchell, T.M.: Generalization as search. Artificial Intelligence 18, 203–226 (1982)
Mengchi, L., Ling, T.W.: Towards declarative XML querying. In: Proc. 3rd International Conference on Web Information Systems Engineering (WISE 2002), pp. 127–138 (2002)
Passi, K., Lane, L., Madria, S.K., Sakamuri, B.C., Mohania, M.K., Bhowmick, S.S.: A model for XML schema integration. In: Proc. 3rd Int’l Conf. E-Commerce and Web Technologies (2002)
Psaila, G.: ERX: An experience in integrating entity-relationship models, relational databases, and XML technologies. In: Proc. XML-Based Data Management and Multimedia Engineering EDBT workshop (2002)
Shanmugasundaram, J., Kiernan, J., Shekita, E.J., Fan, C., Funderburk, J.: Querying XML views of relational data. In: Proc. 27th Int’l Conference on Very Large Data Bases (2001)
Widom, J.: Research problems in data warehousing. In: Proc. Fourth International Conference on Information and Knowledge Management, pp. 25–30 (1995)
Extensible Markup Language (XML), http://www.w3.org/XML
Zhang, X., Ding, L., Rundensteiner, E.A.: Parallel multi-source view maintenance. VLDB Journal: Very Large DataBases (2003) (to appear)
Zhang, X., Mulchandani, M., Christ, S., Murphy, B., Rundensteiner, E.A.: Rainbow: mapping-driven XQuery processing system. In: Proc. ACM SIGMOD (2002)
Zhang, X., Rundensteiner, E.A.: Integrating the maintenance and synchronization of data warehouses using a cooperative framework. Information Systems 27, 219–243 (2002)
The CDDB database, http://www.freedb.org
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shah, A., Chirkova, R. (2003). Improving Query Performance Using Materialized XML Views: A Learning-Based Approach. In: Jeusfeld, M.A., Pastor, Ó. (eds) Conceptual Modeling for Novel Application Domains. ER 2003. Lecture Notes in Computer Science, vol 2814. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39597-3_30
Download citation
DOI: https://doi.org/10.1007/978-3-540-39597-3_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20257-8
Online ISBN: 978-3-540-39597-3
eBook Packages: Springer Book Archive