Abstract
Transformation of XML data is an important task in data exchange, data publishing and data integration. Specifically in data integration, data in XML sources is transformed to match the target schema. Some of these sources have XML keys defined. When the data is transformed, the keys also need to be transformed for constraint comparisons, consistency checking and unification in the target schema. Thus, how the keys are transformed, and whether the transformed keys are valid and preserved to the target schema are important problems in XML data transformation and integration. Towards this problem, we firstly define XML keys and their satisfactions. We then study how keys are transformed and whether transformed keys are valid when a source schema is transformed to a target schema. Finally we show whether the transformed keys are satisfied by the transformed document.
Similar content being viewed by others
References
Jiang, H., Ho, H., Popa, L., Han, W.: Mapping-driven XML transformation. In: WWW, pp. 1063–1072 (2007)
Zamboulis, L., Poulovassilis, A.: Using automed for XML data transformation and integration. In: DIWeb, pp. 58–69 (2004)
Zamboulis, L.: XML data integration by graph restructuring. In: BNCOD, pp. 57–71 (2004)
Su, H., Kuno, H., Rudensteiner, E.A.: Automating the transformation of XML documents. In: WIDM, pp. 68–75 (2001)
Erwig, M.: Toward the automatic derivation of XML transformations. In: ER, pp. 342–354 (2003)
Liu, J., Park, H., Vincent, M., Liu, C.: A formalism of XML restructuring operations. In: ASWC, Lecture Notes in Computer Sciences, vol. 4185, pp. 126–132 (2006)
Barbosa, D., Freire, J., Mendelzon, A.O.: Information preservation in XML-to-relational mappings. In: XSym, Lecture Notes in Computer Sciences, vol. 3186, pp. 66–81 (2004)
Lee, D., Chu, W.W.: Constraint preserving transformation from XML document type definition to relational schema. In: ER, Lecture Notes in Computer Sciences, vol. 1920, pp. 323–338 (2000)
Lu, Z., Zhang, S.: Data integration based WWW with XML and CORBA. In: ICWE, Lecture Notes in Computer Sciences, vol. 2722, pp. 455–458 (2003)
Pan, W., Liu, J., Tian, J.: An implementation of XML data integration. In: ICEIS, pp. 111–116 (2008)
Mlynkova, I., Pokorn, J.: UserMap: an adaptive enhancing of user-driven XML-to-relational mapping strategies. In: ADC, pp. 165–174 (2008)
Liu, J., Park, H.-H., Vincent M., Liu, C.: Transformation of XML data, extended version. http://www.cis.unisa.edu.au/~cisjl/publications/publ.html
Cali, A., Calvanese, D., Giacomo, G.D., Lenzerini, M.: Data integration under integrity constraints. In: CAISE, Lecture Notes in Computer Sciences, vol. 2348, pp. 262–279 (2002)
Li, C.: Describing and utilizing constraints to answer queries in data integration systems. In: IIWeb (2003)
Christiansen, H., Martinenghi, D.: Simplification of integrity constraints for data integration. In: FoIKs, Lecture Notes in Computer Sciences, vol. 2942, pp. 31–48 (2004)
Turker, C., Saake, G.: Consistent handling of integrity constraints and extensional assertions for schema integration. In: ADBIS, Lecture Notes in Computer Sciences, vol. 1691, pp. 31–45 (1999)
Cali, A., Calvanese, D., Giacomo, G.D., Lenzerini, M.: On the expressive power of data integration systems. In: ER, Lecture Notes in Computer Sciences, vol. 2503, pp. 338–350 (2002)
Cali, A., Calvanese, D., Giacomo, G.D., Lenzerini, M.: On the role of integrity constraints in data integration. In: Bulletin of the IEEE Computer Society Technical Committee on Data Engineering (2002)
Fuxman, A., Miller, R.e.J.: Towards inconsistency management in data integration systems. In: IIWeb (2003)
Klug, A.: Calculating constraints on relational expressions. In: ACM TODS, vol. 5(3), pp. 260–290 (1980)
Bray, T., Paoli, J., Sperberg-McQueen, C.M.: Extensible markup language (XML) 1.0., World Wide Web Consortium (W3C). http://www.w3.org/TR/REC-xml (1998). Accessed Feb 1998
Benedikt, M., Chan, C.Y., Fan, W., Freire, J., Rastogy, R.: Capturing both types and constraints in data integration. In: ACM SIGMOD, pp. 277–288 (2003)
Popa, L., Velegrakis, Y., Miller, R.J., Hernandez, M.A., Fagin, R.: Translating the web data. In: VLDB, pp. 598–609 (2002)
Tian, J., Liu, J., Pan, W., Vincent, M.W., Liu, C.: Performance analysis and improvement for transformation operators in XML data integration. In: APWeb, pp. 214–226 (2008)
Buneman P., Davidson S.B., Fan W., Hara C.S., Tan W.C.: Keys for XML. Comput. Netw. 5(39), 473–487 (2002)
Thompson, H.S., Beech, D., Maloney, M., Mendelsohn, N.: XML schema part 1: structures, W3C working draft. http://www.w3.org/TR/xmlschema-1/ (2000). Accessed April 2000
Lenzerini, M.: Data integration: a theoretical perspective. In: ACM PODS, pp. 233–246 (2002)
Halevy, A.Y., Rajaraman, A., Ordille, J.J.: Data integration: the teenage years. In: VLDB, pp. 9–16 (2006)
Davidson, S., Fan, W., Hara, C., Qin, J.: Propagating XML constraints to relations. In: ICDE, pp. 543–554 (2003)
Liu, Y., Zhong, H., Wang, Y.: XML constraints preservation in relational schema. In: CEC-East (2004)
Wang, Q., Wu, H., Xiao, J., Zhou, A.: Deriving relation keys from XML keys. In: ADC (2003)
Liu, Y., Zhong, H., Wang, Y.: Capturing XML constraints with relational schema. In: CIT (2004)
Liu, C., Vincent, M.W., Liu, J.: Constraint preserving transformation from relational schema to XML schema. In: World Wide Web: Internet and Web Information Systems, vol. 9, pp. 93–110 (2006)
Deutsch, A., Tannen, V.: MARS: A system for publishing XML from mixed and redundant storage. In: VLDB (2003)
Suciu, D.: On database theory and XML. In: SIGMOD Record, vol. 30(2), pp. 39–45 (2001)
Buneman, P., Fan, W., Simeon, J., Weinstein, S.: Constraints for semistructured data and XML. In: SIGMOD Record, pp. 47–54 (2001)
Fan, W.: XML constraints: specification, analysis, and applications. In: DEXA, pp. 805–809 (2005)
Poggi, A., Abiteboul, S.: XML data integration with identification. In: DBPL, pp. 106–121 (2005)
Buneman P., Davidson S.B., Fan W., Hara C.S., Tan W.C.: Reasoning about keys for XML. Inform. Syst. 8(28), 1037–1063 (2003)
Fan, W., Simeon, J.: Integrity constraints for XML. In: PODS, pp. 23–34 (2000)
Fan W., Libkin L.: On XML integrity constraints in the presence of DTDs. J. ACM 49, 368–406 (2002)
Arenas, M., Fan, W., Libkin, L.: On verifying consistency of XML specifications. In: ACM PODS, pp. 259–270 (2002)
Arenas, M., Fan, W., Libkin, L.: What’s hard about XML schema constraints? In: DEXA, Lecture Notes in Computer Sciences, vol. 2453, pp. 269–278 (2002)
Java Document Object Model (DOM). http://java.sun.com/j2se/1.4.2/docs/api/org/w3c/dom/package-summary.html
Koloniari, G., Pitoura, E.: Peer-to-peer management of XML data: issues and research challenges. In: SIGMOD Record, vol. 34(2), pp. 6–17 (2005)
Gribble, S.D., Halevy, A.Y., Ives, Z.G., Rodrig, M., Suciu, D.: What can database do for peer-to-peer? In: WebDB (2001)
Wei, W., Liu, M., Li, S.: Merging of XML documents. In: ER, Lecture Notes in Computer Sciences, vol. 3288, pp. 273–285 (2004)
Lim, E., Prabhakar, S., Srivastava, J., Richardson, J.: Entity identification in database integration. In: ICDE, pp. 294–301 (1993)
Author information
Authors and Affiliations
Corresponding author
Additional information
The work is under the Australian Research Council (ARC) Discovery Project (DP) Fund.
Rights and permissions
About this article
Cite this article
Shahriar, M.S., Liu, J. Preserving key in XML data transformation. Acta Informatica 46, 475–507 (2009). https://doi.org/10.1007/s00236-009-0101-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00236-009-0101-z