Skip to main content
Log in

Preserving key in XML data transformation

  • Original Article
  • Published:
Acta Informatica Aims and scope Submit manuscript

Abstract

Transformation of XML data is an important task in data exchange, data publishing and data integration. Specifically in data integration, data in XML sources is transformed to match the target schema. Some of these sources have XML keys defined. When the data is transformed, the keys also need to be transformed for constraint comparisons, consistency checking and unification in the target schema. Thus, how the keys are transformed, and whether the transformed keys are valid and preserved to the target schema are important problems in XML data transformation and integration. Towards this problem, we firstly define XML keys and their satisfactions. We then study how keys are transformed and whether transformed keys are valid when a source schema is transformed to a target schema. Finally we show whether the transformed keys are satisfied by the transformed document.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Jiang, H., Ho, H., Popa, L., Han, W.: Mapping-driven XML transformation. In: WWW, pp. 1063–1072 (2007)

  2. Zamboulis, L., Poulovassilis, A.: Using automed for XML data transformation and integration. In: DIWeb, pp. 58–69 (2004)

  3. Zamboulis, L.: XML data integration by graph restructuring. In: BNCOD, pp. 57–71 (2004)

  4. Su, H., Kuno, H., Rudensteiner, E.A.: Automating the transformation of XML documents. In: WIDM, pp. 68–75 (2001)

  5. Erwig, M.: Toward the automatic derivation of XML transformations. In: ER, pp. 342–354 (2003)

  6. Liu, J., Park, H., Vincent, M., Liu, C.: A formalism of XML restructuring operations. In: ASWC, Lecture Notes in Computer Sciences, vol. 4185, pp. 126–132 (2006)

  7. Barbosa, D., Freire, J., Mendelzon, A.O.: Information preservation in XML-to-relational mappings. In: XSym, Lecture Notes in Computer Sciences, vol. 3186, pp. 66–81 (2004)

  8. Lee, D., Chu, W.W.: Constraint preserving transformation from XML document type definition to relational schema. In: ER, Lecture Notes in Computer Sciences, vol. 1920, pp. 323–338 (2000)

  9. Lu, Z., Zhang, S.: Data integration based WWW with XML and CORBA. In: ICWE, Lecture Notes in Computer Sciences, vol. 2722, pp. 455–458 (2003)

  10. Pan, W., Liu, J., Tian, J.: An implementation of XML data integration. In: ICEIS, pp. 111–116 (2008)

  11. Mlynkova, I., Pokorn, J.: UserMap: an adaptive enhancing of user-driven XML-to-relational mapping strategies. In: ADC, pp. 165–174 (2008)

  12. Liu, J., Park, H.-H., Vincent M., Liu, C.: Transformation of XML data, extended version. http://www.cis.unisa.edu.au/~cisjl/publications/publ.html

  13. Cali, A., Calvanese, D., Giacomo, G.D., Lenzerini, M.: Data integration under integrity constraints. In: CAISE, Lecture Notes in Computer Sciences, vol. 2348, pp. 262–279 (2002)

  14. Li, C.: Describing and utilizing constraints to answer queries in data integration systems. In: IIWeb (2003)

  15. Christiansen, H., Martinenghi, D.: Simplification of integrity constraints for data integration. In: FoIKs, Lecture Notes in Computer Sciences, vol. 2942, pp. 31–48 (2004)

  16. Turker, C., Saake, G.: Consistent handling of integrity constraints and extensional assertions for schema integration. In: ADBIS, Lecture Notes in Computer Sciences, vol. 1691, pp. 31–45 (1999)

  17. Cali, A., Calvanese, D., Giacomo, G.D., Lenzerini, M.: On the expressive power of data integration systems. In: ER, Lecture Notes in Computer Sciences, vol. 2503, pp. 338–350 (2002)

  18. Cali, A., Calvanese, D., Giacomo, G.D., Lenzerini, M.: On the role of integrity constraints in data integration. In: Bulletin of the IEEE Computer Society Technical Committee on Data Engineering (2002)

  19. Fuxman, A., Miller, R.e.J.: Towards inconsistency management in data integration systems. In: IIWeb (2003)

  20. Klug, A.: Calculating constraints on relational expressions. In: ACM TODS, vol. 5(3), pp. 260–290 (1980)

  21. Bray, T., Paoli, J., Sperberg-McQueen, C.M.: Extensible markup language (XML) 1.0., World Wide Web Consortium (W3C). http://www.w3.org/TR/REC-xml (1998). Accessed Feb 1998

  22. Benedikt, M., Chan, C.Y., Fan, W., Freire, J., Rastogy, R.: Capturing both types and constraints in data integration. In: ACM SIGMOD, pp. 277–288 (2003)

  23. Popa, L., Velegrakis, Y., Miller, R.J., Hernandez, M.A., Fagin, R.: Translating the web data. In: VLDB, pp. 598–609 (2002)

  24. Tian, J., Liu, J., Pan, W., Vincent, M.W., Liu, C.: Performance analysis and improvement for transformation operators in XML data integration. In: APWeb, pp. 214–226 (2008)

  25. Buneman P., Davidson S.B., Fan W., Hara C.S., Tan W.C.: Keys for XML. Comput. Netw. 5(39), 473–487 (2002)

    Article  Google Scholar 

  26. Thompson, H.S., Beech, D., Maloney, M., Mendelsohn, N.: XML schema part 1: structures, W3C working draft. http://www.w3.org/TR/xmlschema-1/ (2000). Accessed April 2000

  27. Lenzerini, M.: Data integration: a theoretical perspective. In: ACM PODS, pp. 233–246 (2002)

  28. Halevy, A.Y., Rajaraman, A., Ordille, J.J.: Data integration: the teenage years. In: VLDB, pp. 9–16 (2006)

  29. Davidson, S., Fan, W., Hara, C., Qin, J.: Propagating XML constraints to relations. In: ICDE, pp. 543–554 (2003)

  30. Liu, Y., Zhong, H., Wang, Y.: XML constraints preservation in relational schema. In: CEC-East (2004)

  31. Wang, Q., Wu, H., Xiao, J., Zhou, A.: Deriving relation keys from XML keys. In: ADC (2003)

  32. Liu, Y., Zhong, H., Wang, Y.: Capturing XML constraints with relational schema. In: CIT (2004)

  33. Liu, C., Vincent, M.W., Liu, J.: Constraint preserving transformation from relational schema to XML schema. In: World Wide Web: Internet and Web Information Systems, vol. 9, pp. 93–110 (2006)

  34. Deutsch, A., Tannen, V.: MARS: A system for publishing XML from mixed and redundant storage. In: VLDB (2003)

  35. Suciu, D.: On database theory and XML. In: SIGMOD Record, vol. 30(2), pp. 39–45 (2001)

  36. Buneman, P., Fan, W., Simeon, J., Weinstein, S.: Constraints for semistructured data and XML. In: SIGMOD Record, pp. 47–54 (2001)

  37. Fan, W.: XML constraints: specification, analysis, and applications. In: DEXA, pp. 805–809 (2005)

  38. Poggi, A., Abiteboul, S.: XML data integration with identification. In: DBPL, pp. 106–121 (2005)

  39. Buneman P., Davidson S.B., Fan W., Hara C.S., Tan W.C.: Reasoning about keys for XML. Inform. Syst. 8(28), 1037–1063 (2003)

    Article  Google Scholar 

  40. Fan, W., Simeon, J.: Integrity constraints for XML. In: PODS, pp. 23–34 (2000)

  41. Fan W., Libkin L.: On XML integrity constraints in the presence of DTDs. J. ACM 49, 368–406 (2002)

    Article  MathSciNet  Google Scholar 

  42. Arenas, M., Fan, W., Libkin, L.: On verifying consistency of XML specifications. In: ACM PODS, pp. 259–270 (2002)

  43. Arenas, M., Fan, W., Libkin, L.: What’s hard about XML schema constraints? In: DEXA, Lecture Notes in Computer Sciences, vol. 2453, pp. 269–278 (2002)

  44. Java Document Object Model (DOM). http://java.sun.com/j2se/1.4.2/docs/api/org/w3c/dom/package-summary.html

  45. Koloniari, G., Pitoura, E.: Peer-to-peer management of XML data: issues and research challenges. In: SIGMOD Record, vol. 34(2), pp. 6–17 (2005)

  46. Gribble, S.D., Halevy, A.Y., Ives, Z.G., Rodrig, M., Suciu, D.: What can database do for peer-to-peer? In: WebDB (2001)

  47. Wei, W., Liu, M., Li, S.: Merging of XML documents. In: ER, Lecture Notes in Computer Sciences, vol. 3288, pp. 273–285 (2004)

  48. Lim, E., Prabhakar, S., Srivastava, J., Richardson, J.: Entity identification in database integration. In: ICDE, pp. 294–301 (1993)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Md. Sumon Shahriar.

Additional information

The work is under the Australian Research Council (ARC) Discovery Project (DP) Fund.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shahriar, M.S., Liu, J. Preserving key in XML data transformation. Acta Informatica 46, 475–507 (2009). https://doi.org/10.1007/s00236-009-0101-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00236-009-0101-z

Keywords

Navigation