Skip to main content

The DaQuinCIS Broker: Querying Data and Their Quality in Cooperative Information Systems

  • Chapter
Journal on Data Semantics I

Abstract

In cooperative information systems, the quality of data exchanged and provided by different data sources is extremely important. A lack of attention to data quality can imply data of low quality to spread all over the cooperative system. At the same time, improvement can be based on comparing data, correcting them and disseminating high quality data. In this paper, a framework and a related architecture for managing data quality in cooperative information systems is proposed, as developed in the context of the DaQuinCIS research project. Then the focus concentrates (i) on an XML-based model for data and quality data, and (ii) on the design of a broker, which selects the best available data from different sources; such a broker also supports the improvement of data based on feedbacks to data sources. The broker is the basic component of the DaQuinCIS architecture.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Batini, C., Mecella, M.: Enabling Italian e-Government Through a Cooperative Architecture. IEEE Computer 34(2) (2001)

    Google Scholar 

  2. Lenzerini, M.: Data Integration: A Theoretical Perspective. In: Proceedings of the 21st ACM Symposium on Principles of Database Systems (PODS 2002), Madison, Wisconsin, USA (2002)

    Google Scholar 

  3. Marchetti, C., Mecella, M., Scannapieco, M., Virgillito, A., Baldoni, R.: Data Quality Notification in Cooperative Information Systems. In: Proceedings of the First International Workshop on Data Quality in Cooperative Information Systems, Siena, Italy (2003)

    Google Scholar 

  4. De Santis, L., Scannapieco, M., Catarci, T.: A Trust Model for Tightly Coupled P2P Systems. In: Proceedings of the 11o Convegno Nazionale su Sistemi Evoluti per Basi di Dati (SEBD 2003), Cetraro (CS), Italy (2003)

    Google Scholar 

  5. Bertolazzi, P., De Santis, L., Scannapieco, M.: Automatic Record Matching in Cooperative Information Systems. In: Proceedings of the ICDT 2003 International Workshop on Data Quality in Cooperative Information Systems (DQCIS 2003), Siena, Italy (2003)

    Google Scholar 

  6. Redman, T.C.: Data Quality for the Information Age. Artech House (1996)

    Google Scholar 

  7. Wand, Y., Wang, R.Y.: Anchoring Data Quality Dimensions in Ontological Foundations. Communications of the ACM 39(11) (1996)

    Google Scholar 

  8. Hall, P.A.V., Dowling, G.R.: Approximate String Matching. ACM Computing Surveys 12(4) (1980)

    Google Scholar 

  9. Pipino, L.L., Lee, Y.W., Wang, R.Y.: Data Quality Assessment. Communications of the ACM 45(4) (2002)

    Google Scholar 

  10. Ballou, D.P., Wang, R.Y., Pazer, H., Tayi, G.K.: Modeling Information Manufacturing Systems to Determine Information Product Quality. Management Science 44(4) (1998)

    Google Scholar 

  11. Missier, P., Scannapieco, M., Batini, C.: Cooperative Architectures: Introducing Data Quality. Technical Report 14-2001, Dipartimento di Informatica e Sistemistica, Università di Roma “La Sapienza”, Roma, Italy (2001)

    Google Scholar 

  12. Tansell, A., Snodgrass, R., Clifford, J., Gadia, S., Segev, A. (eds.): Temporal Databases. Benjamin-Cummings, Redwood City (1993)

    Google Scholar 

  13. Codd, E.F.: Relational Database: a Practical Foundation for Productivity (1981 ACM Turing Award Lecture. Communications of the ACM 25(2) (1982)

    Google Scholar 

  14. Bruni, R., Sassano, A.: Errors Detection and Correction in Large Scale Data Collecting. In: Proceedings of the 4th International Conference on Advances in Intelligent Data Analysis, Cascais, Portugal (2001)

    Google Scholar 

  15. Deutsch, A., Fernandez, M., Florescu, D., Levy, A., Suciu, D.: XML-QL: A Query Language for XML. In: Proceedings of the 8th International World Wide Web Conference (WWW8), Toronto, Canada (1999)

    Google Scholar 

  16. Cappiello, C., Francalanci, C., Pernici, B., Plebani, P., Scannapieco, M.: Data Quality Assurance in Cooperative Information Systems: a Multi-dimension Quality Certificate. In: Proceedings of the ICDT 2003 International Workshop on Data Quality in Cooperative Information Systems (DQCIS 2003), Siena, Italy (2003)

    Google Scholar 

  17. Ullman, J.D.: Information Integration Using Logical Views. In: Proceedings of the Sixth International Conference on Database Theory (ICDT 1997), Delphi, Greece (1997)

    Google Scholar 

  18. Boag, S., Chamberlin, D., Fernandez, M.F., Florescu, D., Robie, J., Simeon, J.: XQuery 1.0: An XML Query Language,W3C Working Draft (November 2002)

    Google Scholar 

  19. Milano, D.: Progetto DaQuinCIS: Estensione di XQuery e Query processing in un Sistema di Integrazione Basato sulla Qualità dei Dati, Tesi di Laurea in Ingegneria Informatica, Università di Roma “La Sapienza”, Facoltà di Ingegneria (2003) (in Italian); the thesis is available by writing an e-mail to: monscan@dis.uniromai.it

    Google Scholar 

  20. Yan, L.L., Ozsu, M.T.: Conflict Tolerant Queries in AURORA. In: Proceedings of the Fourth International Conference on Cooperative Information Systems (CoopIS 1999), Edinburgh, Scotland, UK (1999)

    Google Scholar 

  21. Hernandez, M.A., Stolfo, S.J.: Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem. Journal of Data Mining and Knowledge Discovery 1(2) (1998)

    Google Scholar 

  22. Hwang, C., Yoon, K.: Multiple Attribute Decision Making. Lectures Notes in Economics and Mathematical Systems. Springer 186, Heidelberg (1981)

    Google Scholar 

  23. Saaty, T.L.: The Analytic Hierarchy Process. McGraw-Hill, New York (1980)

    MATH  Google Scholar 

  24. Palmieri, G.: Progetto DaQuinCIS: Architettura Basata su Tecnologie Web Service ed Ottimizzazione di Query XML, Tesi di Laurea in Ingegneria Informatica, Università di Roma “La Sapienza”, Facoltà di Ingegneria (2003) (in Italian); the thesis is available by writing an e-mail to: monscan@dis.uniromai.it

    Google Scholar 

  25. Buchmann, A., Casati, F., Fiege, L., Hsu, M.C., Shan, M.C. (eds.): Proceedings of the 3nd VLDB International Workshop on Technologies for e-Services (VLDB-TES 2002), Hong Kong, China (2002)

    Google Scholar 

  26. Chandra, T.D., Toueg, S.: Unreliable failure detectors for reliable distributed systems. Journal of the ACM (JACM) 43(2), 225–267 (1996)

    Article  MATH  MathSciNet  Google Scholar 

  27. Fischer, M.J., Lynch, N.A., Paterson, M.S.: Impossibility of distributed consensus with one faulty process. Journal ofthe ACM (JACM) 32(2), 374–382 (1985)

    Article  MATH  MathSciNet  Google Scholar 

  28. Guerraoui, R., Schiper, A.: Software-Based Replication for Fault Tolerance. IEEE Computer, 30 (April 1997)

    Google Scholar 

  29. Naumann, F., Leser, U., Freytag, J.C.: Quality-driven Integration of Heterogenous Information Systems. In: Proceedings of 25th International Conference on Very Large Data Bases (VLDB 1999), Edinburgh, Scotland, UK (1999)

    Google Scholar 

  30. Bertolazzi, P., Scannapieco, M.: Introducing Data Quality in a Cooperative Context. In: Proceedings of the 6th International Conference on Information Quality (IQ 2001), Boston, MA, USA (2001)

    Google Scholar 

  31. Berti-Equille, L.: Quality-Extended Query Processing for Distributed Processing. In: Proceedings of the ICDT 2003 International Workshop on Data Quality in Cooperative Information Systems (DQCIS 2003), Siena, Italy (2003)

    Google Scholar 

  32. Galhardas, H., Florescu, D., Shasha, D., Simon, E.: An Extensible Framework for Data Cleaning. In: Proceedings of the 16th International Conference on Data Engineering (ICDE 2000), San Diego, CA, USA (2000)

    Google Scholar 

  33. Jarke, M., Lenzerini, M., Vassiliou, Y., Vassiliadis, P. (eds.): Fundamentals of Data Warehouses. Springer, Heidelberg (1999)

    Google Scholar 

  34. Wang, R.Y., Kon, H.B., Madnick, S.E.: Data Quality Requirements: Analysis and Modeling. In: Proceedings of the 9th International Conference on Data Engineering (ICDE 1993), Vienna, Austria (1993)

    Google Scholar 

  35. Wang, R.Y., Ziad, M., Lee, Y.W.: Data Quality. Kluwer Academic Publishers, Dordrecht (2001)

    MATH  Google Scholar 

  36. Mihaila, G., Raschid, L., Vidal, M.: Querying Quality of Data Metadata. In: Proceedings of the 6th International Conference on Extending Database Technology (EDBT 1998), Valencia, Spain (1998)

    Google Scholar 

  37. Sattler, K., Conrad, S., Saake, G.: Interactive Example-driven Integration and Reconciliation for Accessing Database Integration. Information systems 28 (2003)

    Google Scholar 

  38. Fan, K., Lu, H., Madnick, S.E., Cheung, D.: Discovering and Reconciling Value Conflicts for Numerical Data Integration. Information systems 28 (2003)

    Google Scholar 

  39. The COntext INterchange (COIN) Project (1996-1999), http://context.mit.edu/~coin/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Mecella, M., Scannapieco, M., Virgillito, A., Baldoni, R., Catarci, T., Batini, C. (2003). The DaQuinCIS Broker: Querying Data and Their Quality in Cooperative Information Systems. In: Spaccapietra, S., March, S., Aberer, K. (eds) Journal on Data Semantics I. Lecture Notes in Computer Science, vol 2800. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39733-5_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-39733-5_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-20407-7

  • Online ISBN: 978-3-540-39733-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics