Reference Architectures to Measure Data Completeness across Integrated Databases

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7802)

Abstract

Completeness is an important aspect of data quality, and to determine whether data are acceptable one needs to measure the completeness of the data set concerned. One type of data completeness measure is population-based completeness (PBC). However, the notion of PBC will be of little use until we can determine the effort, in terms of architectural design, required to implement it. In this paper, we present the types of PBC system reference architecture involving integrated databases and motivate the selection of each.

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Emran, N.A., Embury, S., Missier, P., Ahmad, N. (2013). Reference Architectures to Measure Data Completeness across Integrated Databases. In: Selamat, A., Nguyen, N.T., Haron, H. (eds) Intelligent Information and Database Systems. ACIIDS 2013. Lecture Notes in Computer Science, vol 7802. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-36546-1_23

  • DOI: https://doi.org/10.1007/978-3-642-36546-1_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-36545-4

  • Online ISBN: 978-3-642-36546-1

  • eBook Packages: Computer Science, Computer Science (R0)
