Skip to main content

A Fourth Normal Form for Uncertain Data

  • Conference paper
  • First Online:
Advanced Information Systems Engineering (CAiSE 2019)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11483))

Included in the following conference series:

Abstract

Relational database design addresses applications for data that is certain. Modern applications require the handling of uncertain data. Indeed, one dimension of big data is veracity. Ideally, the design of databases helps users quantify their trust in the data. For that purpose, we need to establish a design framework that handles responsibly any knowledge of an organization about the uncertainty in their data. Naturally, such knowledge helps us find database designs that process data more efficiently. In this paper, we apply possibility theory to introduce the class of possibilistic multivalued dependencies that are a significant source of data redundancy. Redundant data may occur with different degrees, derived from the different degrees of uncertainty in the data. We propose a family of fourth normal forms for uncertain data. We justify our proposal showing that its members characterize schemata that are free from any redundant data occurrences in any of their instances at the targeted level of uncertainty in the data. We show how to automatically transform any schema into one that satisfies our proposal, without loss of any information. Our results are founded on axiomatic and algorithmic solutions to the implication problem of possibilistic functional and multivalued dependencies which we also establish.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www-01.ibm.com/software/data/bigdata/.

References

  1. Chaudhry, N.A., Moyne, J.R., Rundensteiner, E.A.: An extended database design methodology for uncertain data management. Inf. Sci. 121(1–2), 83–112 (1999)

    Article  Google Scholar 

  2. Dubois, D., Prade, H.: Possibility theory. In: Meyers, R.A. (ed.) Computational Complexity: Theory, Techniques, and Applications, pp. 2240–2252. Springer, New York (2012). https://doi.org/10.1007/978-1-4614-1800-9_139

    Chapter  Google Scholar 

  3. Fagin, R.: Multivalued dependencies and a new normal form for relational databases. ACM Trans. Database Syst. 2(3), 262–278 (1977)

    Article  Google Scholar 

  4. Galil, Z.: An almost linear-time algorithm for computing a dependency basis in a relational database. J. ACM 29(1), 96–102 (1982)

    Article  MathSciNet  Google Scholar 

  5. Hartmann, S., Link, S.: On a problem of Fagin concerning multivalued dependencies in relational databases. Theor. Comput. Sci. 353(1–3), 53–62 (2006)

    Article  MathSciNet  Google Scholar 

  6. Köhler, H., Link, S.: Inclusion dependencies and their interaction with functional dependencies in SQL. J. Comput. Syst. Sci. 85, 104–131 (2017)

    Article  MathSciNet  Google Scholar 

  7. Levene, M., Vincent, M.W.: Justification for inclusion dependency normal form. IEEE Trans. Knowl. Data Eng. 12(2), 281–291 (2000)

    Article  Google Scholar 

  8. Link, S.: Charting the completeness frontier of inference systems for multivalued dependencies. Acta Inf. 45(7–8), 565–591 (2008)

    Article  MathSciNet  Google Scholar 

  9. Link, S.: On the implication of multivalued dependencies in partial database relations. Int. J. Found. Comput. Sci. 19(3), 691–715 (2008)

    Article  MathSciNet  Google Scholar 

  10. Link, S.: Characterisations of multivalued dependency implication over undetermined universes. J. Comput. Syst. Sci. 78(4), 1026–1044 (2012)

    Article  MathSciNet  Google Scholar 

  11. Link, S., Prade, H.: Possibilistic functional dependencies and their relationship to possibility theory. IEEE Trans. Fuzzy Syst. 24(3), 757–763 (2016)

    Article  Google Scholar 

  12. Link, S., Prade, H.: Relational database schema design for uncertain data. In: Mukhopadhyay, S., et al. (eds.) Proceedings of the 25th ACM International Conference on Information and Knowledge Management, CIKM 2016, Indianapolis, IN, USA, 24–28 October 2016, pp. 1211–1220. ACM (2016)

    Google Scholar 

  13. Raju, K., Majumdar, A.K.: Fuzzy functional dependencies and lossless join decomposition of fuzzy relational database systems. ACM Trans. Database Syst. 13(2), 129–166 (1988)

    Article  Google Scholar 

  14. Sarma, A.D., Ullman, J.D., Widom, J.: Schema design for uncertain databases. In: Arenas, M., Bertossi, L.E. (eds.) Proceedings of the 3rd Alberto Mendelzon International Workshop on Foundations of Data Management, Arequipa, Peru, 12–15 May 2009, CEUR Workshop Proceedings, vol. 450 (2009)

    Google Scholar 

  15. Vincent, M.W.: Semantic foundations of 4NF in relational database design. Acta Inf. 36(3), 173–213 (1999)

    Article  MathSciNet  Google Scholar 

  16. Wei, Z., Link, S.: A fourth normal form for possibilistic data. Technical report 533, The University of Auckland, CDMTCS (2019)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sebastian Link .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wei, Z., Link, S. (2019). A Fourth Normal Form for Uncertain Data. In: Giorgini, P., Weber, B. (eds) Advanced Information Systems Engineering. CAiSE 2019. Lecture Notes in Computer Science(), vol 11483. Springer, Cham. https://doi.org/10.1007/978-3-030-21290-2_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-21290-2_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-21289-6

  • Online ISBN: 978-3-030-21290-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics