Skip to main content

Modelling Entity Integrity for Semi-structured Big Data

  • Conference paper
  • First Online:
Database Systems for Advanced Applications (DASFAA 2021)

Abstract

We propose a data model for investigating constraints that enforce the entity integrity of semi-structured big data. Particular support is given for the volume, variety, and veracity dimensions of big data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Amalina, F., et al.: Blending big data analytics: review on challenges and a recent study. IEEE Access 8, 3629–3645 (2020)

    Article  Google Scholar 

  2. Armstrong, W.W.: Dependency structures of data base relationships. In: Proceedings of IFIP World Computer Congress, pp. 580–583 (1974)

    Google Scholar 

  3. Brown, P., Link, S.: Probabilistic keys. IEEE Trans. Knowl. Data Eng. 29(3), 670–682 (2017)

    Article  Google Scholar 

  4. Christophides, V., Efthymiou, V., Stefanidis, K.: Entity Resolution in the Web of Data, Synthesis Lectures on the Semantic Web. Morgan & Claypool Publishers (2015)

    Google Scholar 

  5. Codd, E.F.: A relational model of data for large shared data banks. Commun. ACM 13(6), 377–387 (1970)

    Article  Google Scholar 

  6. Date, C.J.: A critique of the SQL database language. SIGMOD Rec. 14(3), 8–54 (1984)

    Article  Google Scholar 

  7. Diederich, J., Milton, J.: New methods and fast algorithms for database normalization. ACM Trans. Database Syst. 13(3), 339–365 (1988)

    Article  MathSciNet  Google Scholar 

  8. Dubois, D., Prade, H., Schockaert, S.: Generalized possibilistic logic: foundations and applications to qualitative reasoning about uncertainty. Artif. Intell. 252, 139–174 (2017)

    Article  MathSciNet  Google Scholar 

  9. Ganti, V., Sarma, A.D.: Data Cleaning: A Practical Perspective, Synthesis Lectures on Data Management. Morgan & Claypool Publishers (2013)

    Google Scholar 

  10. Hartmann, S., Link, S.: The implication problem of data dependencies over SQL table definitions: axiomatic, algorithmic and logical characterizations. ACM Trans. Database Syst. 37(2), 13:1–13:40 (2012)

    Google Scholar 

  11. Jensen, C.S., Snodgrass, R.T., Soo, M.D.: Extending existing dependency theory to temporal databases. IEEE Trans. Knowl. Data Eng. 8(4), 563–582 (1996)

    Article  Google Scholar 

  12. Köhler, H., Link, S.: Armstrong axioms and Boyce-Codd-Heath normal form under bag semantics. Inf. Process. Lett. 110(16), 717–724 (2010)

    Article  MathSciNet  Google Scholar 

  13. Lien, Y.E.: On the equivalence of database models. J. ACM 29(2), 333–362 (1982)

    Article  Google Scholar 

  14. Link, S., Prade, H.: Possibilistic functional dependencies and their relationship to possibility theory. IEEE Trans. Fuzzy Syst. 24(3), 757–763 (2016)

    Article  Google Scholar 

  15. Link, S., Prade, H.: Relational database schema design for uncertain data. Inf. Syst. 84, 88–110 (2019)

    Article  Google Scholar 

  16. Liu, Z.H., Hammerschmidt, B.C., McMahon, D.: JSON data management: supporting schema-less development in RDBMS. In: International Conference on Management of Data, SIGMOD 2014, Snowbird, UT, USA, 22–27 June 2014, pp. 1247–1258 (2014)

    Google Scholar 

  17. Suciu, D., Olteanu, D., Ré, C., Koch, C.: Probabilistic Databases, Synthesis Lectures on Data Management. Morgan & Claypool Publishers (2011)

    Google Scholar 

  18. Thalheim, B.: Dependencies in relational databases. Teubner (1991)

    Google Scholar 

  19. Thalheim, B.: On semantic issues connected with keys in relational databases permitting null values. Elektronische Informationsverarbeitung und Kybernetik 25(1/2), 11–20 (1989)

    MathSciNet  Google Scholar 

  20. Wei, Z., Link, S.: Embedded functional dependencies and data-completeness tailored database design. PVLDB 12(11), 1458–1470 (2019)

    Google Scholar 

  21. Zaniolo, C.: Database relations with null values. J. Comput. Syst. Sci. 28(1), 142–166 (1984)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sebastian Link .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Litvinenko, I., Wei, Z., Link, S. (2021). Modelling Entity Integrity for Semi-structured Big Data. In: Jensen, C.S., et al. Database Systems for Advanced Applications. DASFAA 2021. Lecture Notes in Computer Science(), vol 12681. Springer, Cham. https://doi.org/10.1007/978-3-030-73194-6_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-73194-6_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-73193-9

  • Online ISBN: 978-3-030-73194-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics