Skip to main content

Functional Dependencies in Incomplete Databases with Limited Domains

  • Conference paper
  • First Online:
Foundations of Information and Knowledge Systems (FoIKS 2020)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12012))

Abstract

Missing data value is an extensive problem in both research and industrial developers. Two general approaches are there to deal with the problem of missing values in databases, they either could be ignored (removed) or imputed (filled in) with new values [9]. In the present paper, we use the second method. Possible worlds were introduced in [14, 16] and possible and certain keys, as well as weak and strong functional dependencies were studied. We introduced the intermediate concept of strongly possible worlds that are obtained by imputing values already existing in the table in a preceding paper. This results in strongly possible keys and strongly possible functional dependencies. We give a polynomial algorithm to verify a single spKey, and show that in general, it is NP-complete to verify an arbitrary collection of spKeys. We give a graph theoretical characterization of the validity of a given spFD \(X\rightarrow _{sp}Y\). We analyze which weak/strong functional dependency axioms remain sound for strongly possible functional dependencies and give appropriate modifications of the not sound ones.

Research of the second author was partially supported by the National Research, Development and Innovation Office (NKFIH) grant K–116769. This work is also connected to the scientific program of the “Development of quality-oriented and harmonized R+D+I strategy and functional model at BME” project, supported by the New Hungary Development Plan (Project ID: TMOP-4.2.1/B-09/1/KMR-2010-0002).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Let G(VE) be a graph and \(L:V\rightarrow 2^{\mathbb {N}}\) be a mapping that assigns each vertex a L(v). A is a mapping \(c:V\rightarrow \bigcup _{v\in V}L(v)\) such that \(c(v)\in L(v)\) and \(c(u)\ne c(v)\) if \(\{u,v\}\in E\).

References

  1. Acuña, E., Rodriguez, C.: The treatment of missing values and its effect on classifier accuracy. In: Banks, D., McMorris, F.R., Arabie, P., Gaul, W. (eds.) Classification, Clustering, and Data Mining Applications. STUDIES CLASS. Springer, Berlin, Heidelberg (2004). https://doi.org/10.1007/978-3-642-17103-1_60

    Chapter  Google Scholar 

  2. Alattar, M., Sali, A.: Keys in relational databases with nulls and bounded domains. In: Welzer, T., Eder, J., Podgorelec, V., Kamišalić Latifić, A. (eds.) ADBIS 2019. LNCS, vol. 11695, pp. 33–50. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-28730-6_3

    Chapter  Google Scholar 

  3. Atzeni, P., Morfuni, N.M.: Functional dependencies and constraints on null values in database relations. Inf. Process. Lett. 18(4), 233–238 (1984)

    Article  Google Scholar 

  4. Sree Dhevi, A.T.: Imputing missing values using Inverse Distance Weighted Interpolation for time series data. In: Sixth International Conference on Advanced Computing (ICoAC), Chennai, pp. 255–259 (2014). https://doi.org/10.1109/ICoAC.2014.7229721

  5. Chang, G., Ge, T.: Comparison of missing data imputation methods for traffic flow. In: Proceedings 2011 International Conference on Transportation, Mechanical, and Electrical Engineering (TMEE), Changchun, pp. 639–642 (2011). https://doi.org/10.1109/TMEE.2011.6199284

  6. Cheng, C., Wei L., Lin, T.: Improving relational database quality based on adaptive learning method for estimating null value. In: Second International Conference on Innovative Computing, Information and Control (ICICIC 2007), Kumamoto, p. 81 (2007). https://doi.org/10.1109/ICICIC.2007.350

  7. Date, C.J.: NOT Is Not “Not”! (Notes on Three-Valued Logic and Related Matters) in Relational Database Writings 1985–1989. Addison-Wesley, Reading (1990)

    Google Scholar 

  8. Farhangfar, A., Kurgan, L.A., Pedrycz, W.: Experimental analysis of methods for imputation of missing values in databases. In: Proceedings of the SPIE, Intelligent Computing: Theory and Applications II, 12 April 2004, vol. 5421 (2004). https://doi.org/10.1117/12.542509

  9. Farhangfar, A., Kurgan, L.A., Pedrycz, W.: A novel framework for imputation of missing values in databases. IEEE Trans. Syst. Man Cybern. Part A Syst. Hum. 37(5), 692–709 (2007)

    Article  Google Scholar 

  10. Farhangfar, A., Kurgan, L., Dy, J.: Impact of imputation of missing values on classification error for discrete data. Pattern Recogn. 41(12), 3692–3705 (2008)

    Article  Google Scholar 

  11. Grahne, G.: Dependency satisfaction in databases with incomplete information. In: Proceedings of the Tenth International Conference of Very Large Data Base, Singapore, August 1984. vldb.org

  12. Grzymala-Busse, J.W., Hu, M.: A comparison of several approaches to missing attribute values in data mining. In: Ziarko, W., Yao, Y. (eds.) RSCTC 2000. LNCS (LNAI), vol. 2005, pp. 378–385. Springer, Heidelberg (2001). https://doi.org/10.1007/3-540-45554-X_46

    Chapter  MATH  Google Scholar 

  13. Imielinski, T., Lipski, W.: Incomplete information in relational databases. JACM 31(4), 761–791 (1984)

    Article  MathSciNet  Google Scholar 

  14. Köhler, H., Leck, U., Link, S., Zhou, X.: Possible and certain keys for SQL. VLDB J. 25(4), 571–596 (2016)

    Article  Google Scholar 

  15. Köhler, H., Link, S.: SQL schema design: foundations, normal forms, and normalization. Inf. Syst. 76, 88–113 (2018)

    Article  Google Scholar 

  16. Levene, M., Loizou, G.: Axiomatisation of functional dependencies in incomplete relations. J. Theor. Comput. Sci. 206(1–2), 283–300 (1998)

    Article  MathSciNet  Google Scholar 

  17. Lien, Y.E.: On the equivalence of database models. J. ACM 29(2), 333–362 (1982)

    Article  Google Scholar 

  18. Lovász, L., Plummer, M.D.: Matching Theory, vol. 367. American Mathematical Society, Providence (2009)

    MATH  Google Scholar 

  19. Vassilou, Y.: Functional dependencies and incomplete information. In: Proceedings of International Conference on Very Large Databases, VLDB 1980, pp. 260–269 (1980)

    Google Scholar 

  20. Wei, Z., Link, S.: Embedded functional dependencies and data-completeness tailored database design. PVLDB 12(11), 1458–1470 (2019)

    Google Scholar 

  21. Zhang, S., Qin, Z., Ling, C.X., Sheng, S.: “Missing is Useful”: missing values in cost-sensitive decision trees. IEEE Trans. Knowl. Data Eng. 17(12), 1689–1693 (2005)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Attila Sali .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Alattar, M., Sali, A. (2020). Functional Dependencies in Incomplete Databases with Limited Domains. In: Herzig, A., Kontinen, J. (eds) Foundations of Information and Knowledge Systems. FoIKS 2020. Lecture Notes in Computer Science(), vol 12012. Springer, Cham. https://doi.org/10.1007/978-3-030-39951-1_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-39951-1_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-39950-4

  • Online ISBN: 978-3-030-39951-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics