Abstract
Poor quality of stored data is reflected by violations of integrity constraints. Answers to queries in databases that contain low-quality information usually cannot be trusted. Nevertheless, many answers given by such databases may still be useful, as long as they are derived from data of sufficiently high quality. We formalize our intuition of answers that have quality on the basis of ‘causes’. A cause of an answer is a minimal excerpt of the database that explains why the answer has been given. Thus, an answer has quality if the overlap of its causes with the causes of integrity violation is empty. Even if that overlap is not empty, an answer may still have sufficient quality, provided the overlap is small enough. The size of the overlap between the causes of an answer and the causes of integrity violations can be measured by quality metrics.
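The idea of overlapping causes can be illustrated with a minimal sketch. It is not the formalization used in the paper: it assumes that causes are already given as plain sets of facts, it reads "overlap" as the set of facts shared between the causes of an answer and the causes of violations, and it uses the number of shared facts as one possible metric. All names (`facts_of`, `overlap`, `has_quality`, `has_sufficient_quality`, `max_overlap`) and the employee data are hypothetical.

```python
from typing import FrozenSet, Iterable, Set

Fact = str                # a stored fact, written as a string for illustration
Cause = FrozenSet[Fact]   # a cause: a minimal set of facts explaining a result


def facts_of(causes: Iterable[Cause]) -> Set[Fact]:
    """Collect every fact that occurs in at least one of the given causes."""
    collected: Set[Fact] = set()
    for cause in causes:
        collected |= cause
    return collected


def overlap(answer_causes: Iterable[Cause],
            violation_causes: Iterable[Cause]) -> Set[Fact]:
    """Facts shared by the causes of an answer and the causes of violations."""
    return facts_of(answer_causes) & facts_of(violation_causes)


def has_quality(answer_causes: Iterable[Cause],
                violation_causes: Iterable[Cause]) -> bool:
    """An answer has quality if the overlap is empty."""
    return not overlap(answer_causes, violation_causes)


def has_sufficient_quality(answer_causes: Iterable[Cause],
                           violation_causes: Iterable[Cause],
                           max_overlap: int) -> bool:
    """Assumed metric: tolerate an overlap of at most max_overlap facts."""
    return len(overlap(answer_causes, violation_causes)) <= max_overlap


# Hypothetical data. Assumed constraint: each employee belongs to one department.
# The two facts about bob together cause a violation of that constraint.
violation_causes = [frozenset({"emp(bob, sales)", "emp(bob, hr)"})]

# Query "who works in sales?" yields the answers ann and bob.
ann_causes = [frozenset({"emp(ann, sales)"})]
bob_causes = [frozenset({"emp(bob, sales)"})]

print(has_quality(ann_causes, violation_causes))  # ann's cause is untouched by the violation
print(has_quality(bob_causes, violation_causes))  # bob's cause is part of the violation
```

In this toy setting the answer `ann` has quality, while `bob` does not; `bob` could still count as having sufficient quality if an overlap of one fact is tolerated via `max_overlap=1`. Counting shared facts is only one conceivable metric and is chosen here for simplicity.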
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Decker, H. (2013). Answers That Have Quality. In: Murgante, B., et al. Computational Science and Its Applications – ICCSA 2013. ICCSA 2013. Lecture Notes in Computer Science, vol 7972. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39643-4_39
DOI: https://doi.org/10.1007/978-3-642-39643-4_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39642-7
Online ISBN: 978-3-642-39643-4
eBook Packages: Computer Science (R0)