skip to main content
research-article

Data Quality Assessment through a Preference Model

Published:06 March 2024Publication History
Skip Abstract Section

Abstract

Evaluating the quality of data is a problem of a multi-dimensional nature and quite frequently depends on the perspective of an expected use or final purpose of the data. Numerous works have explored the well-known specification of data quality dimensions in various application domains, without addressing the inter-dependencies and aggregation of quality attributes for decision support. In this work we therefore propose a context-dependent formal process to evaluate the quality of data which integrates a preference model from the field of Multi-Criteria Decision Aiding. The parameters of this preference model are determined through interviews with work-domain experts. We show the interest of the proposal on a case study related to the evaluation of the quality of hydrographical survey data.

REFERENCES

  1. [1] Aljumaili M., Karim R., and Tretten P.. 2016. Metadata-based data quality assessment. VINE Journal of Information and Knowledge Management Systems (2016).Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Aljumaili M., Karim R., and Tretten P.. 2018. Data quality assessment using multi-attribute maintenance perspective. International Journal of Information and Decision Sciences 10, 2 (2018), 147161.Google ScholarGoogle ScholarCross RefCross Ref
  3. [3] Batini C., Cappiello C., Francalanci C., and Maurino A.. 2009. Methodologies for data quality assessment and improvement. ACM Computing Surveys (CSUR) 41, 3 (2009), 152.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. [4] Bouyssou D. and Marchant T.. 2007. An axiomatic approach to noncompensatory sorting methods in MCDM, I: The case of two categories. European Journal of Operational Research 178, 1 (April 2007), 217245.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Bouyssou D. and Marchant T.. 2007. An axiomatic approach to noncompensatory sorting methods in MCDM, II: More than two categories. European Journal of Operational Research 178, 1 (April 2007).Google ScholarGoogle Scholar
  6. [6] Bouyssou D., Marchant T., Pirlot M., Tsoukiàs A., and Vincke P.. 2006. Evaluation and Decision Models with Multiple Criteria: Stepping Stones for the Analyst (1st ed.). Springer, Boston.Google ScholarGoogle Scholar
  7. [7] Barbosa A. de Souza,Oliveira R. A. Shayani, and M. A. G. de. 2018. A multi-criteria decision analysis method for regulatory evaluation of electricity distribution service quality. Utilities Policy 53 (2018), 3848.Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Gualo Fernando, Rodríguez Moisés, Verdugo Javier, Caballero Ismael, and Piattini Mario. 2021. Data quality certification using ISO/IEC 25012: Industrial experiences. Journal of Systems and Software 176 (2021), 110938.Google ScholarGoogle ScholarCross RefCross Ref
  9. [9] IHO. 2000. IHO Transfer Standard for Digital Hydrographic Data. Technical Report. IHO. Special Publication N°57, 3 edition.Google ScholarGoogle Scholar
  10. [10] IHO. 2020. IHO Standards for Hydrographic Surveys. Technical Report. IHO. Special Publication N°44, 6 edition.Google ScholarGoogle Scholar
  11. [11] Keeney R. L. and Raiffa H.. 1976. Decisions with Multiple Objectives: Preferences and Value Tradeoffs. J. Wiley, New York.Google ScholarGoogle Scholar
  12. [12] Klein A. and Lehner W.. 2009. Representing data quality in sensor data streaming environments. Journal of Data and Information Quality (JDIQ) 1, 2 (2009), 128.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] C. Lacagnina,G. Peng, I. Ivanova, R. Downs, H. Ramapriyan, D. Moroni, Y. Wei, L. Wyborn, D. Jones, and A. Ganske. 2021. International community guidelines for sharing and reusing quality information of individual earth science datasets. Earth and Space Science Open Archive (2021). Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Deunf J. Le, Khannoussi A., Lecornu L., Meyer P., and Puentes J.. 2021. Automatic data quality assessment of hydrographic surveys taking into account experts’ preferences. In OCEANS 2021: San Diego – Porto. 110. Google ScholarGoogle ScholarCross RefCross Ref
  15. [15] Leroy A., Mousseau V., and Pirlot M.. 2011. Learning the parameters of a multiple criteria sorting method. In Algorithmic Decision Theory, Brafman R., Roberts F., and Tsoukiàs A. (Eds.), Vol. 6992. Springer, 219233.Google ScholarGoogle Scholar
  16. [16] McGilvray D.. 2021. Executing Data Quality Projects: Ten Steps to Quality Data and Trusted Information (TM). Academic Press.Google ScholarGoogle Scholar
  17. [17] Mousseau V., Figueira J., and Naux J. P.. 2001. Using assignment examples to infer weights for ELECTRE TRI method: Some experimental results. European Journal of Operational Research 130, 2 (April 2001), 263275.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Mousseau V. and Słowiński R.. 1998. Inferring an ELECTRE TRI model from assignment examples. Journal of Global Optimization 12, 2 (1998), 157174.Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] The A. Ngo and Mousseau V.. 2002. Using assignment examples to infer category limits for the ELECTRE TRI method. JMCDA 11, 1 (Nov. 2002), 2943.Google ScholarGoogle Scholar
  20. [20] Olteanu A.-L. and Meyer P.. 2014. Inferring the parameters of a majority rule sorting model with vetoes on large datasets. In DA2PL 2014: From Multiple Criteria Decision Aid to Preference Learning. Ecole Centrale Paris and Université de Mons, 8794.Google ScholarGoogle Scholar
  21. [21] Puentes J., Lecornu L., and Solaiman B.. 2019. Data and information quality in remote sensing. In Information Quality in Information Fusion and Decision Making. Springer, 401421.Google ScholarGoogle ScholarCross RefCross Ref
  22. [22] Ramapriyan H. K., Peng G., Shie D. Moroni, and C. L.. 2017. Ensuring and improving information quality for earth science data and products. D.-Lib Magazine 23 (2017).Google ScholarGoogle ScholarCross RefCross Ref
  23. [23] Ranasinghe C. and Kray C.. 2018. Location information quality: A review. Sensors 18, 11 (2018), 3999.Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Roy B.. 1996. Multicriteria Methodology for Decision Aiding. Kluwer Academic, Dordrecht.Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Sha K. and Zeadally S.. 2015. Data quality challenges in cyber-physical systems. Journal of Data and Information Quality (JDIQ) 6, 2-3 (2015), 14.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Shom. 2022. Lot bathymétrique S201500100-1. https://services.data.shom.fr/geonetwork/srv/api/records/LOTS_BATHY_S201500100-1. Accessed: 2022-09-27.Google ScholarGoogle ScholarCross RefCross Ref
  27. [27] Sobrie O., Mousseau V., and Pirlot M.. 2013. Learning a majority rule model from large sets of assignment examples. In Algorithmic Decision Theory. Springer Berlin, 336350.Google ScholarGoogle Scholar
  28. [28] Sobrie O., Mousseau V., and Pirlot M.. 2017. A population-based algorithm for learning a majority rule sorting model with coalitional veto. In Evolutionary Multi-Criterion Optimization, Trautmann H., Rudolph G., Klamroth K., Schütze O., Wiecek M., Jin Y., and Grimme C. (Eds.). Springer International Publishing, Cham, 575589.Google ScholarGoogle Scholar
  29. [29] Todoran I., Lecornu L., Khenchaf A., and Caillec J. Le. 2015. A methodology to evaluate important dimensions of information quality in systems. Journal of Data and Information Quality (JDIQ) 6, 2-3 (2015), 123.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Kopczyńska A. Trendowicz and S.. 2014. Adapting multi-criteria decision analysis for assessing the quality of software products. Current approaches and future perspectives. Advances in Computers 93 (2014), 153226.Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Visengeriyeva L. and Abedjan Z.. 2020. Anatomy of metadata for data curation. Journal of Data and Information Quality (JDIQ) 12, 3 (2020), 130.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Yeganeh N. K., Sadiq S., and Sharaf M. A.. 2014. A framework for data quality aware query systems. Information Systems 46 (2014), 2444.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Data Quality Assessment through a Preference Model

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image Journal of Data and Information Quality
          Journal of Data and Information Quality  Volume 16, Issue 1
          March 2024
          187 pages
          ISSN:1936-1955
          EISSN:1936-1963
          DOI:10.1145/3613486
          Issue’s Table of Contents

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 6 March 2024
          • Online AM: 29 November 2023
          • Accepted: 6 November 2023
          • Revised: 14 September 2023
          • Received: 17 October 2022
          Published in jdiq Volume 16, Issue 1

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
        • Article Metrics

          • Downloads (Last 12 months)195
          • Downloads (Last 6 weeks)50

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Full Text

        View this article in Full Text.

        View Full Text