skip to main content
research-article

Do you know your IQ?: a research agenda for information quality in systems

Published: 21 January 2010 Publication History

Abstract

Information quality (IQ) is a measure of how fit information is for a purpose. Sometimes called Quality of Information (QoI) by analogy with Quality of Service (QoS), it quantifies whether the correct information is being used to make a decision or take an action. Not understanding when information is of adequate quality can lead to bad decisions and catastrophic effects, including system outages, increased costs, lost revenue -- and worse. Quantifying information quality can help improve decision making, but the ultimate goal should be to select or construct information producers that have the appropriate balance between information quality and the cost of providing it. In this paper, we provide a brief introduction to the field, argue the case for applying information quality metrics in the systems domain, and propose a research agenda to explore this space.

References

[1]
S. Agarwala, Y. Chen, D. Milojicic, and K. Schwan, "QMON: QoS- and utility-aware monitoring in enterprise systems", 3rd IEEE International Conference on Autonomic Computing (ICAC), 2006.
[2]
C. Aggarwal and P. Yu, "A survey of uncertain data algorithms and applications," IEEE Trans. on Knowledge and Data Engineering, Vol. 21, No. 5, May 2009, pp. 609--623.
[3]
M. Aguilera, J. Mogul, J. Wiener, P. Reynolds, and A. Muthitacharoen, "Performance debugging for distributed systems of black boxes," Proc. SOSP, 2003, pp. 74--89.
[4]
M. Arlitt, K. Farkas, S. Iyer, S. P. Kumaresan, S. Rafaeli, "Data assurance: a prerequisite for IT automation", HPL-TR-2005-212, HP Laboratories, November 2005.
[5]
P. Barham, A. Donnelly, R. Isaacs, and R. Mortier, "Using Magpie for request extraction and workload modelling," Proc. OSDI, 2004, pp. 259--272.
[6]
P. Bartlet-Ros, G. Iannaccone, J. Sanjuas-Cuxart, D. Amores-Lopez and J. Sole-Pareta, "Load shedding in network monitoring applications," Proc. USENIX Annual Technical Conf., 2007, pp. 59--72.
[7]
Y. Beth, B. Plale and D. Gannon,"A survey of data provenance in e-Science," SIGMOD Record, Vol. 34, 2005, pp. 31--36.
[8]
I. Cohen, M. Goldszmidt, T. Kelly, J. Symons, and J. Chase, "Correlating instrumentation data to system states: a building block for automated diagnosis and control," Proc. OSDI, 2004, pp. 231--244.
[9]
I. Cohen, S. Zhang, M. Goldszmidt, J. Symons, T. Kelly, and A. Fox,"Capturing, indexing, clustering, and retrieving system history, Proc. SOSP, 2005, pp. 105--118.
[10]
E. Cohen, N. Duffield, C. Lund, M. Thorup, "Confident estimation for multistage measurement sampling and aggregation,", Proc. SIGMETRICS, 2008, pp. 109--120.
[11]
N. Dalvi and D. Suciu, "Management of probabilistic data: foundations and challenges," Proc. PODS, 2007, pp. 1--12.
[12]
Frank Dravos. "Information quality: the quest for justification", Business Intelligence Journal 7(2), Spring 2002.
[13]
S. Duan, S. Babu and K. Munagala, "Fa: A system for automating failure diagnosis," Proc. ICDE, 2009, pp.1012--1023.
[14]
V.F. Grasso, J.L. Beck,. and G. Manfredi, "Seismic early warning systems: procedure for automated decision making," Technical report EERL-2005-02, Caltech, Pasadena, CA, November 2005.
[15]
R. Harji, "Harness Information to Deliver Enhanced Business Performance," Enterprise Search Summit, New York, NY, May 2009.
[16]
N. Jain, P. Mahajan, D. Kit, P. Yalagandula, M. Dahlin, and Y. Zhang, "Network imprecision: a new consistency metric for scalable monitoring," Proc. OSDI'08, December 2008.
[17]
D. Kahneman, P. Slovic and A. Tversky, Judgment under Uncertainty : Heuristics and Biases, Cambridge University Press, April 1982.
[18]
J. Kiernan and E. Terze, "EventSummarizer: a tool for summarizing large event sequences," Proc. 12th Intl. Conf. on Extending Database Technnology (EDBT'09), March 2009.
[19]
D. Krishna, "Calculating the Value of Information," The Data Warehousing Institute (TDWI) New York City Chapter, June 10, 2009.
[20]
M. Mesnier, M. Wachs, R. Sambasivan, A. Zheng, and G. Ganger, "Modeling the relative fitness of storage," Proc. SIGMETRICS, 2007, pp. 37--48.
[21]
R. Murty and M. Welsh, "Towards a dependable architecture for Internet-scale sensing," Proc. 2nd Workshop on Hot Topics in Dependability (HotDep '06), November 2006.
[22]
A. Preece, P. Missier, S. Embury, B. Jin and M. Greenwood,"An ontology-based approach to handling information quality in e-Science", Concurrency and Computation: Practice and Experience 20:253--264, 2008.
[23]
S. Rajbhandari, O. Rana and I. Wootten, "A fuzzy model for calculating workflow trust using provenance data," Proc. of 15th ACM Mardi Gras Conf., 2008, pp. 1--8.
[24]
E. Thereska, B. Salmon, J. Strunk, M. Wachs, M. Abd-El-Malik, J. Lopez and G. Ganger, "Stardust: tracking activity in a distributed storage system," Proc. SIGMETRICS, June 2006, pp. 3--14.
[25]
C. Wang, K.-L. Ma, "A statistical approach to volume data quality assessment," IEEE Trans on Visualization and Computer Graphics 14(3): 590--602, May/June 2008.
[26]
D. Wang, E. Michelakis, M. Garofalakis, and J. Hellerstein, "BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models," Proc. VLDB, 2008, pp. 340--351.
[27]
J. Widom, "Trio: a system for data, uncertainty, and lineage," In C. Aggarwal, editor, Managing and Mining Uncertain Data, Springer, 2009, pp. 113--148.
[28]
"NATO bombing of the Chinese embassy in Belgrade", Wikipedia, Dec. 2008.

Cited By

View all
  • (2022)Managing Expectations: Runtime Negotiation of Information Quality Requirements in Event-Based SystemsService-Oriented Computing10.1007/978-3-662-45391-9_14(199-213)Online publication date: 11-Mar-2022
  • (2021)Big Data Quality for Data Mining in Business Intelligence ApplicationsIntegration Challenges for Analytics, Business Intelligence, and Data Mining10.4018/978-1-7998-5781-5.ch004(64-91)Online publication date: 2021
  • (2021)Determining Information Quality in ICT SystemsEnergies10.3390/en1417554914:17(5549)Online publication date: 5-Sep-2021
  • Show More Cited By

Index Terms

  1. Do you know your IQ?: a research agenda for information quality in systems

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM SIGMETRICS Performance Evaluation Review
      ACM SIGMETRICS Performance Evaluation Review  Volume 37, Issue 3
      December 2009
      70 pages
      ISSN:0163-5999
      DOI:10.1145/1710115
      Issue’s Table of Contents

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 21 January 2010
      Published in SIGMETRICS Volume 37, Issue 3

      Check for updates

      Author Tags

      1. IQ
      2. QoI
      3. data quality
      4. goal-directed design
      5. information processing pipeline
      6. information quality
      7. modeling
      8. prediction
      9. uncertainty

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)5
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 27 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)Managing Expectations: Runtime Negotiation of Information Quality Requirements in Event-Based SystemsService-Oriented Computing10.1007/978-3-662-45391-9_14(199-213)Online publication date: 11-Mar-2022
      • (2021)Big Data Quality for Data Mining in Business Intelligence ApplicationsIntegration Challenges for Analytics, Business Intelligence, and Data Mining10.4018/978-1-7998-5781-5.ch004(64-91)Online publication date: 2021
      • (2021)Determining Information Quality in ICT SystemsEnergies10.3390/en1417554914:17(5549)Online publication date: 5-Sep-2021
      • (2018)Cost benefit analysis of cloud computing in educationInternational Journal of Business Information Systems10.1504/IJBIS.2018.08911227:2(205-221)Online publication date: 1-Jan-2018
      • (2018)NorthstarProceedings of the VLDB Endowment10.14778/3229863.324049311:12(2150-2164)Online publication date: 1-Aug-2018
      • (2017)A data quality metric (DQM)Proceedings of the VLDB Endowment10.14778/3115404.311541410:10(1094-1105)Online publication date: 1-Jun-2017
      • (2015)Data Quality for Data Mining in Business Intelligence ApplicationsIntegration of Data Mining in Business Intelligence Systems10.4018/978-1-4666-6477-7.ch003(34-59)Online publication date: 2015
      • (2015)Tag clouds with a twist: using tag clouds coloured by information’s trustworthiness to support situational awarenessJournal of Trust Management10.1186/s40493-015-0021-52:1Online publication date: 17-Dec-2015
      • (2015)Towards Optimizing Wide-Area Streaming AnalyticsProceedings of the 2015 IEEE International Conference on Cloud Engineering10.1109/IC2E.2015.53(452-457)Online publication date: 9-Mar-2015
      • (2014)Effective Measurement of DQ/IQ for BIInformation Quality and Governance for Business Intelligence10.4018/978-1-4666-4892-0.ch012(236-252)Online publication date: 2014
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media