Skip to main content

A Data Quality in Use Model for Big Data

(Position Paper)

  • Conference paper
Advances in Conceptual Modeling (ER 2014)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8823))

Included in the following conference series:

Abstract

Organizations are nowadays immersed in the Big Data Era. Beyond the hype of the concept of Big Data, it is true that something in the way of doing business is really changing. Although some challenges keep being the same as for regular data, with big data, the focus has changed. The reason is due to Big Data is not only data, but also a complete framework including data themselves, storage, formats, and ways of provisioning, processing and analytics. A challenge that becomes even trickier is the one concerning to the management of the quality of big data. More than ever the need for assessing the quality-in-use of big datasets gains importance since the real contribution – business value- of a dataset to a business can be only estimated in its context of use. Although there exists different data quality models to assess the quality of data there still lacks of a quality-in-use model adapted to big data. To fill this gap, and based on ISO 25012 and ISO 25024, we propose the 3Cs model, which is composed of three data quality dimensions for assessing the quality-in-use of big datasets: Contextual Consistency, Operational Consistency and Temporal Consistency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Loshin, D.: Big Data Analytics: From Strategic Planning to Enterprise Integration with Tools, Techniques, NoSQL, and Graph. Elsevier, Walthman (2013)

    Google Scholar 

  2. Mantha, B.: Five Guiding Principles for Realizing the Promise of Big Data. Business Intelligence Journal 19, 8–11 (2014)

    Google Scholar 

  3. Kambatla, K., Kollias, G., Kumar, V., Grama, A.: Trends in big data analytics. Journal of Parallel and Distributed Computing - In Press - Corrected Proof (2014)

    Google Scholar 

  4. Redman, T.C.: Data’s Credibility Problem. Harvard Business Review 91, 84–88 (2013)

    Google Scholar 

  5. Quality in Progress

    Google Scholar 

  6. McAfee, A., Brynjolfsson, E.: Big data: The management revolution. Harvard Business Review 90, 60–68 (2012)

    Google Scholar 

  7. CIO INSIGHT

    Google Scholar 

  8. Deutsch, T.: Putting big data myths to rest. IBM Data Management Magazine (2013)

    Google Scholar 

  9. Howles, T.: Data, Data Quality, and Ethical Use. Software Quality Professional 16, 4–12 (2014)

    Google Scholar 

  10. ISO: ISO/IEC 25010, Systems and software engineering - Systems and software Quality Requirements and Evaluation (SQuaRE) - System and software quality models. International Organization for Standardization, Ginebra, Suiza (2011)

    Google Scholar 

  11. ISO: ISO/IEC 25012:2008 - Software engineering. Software product quality requirements and evaluation (SQuaRE). Data quality model International Organization for Standarization (2009)

    Google Scholar 

  12. Greenberg, P.: Big Data, Big Deal (2012), www.destinationCRM.com

  13. Russom, P.: Big Data Analytics (2011), ftp://ftp.software.ibm.com/software/tw/Defining_Big_Data_through_3V_v.pdf

  14. Buyya, R., Yeo, C.S., Venugopal, S., Broberg, J., Brandic, I.: Cloud computing and emerging IT platforms: Vision, hype, and reality for delivering computing as the 5th utility. Future Generation Computer Systems 25, 599–616 (2009)

    Article  Google Scholar 

  15. Armbrust, M., Fox, A., Griffith, R., Joseph, A.D., Katz, R., Konwinski, A., Lee, G., Patterson, D., Rabkin, A., Stoica, I., Zaharia, M.: A view of cloud computing. Commun. ACM 53, 50–58 (2010)

    Article  Google Scholar 

  16. Strong, D., Lee, Y., Wang, R.: Ten Potholes in the Road to Information Quality. IEEE Computer, 38–46 (1997)

    Google Scholar 

  17. Howard, P.: Market update- Data Quality - Market trends. Bloor (2013)

    Google Scholar 

  18. Lundquist, E.: Data Quality Is First Step Toward Reliable Data Analysis. p. 5. QuinStreet, Inc. (2013)

    Google Scholar 

  19. Becla, J., Wang, D.L., Lim, K.T.: Report from the 5th workshop on extremely large databases. Data Science Journal 11, 37–45 (2012)

    Article  Google Scholar 

  20. Kwon, O., Lee, N., Shin, B.: Data quality management, data usage experience and acquisition intention of big data analytics. International Journal of Information Management (2014)

    Google Scholar 

  21. Tee, J.: The Server Side (2013), http://www.theserverside.com/feature/Handling-the-four-Vs-of-big-data-volume-velocity-variety-and-veracity

  22. Lukoianova, T., Rubin, V.L.: Veracity roadmap: Is big data objective, truthful and credible? Advances in Classification Research Online 24 (2013)

    Google Scholar 

  23. ISO: ISO/IEC CD 25024 - Systems and software engineering – Systems and software Quality Requirements and Evaluation (SQuaRE) – Measurement of data quality

    Google Scholar 

  24. Lee, Y., Madnick, S., Wang, R., Wang, F., Hongyun, Z.: A Cubic Framework for the Chief Data Officer: Succeeding in a World of Big Data. MIS Quarterly Executive 13, 1–13 (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Caballero, I., Serrano, M., Piattini, M. (2014). A Data Quality in Use Model for Big Data. In: Indulska, M., Purao, S. (eds) Advances in Conceptual Modeling. ER 2014. Lecture Notes in Computer Science, vol 8823. Springer, Cham. https://doi.org/10.1007/978-3-319-12256-4_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12256-4_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12255-7

  • Online ISBN: 978-3-319-12256-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics