Skip to main content

Data Quality (Poor Quality Data: The Fly in the Data Analytics Ointment)

  • Reference work entry
  • First Online:
  • 236 Accesses

Introduction and Summary

In recent years a powerful combination of database technologies, data mining techniques (see Data Mining) and analytics software have created vast new opportunities for data analysts and statisticians. For example, corporations have duly stored the results of their customer transactions in corporate databases for over a generation. There are, quite literally millions of records. Massively parallel engines can examine these data in heretofore unimagined ways. The potentials to understand customer profitability, develop better understandings of customers’ past needs and predict future ones, and to use those insights to develop new product niches are enormous.

Yet all is not well in the world of data analytics. Unlocking the mysteries data have to offer is difficult at best. And putting the discoveries to work can be even harder. One major reason is poor quality data. Bad data camouflage the hidden nuggets in data or, worse, send an analysis in the wrong direction...

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   1,100.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   549.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References and Further Reading

  • Box GEP (1976) Science and statistics. J Am Stat Assoc 71:791–799

    MATH  MathSciNet  Google Scholar 

  • Box GEP, Draper NR (1987) Empirical model-building and response surfaces. Wiley, New York

    MATH  Google Scholar 

  • English LP (1999) Improving data warehouse and business information quality: methods for reducing costs and increasing profits. Wiley, New York

    Google Scholar 

  • Huang K-T, Lee YL, Wang RY (1999) Quality information and knowledge. Prentice Hall, New York

    Google Scholar 

  • Redman TC (2001) Data quality: the field guide. Butterworth-Heinemann Digital Press, Boston, MA

    Google Scholar 

  • Redman TC (2008) Data driven: profiting from your most important business asset. Harvard Business School Press, Boston, MA

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this entry

Cite this entry

Guess, F.G., Redman, T.c. (2011). Data Quality (Poor Quality Data: The Fly in the Data Analytics Ointment). In: Lovric, M. (eds) International Encyclopedia of Statistical Science. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04898-2_21

Download citation

Publish with us

Policies and ethics