Skip to main content
Log in

Progress in Intelligent Data Analysis

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Intelligent data analysis (IDA) is an interdisciplinary study concerned with the effective analysis of data. In response to the challenge of extracting useful information from large quantities of online data, much work has appeared in the intersection of artificial intelligence, database, high-performance computing, pattern recognition, and statistics. Intelligent systems for data analysis have been developed in different application fields and much progress has been made. This editorial looks into a few key IDA topics, introduces the papers in this special issue, and identifies those challenging and fruitful areas for further research.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. E.F. Codd, Providing OLAP (On-Line Analytic Processing) to User-Analysts: An IT Mandate, E.F. Codd and Associates, 1994.

  2. A. Berson and S. Smith, Data Warehousing, Data Mining, and OLAP, McGraw Hill, 1997.

  3. G. Piatetsky-Shapiro and W.J. Frawley, Knowledge Discovery in Databases, AAAI Press/The MIT Press, 1991.

  4. T. Mitchell, Machine Learning, McGraw Hill, 1997.

  5. D.J. Hand, “Intelligent data analysis: Issues and opportunities,” in Advances Intelligent Data Analysis: Reasoning About Data, LNCS 1280, edited by X. Liu, P. Cohen, and M. Berthold, Springer-Verlag, pp. 1-14, 1997.

  6. X. Liu, “Intelligent data analysis: Issues and challenges,” The Knowledge Engineering Review, vol. 11, pp. 365-371, 1996.

    Google Scholar 

  7. D.R. Cox and E.J. Snell, Applied Statistics: Principles and Examples. Chapman and Hall, 1981.

  8. D.J. Hand, “Emergent themes in statistical expert systems research,” in Knowledge, Data and Computer-Assisted Decisions, edited by M. Schader and W. Gaul, Springer, pp. 279-288, 1990.

  9. J.W. Tukey, “An alphabet for statisticians' expert systems,” in Artificial Intelligence and Statistics, edited by W.A. Gale, Addison-Wesley, pp. 401-409, 1986.

  10. D.J. Hand, “Patterns in statistical strategy,” in Artificial Intelligence and Statistics, edited by W.A. Gale, Addison-Wesley, pp. 353-387, 1986.

  11. D. Pregibon, “A diy guide to statistical strategy,” in Artificial Intelligence and Statistics, edited by W.A. Gale, Addison-Wesley, pp. 389-399, 1986.

  12. D.J. Hand, “Deconstructing statistical questions (with discussion),” Journal of the Royal Statistical Society Series A, vol. 157, pp. 317-356, 1994.

    Google Scholar 

  13. C. Chatfield, “Model uncertainty, data mining and statistical inference (with discussion),” Journal of the Royal Statistical Society Series A, vol 158, pp. 419-466, 1995.

    Google Scholar 

  14. E.E. Leamer, Specification Searches: Ad hoc Inference with NonExperimental Data, Wiley, 1978.

  15. L. Breiman, “The little bootstrap and other methods of dimensionality selection in regression: X-fixed prediction error,” Journal of the American Statistical Association, vol. 87, pp. 738-754, 1992.

    Google Scholar 

  16. B. Efron and R. Tibshirani, An Introduction to the Bootstrap, Chapman and Hall, 1993.

  17. D. Draper, “Assessment and propagation of model uncertainty (with discussion),” Journal of the Royal Statistical Society, Series B, vol. 57, pp. 45-97, 1995.

    Google Scholar 

  18. W.R. Gilks, S. Richardson, and D.J. Spiegelhalter, Markov Chain Monte Carlo in Practice, Chapman and Hall, 1996.

  19. D. Michie, D.J. Spiegelhalter, and C.C. Taylor, Machine Learning, Neural and Statistical Classification, Ellis Harwood, 1994.

  20. V. Dhar and R. Stein, Seven Methods for Transforming Corporate Data into Business Intelligence, Prentice Hall, 1997.

  21. C. Brodley and P. Smyth, “Applying classification algorithms in practice,” Statistics and Computing, vol. 7, pp. 45-56, 1997.

    Google Scholar 

  22. G. Tayi and T. Ballou, “Examining data quality,” Communications of the ACM, vol. 41, p. 2, 1998.

    Google Scholar 

  23. T. Wright, Statistical Methods and the Improvement of Data Quality, Academic Press, 1983.

  24. R. Wang, V. Storey, and C. Firth, “A framework for analysis of data quality research,” IEEE Trans. on Knowledge and Data Engineering, vol. 7, pp. 623-640, 1995.

    Google Scholar 

  25. T. Redman, Data Quality for the Information Age, Artech House, Boston, 1996.

    Google Scholar 

  26. R. Little and D. Rubin, Statistical Analysis with Missing Data, Wiley, 1987.

  27. M. Ramoni and P. Sebastiani, “The use of exogenous knowledge to learn bayesian networks from incomplete databases,” in Advances in Intelligent Data Analysis: Reasoning about Data, LNCS 1280, edited by X. Liu, P. Cohen, and M. Berthold, Springer-Verlag, pp. 539-548, 1997.

  28. V. Barnet and T. Lewis, Outliers in Statistical Data. Wiley, 1994.

  29. X. Liu, G. Cheng, and J. Wu, “Noise and uncertainty management in intelligent data modeling,” in Proc. of the 12th National Conference on Artificial Intelligence (AAAI-94), 1994, pp. 263-268.

  30. R. Agrawal, T. Imielinski, and A. Swami, “Mining association rules between sets of items in large databases,” in Proc. of ACM SIGMOD Conference on Management of Data, 1993, pp. 207-216.

  31. W.W. Cohen, “Fast effective rule induction,” in Proc. of the Twelfth International Conference on Machine Learning, 1995, pp. 115-123.

  32. J. Han and Y. Fu, “Discovery of multi-level association rules from large databases,” in Proc. of the 21st Very Large Data Bases Conference, 1995, pp. 420-431.

  33. E. Narendra and K. Fukunaga, “A branch and bound algorithm for feature subset selection,” IEEE Trans. on Computers, vol. 26, pp. 917-922, 1977.

    Google Scholar 

  34. J. Holland, Adaptation in Natural and Artificial Systems, The MIT Press, 1975.

  35. D.J. Hand, Discrimination and Classification, Wiley, 1981.

  36. C. Bishop, Neural Networks for Pattern Recognition, Oxford University Press, 1995.

  37. H. Liu and H. Motoda, “Guest editors' introduction: Feature transformation and subset selection,” IEEE Intelligent Systems, pp. 26-29, 1998.

  38. R. Stevens, P. Woodward, T. DeFanti, and C. Catlett, “From the i-way to the national technology grid,” Communications of the ACM, vol. 40, pp. 51-60, 1997.

    Google Scholar 

  39. K. Kennedy and et al., “A nationwide parallel computing environment,” Communications of the ACM, vol. 40, pp. 63-72, 1997.

    Google Scholar 

  40. C. Cruz-Neira, D. Sandin, and T. DeFanti, “Surround-screen projection-based virtual reality: the design and implementation of cave,” in Proc. of Siggraph'93 Computer Graphics Conference, 1993, pp. 135-142.

  41. P. Stolorz and C. Dean, “Quakefinder: a scalable data mining system for detecting earthquakes from space,” in Proc. of the 2nd Int. Conf. on Knowledge Discovery and Data Mining, edited by J. Han, E. Simoudis, and U. Fayyad, AAAI Press, 1996, pp. 208-213.

  42. C. Glymour, D. Madigan, D. Pregibon, and P. Smyth, “Statistical themes and lessons for data mining,” Data Mining and Knowledge Discovery, vol. 1, pp. 11-28, 1997.

    Google Scholar 

  43. P.J. Huber, “From large to huge: a statistician's reactions to kdd and dm,” in Proc. of the Third International Conference on Knowledge Discovery and Data Mining, edited by D. Heckerman, H. Mannila, D. Pregibon, and R. Uthurusamy, AAAI Press, 1997, pp. 304-308.

  44. R. St. Amant and P. Cohen, “Interaction with a mixed-initiative system for exploratory data analysis,” Knowledge-Based Systems, vol. 10, pp. 265-273, 1998.

    Google Scholar 

  45. A. Silberschatz and A. Tuzhilin, “User-assisted knowledge discovery: How much should the user be involved?,” in Proc. of the ACM-SIGMOD'96 Workshop on Research Issues on Data Mining and Knowledge Discovery, ACM, 1996.

  46. B. Grosz, “Collaborative systems,” AI Magazine, vol. 17, pp. 67-86, 1996.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, X. Progress in Intelligent Data Analysis. Applied Intelligence 11, 235–240 (1999). https://doi.org/10.1023/A:1008384708180

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1008384708180

Navigation