Abstract
Intelligent data analysis (IDA) is an interdisciplinary study concerned with the effective analysis of data. In response to the challenge of extracting useful information from large quantities of online data, much work has appeared in the intersection of artificial intelligence, database, high-performance computing, pattern recognition, and statistics. Intelligent systems for data analysis have been developed in different application fields and much progress has been made. This editorial looks into a few key IDA topics, introduces the papers in this special issue, and identifies those challenging and fruitful areas for further research.
Similar content being viewed by others
References
E.F. Codd, Providing OLAP (On-Line Analytic Processing) to User-Analysts: An IT Mandate, E.F. Codd and Associates, 1994.
A. Berson and S. Smith, Data Warehousing, Data Mining, and OLAP, McGraw Hill, 1997.
G. Piatetsky-Shapiro and W.J. Frawley, Knowledge Discovery in Databases, AAAI Press/The MIT Press, 1991.
T. Mitchell, Machine Learning, McGraw Hill, 1997.
D.J. Hand, “Intelligent data analysis: Issues and opportunities,” in Advances Intelligent Data Analysis: Reasoning About Data, LNCS 1280, edited by X. Liu, P. Cohen, and M. Berthold, Springer-Verlag, pp. 1-14, 1997.
X. Liu, “Intelligent data analysis: Issues and challenges,” The Knowledge Engineering Review, vol. 11, pp. 365-371, 1996.
D.R. Cox and E.J. Snell, Applied Statistics: Principles and Examples. Chapman and Hall, 1981.
D.J. Hand, “Emergent themes in statistical expert systems research,” in Knowledge, Data and Computer-Assisted Decisions, edited by M. Schader and W. Gaul, Springer, pp. 279-288, 1990.
J.W. Tukey, “An alphabet for statisticians' expert systems,” in Artificial Intelligence and Statistics, edited by W.A. Gale, Addison-Wesley, pp. 401-409, 1986.
D.J. Hand, “Patterns in statistical strategy,” in Artificial Intelligence and Statistics, edited by W.A. Gale, Addison-Wesley, pp. 353-387, 1986.
D. Pregibon, “A diy guide to statistical strategy,” in Artificial Intelligence and Statistics, edited by W.A. Gale, Addison-Wesley, pp. 389-399, 1986.
D.J. Hand, “Deconstructing statistical questions (with discussion),” Journal of the Royal Statistical Society Series A, vol. 157, pp. 317-356, 1994.
C. Chatfield, “Model uncertainty, data mining and statistical inference (with discussion),” Journal of the Royal Statistical Society Series A, vol 158, pp. 419-466, 1995.
E.E. Leamer, Specification Searches: Ad hoc Inference with NonExperimental Data, Wiley, 1978.
L. Breiman, “The little bootstrap and other methods of dimensionality selection in regression: X-fixed prediction error,” Journal of the American Statistical Association, vol. 87, pp. 738-754, 1992.
B. Efron and R. Tibshirani, An Introduction to the Bootstrap, Chapman and Hall, 1993.
D. Draper, “Assessment and propagation of model uncertainty (with discussion),” Journal of the Royal Statistical Society, Series B, vol. 57, pp. 45-97, 1995.
W.R. Gilks, S. Richardson, and D.J. Spiegelhalter, Markov Chain Monte Carlo in Practice, Chapman and Hall, 1996.
D. Michie, D.J. Spiegelhalter, and C.C. Taylor, Machine Learning, Neural and Statistical Classification, Ellis Harwood, 1994.
V. Dhar and R. Stein, Seven Methods for Transforming Corporate Data into Business Intelligence, Prentice Hall, 1997.
C. Brodley and P. Smyth, “Applying classification algorithms in practice,” Statistics and Computing, vol. 7, pp. 45-56, 1997.
G. Tayi and T. Ballou, “Examining data quality,” Communications of the ACM, vol. 41, p. 2, 1998.
T. Wright, Statistical Methods and the Improvement of Data Quality, Academic Press, 1983.
R. Wang, V. Storey, and C. Firth, “A framework for analysis of data quality research,” IEEE Trans. on Knowledge and Data Engineering, vol. 7, pp. 623-640, 1995.
T. Redman, Data Quality for the Information Age, Artech House, Boston, 1996.
R. Little and D. Rubin, Statistical Analysis with Missing Data, Wiley, 1987.
M. Ramoni and P. Sebastiani, “The use of exogenous knowledge to learn bayesian networks from incomplete databases,” in Advances in Intelligent Data Analysis: Reasoning about Data, LNCS 1280, edited by X. Liu, P. Cohen, and M. Berthold, Springer-Verlag, pp. 539-548, 1997.
V. Barnet and T. Lewis, Outliers in Statistical Data. Wiley, 1994.
X. Liu, G. Cheng, and J. Wu, “Noise and uncertainty management in intelligent data modeling,” in Proc. of the 12th National Conference on Artificial Intelligence (AAAI-94), 1994, pp. 263-268.
R. Agrawal, T. Imielinski, and A. Swami, “Mining association rules between sets of items in large databases,” in Proc. of ACM SIGMOD Conference on Management of Data, 1993, pp. 207-216.
W.W. Cohen, “Fast effective rule induction,” in Proc. of the Twelfth International Conference on Machine Learning, 1995, pp. 115-123.
J. Han and Y. Fu, “Discovery of multi-level association rules from large databases,” in Proc. of the 21st Very Large Data Bases Conference, 1995, pp. 420-431.
E. Narendra and K. Fukunaga, “A branch and bound algorithm for feature subset selection,” IEEE Trans. on Computers, vol. 26, pp. 917-922, 1977.
J. Holland, Adaptation in Natural and Artificial Systems, The MIT Press, 1975.
D.J. Hand, Discrimination and Classification, Wiley, 1981.
C. Bishop, Neural Networks for Pattern Recognition, Oxford University Press, 1995.
H. Liu and H. Motoda, “Guest editors' introduction: Feature transformation and subset selection,” IEEE Intelligent Systems, pp. 26-29, 1998.
R. Stevens, P. Woodward, T. DeFanti, and C. Catlett, “From the i-way to the national technology grid,” Communications of the ACM, vol. 40, pp. 51-60, 1997.
K. Kennedy and et al., “A nationwide parallel computing environment,” Communications of the ACM, vol. 40, pp. 63-72, 1997.
C. Cruz-Neira, D. Sandin, and T. DeFanti, “Surround-screen projection-based virtual reality: the design and implementation of cave,” in Proc. of Siggraph'93 Computer Graphics Conference, 1993, pp. 135-142.
P. Stolorz and C. Dean, “Quakefinder: a scalable data mining system for detecting earthquakes from space,” in Proc. of the 2nd Int. Conf. on Knowledge Discovery and Data Mining, edited by J. Han, E. Simoudis, and U. Fayyad, AAAI Press, 1996, pp. 208-213.
C. Glymour, D. Madigan, D. Pregibon, and P. Smyth, “Statistical themes and lessons for data mining,” Data Mining and Knowledge Discovery, vol. 1, pp. 11-28, 1997.
P.J. Huber, “From large to huge: a statistician's reactions to kdd and dm,” in Proc. of the Third International Conference on Knowledge Discovery and Data Mining, edited by D. Heckerman, H. Mannila, D. Pregibon, and R. Uthurusamy, AAAI Press, 1997, pp. 304-308.
R. St. Amant and P. Cohen, “Interaction with a mixed-initiative system for exploratory data analysis,” Knowledge-Based Systems, vol. 10, pp. 265-273, 1998.
A. Silberschatz and A. Tuzhilin, “User-assisted knowledge discovery: How much should the user be involved?,” in Proc. of the ACM-SIGMOD'96 Workshop on Research Issues on Data Mining and Knowledge Discovery, ACM, 1996.
B. Grosz, “Collaborative systems,” AI Magazine, vol. 17, pp. 67-86, 1996.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Liu, X. Progress in Intelligent Data Analysis. Applied Intelligence 11, 235–240 (1999). https://doi.org/10.1023/A:1008384708180
Issue Date:
DOI: https://doi.org/10.1023/A:1008384708180