Hostname: page-component-8448b6f56d-xtgtn Total loading time: 0 Render date: 2024-04-23T11:07:42.326Z Has data issue: false hasContentIssue false

Comparing mathematical and heuristic approaches for scientific data analysis

Published online by Cambridge University Press:  12 December 2007

Aparna S. Varde
Affiliation:
Mathematics and Computer Science, Virginia State University, Petersburg, Virginia, USA
Shuhui Ma
Affiliation:
Tiffany & Company, Cumberland, Rhode Island, USA
Mohammed Maniruzzaman
Affiliation:
Materials Science, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
David C. Brown
Affiliation:
Computer Science, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
Elke A. Rundensteiner
Affiliation:
Computer Science, Worcester Polytechnic Institute, Worcester, Massachusetts, USA
Richard D. SissonJR.
Affiliation:
Manufacturing and Materials Science, Worcester Polytechnic Institute, Worcester, Massachusetts, USA

Abstract

Scientific data is often analyzed in the context of domain-specific problems, for example, failure diagnostics, predictive analysis, and computational estimation. These problems can be solved using approaches such as mathematical models or heuristic methods. In this paper we compare a heuristic approach based on mining stored data with a mathematical approach based on applying state-of-the-art formulae to solve an estimation problem. The goal is to estimate results of scientific experiments given their input conditions. We present a comparative study based on sample space, time complexity, and data storage with respect to a real application in materials science. Performance evaluation with real materials science data is also presented, taking into account accuracy and efficiency. We find that both approaches have their pros and cons in computational estimation. Similar arguments can be applied to other scientific problems such as failure diagnostics and predictive analysis. In the estimation problem in this paper, heuristic methods outperform mathematical models.

Type
Research Article
Copyright
Copyright © Cambridge University Press 2008

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

REFERENCES

Aamodt, A., & Plaza, E. (2003). Case based reasoning: foundational issues, methodological variations & system approaches. Artificial Intelligence Communications 7(1), 3959.Google Scholar
Beck, J.V., Blackwell, B., & St. Clair, C.R. (1985). Inverse Heat Conduction. New York: Wiley.Google Scholar
Bierman, D., & Kamsteeg, P. (1988). Elicitation of knowledge with and for intelligent tutoring systems. IICAI-03: IEEE Systems, Man, and Cybernetics Society's 1st Indian Int. Conf. Artificial Intelligence.Google Scholar
Gehrke, J., Ramakrishnan, R., & Ganti, V. (1998). Rainforest—a framework for fast decision tree construction of large datasets. Data Mining and Knowledge Discovery 4, 127162.CrossRefGoogle Scholar
Han, J., & Kamber, M. (2001). Data Mining: Concepts and Techniques. San Mateo, CA: Morgan Kaufmann.Google Scholar
Helfman, J., & Hollan, J. (2001). Image representations for accessing and organizing Web information. Proc. SPIE Int. Society for Optical Engineering Internet Imaging II Conf., pp. 91101.Google Scholar
Hinneburg, A., Aggarwal, C., & Keim, D. (2000). What is the nearest neighbor in high dimensional spaces. Proc. VLDB, pp. 506515.Google Scholar
Huang, C.-H., Yuan, I.C., & Ay, H. (2003). A three-dimensional inverse problem in imaging the local heat transfer coefficients for plate finned-tube heat exchangers. International Journal of Heat and Mass Transfer 4(6), 36293638.CrossRefGoogle Scholar
Janecek, P., & Pu, P. (2004). Opportunistic Search with Semantic Fisheye Views, Technical Report TR IC/2004/42. Lausanne: Swiss Federal Institute of Technology.CrossRefGoogle Scholar
Keim, D., & Bustos, B. (2004). Similarity search in multimedia databases. Proc. ICDE, pp. 873874.Google Scholar
Kolodner, J. (1993). Case-Based Reasoning. San Mateo, CA: Morgan Kaufmann.CrossRefGoogle Scholar
Leake, D. (1996). Case-Based Reasoning: Experiences, Lessons and Future Directions. New York: AAAI Press.Google Scholar
Lu, Q., Vader, R., Kang, J., & Rong, Y. (2002). Development of a computer-aided heat treatment planning system. Heat Treatment of Metals 3, 6570.Google Scholar
Ma, S., Maniruzzaman, M., & Sisson, R.D. Jr. (2002). Characterization of the performance of mineral oil based quenchants using the CHTE Quench Probe System. Proc. 1st Int. Surface Engineering Congr. and 13th IFHTSE Congr.Google Scholar
Ma, S., Maniruzzaman, M., & Sisson, R.D. Jr. (2004). Inverse Heat Conduction Problem in Estimating the Surface Heat Transfer Coefficients by Steepest Descent Method, Technical Report. Worcester, MA: Worcester Polytechnic Institute.Google Scholar
MacQueen, J.B. (1967). Some methods for classification and analysis of multivariate observations. Proc. Mathematical Statistics and Probability, pp. 281297.Google Scholar
Maniruzzaman, M., Varde, A.S., & Sisson, R.D. Jr. (2006). Estimation of surface heat transfer coefficients for quenching process simulation. ASM Int. Conf. Materials Science and Technology.Google Scholar
Mitchell, T. (1997). Machine Learning. New York: McGraw–Hill.Google Scholar
Newell, A., Shaw, J.C., & Simon, H.A. (1988). Chess playing programs and the problem of complexity. In Computer Chess Compendium (Levy, D., Ed.), pp. 2942. New York: Springer–Verlag.CrossRefGoogle Scholar
Pal, K., & Campbell, J. (1997). An application of rule-based and case-based reasoning in a single legal knowledge-based system. Database for Advances in Information Systems 28(4), 4863.CrossRefGoogle Scholar
Quinlan, J.R. (1986). Induction of decision trees. Machine Learning 1, 81106.CrossRefGoogle Scholar
Roy, R.K. (2001). Design of Experiments Using The Taguchi Approach: 16 Steps to Product and Process Improvement. New York: Wiley.Google Scholar
Rissanen, J. (1987). Stochastic complexity and the MDL principle. Econometric Reviews 6, 85102.CrossRefGoogle Scholar
Russell, S., & Norvig, P. (1995). Artificial Intelligence: A Modern Approach. Englewood Cliffs, NJ: Prentice–Hall.Google Scholar
Stolz, G. Jr. (1960). Heat Transfer. New York: Wiley.Google Scholar
Scientific Forming Technologies Corporation. (2005). DEFORM-HT. Columbus, OH: Scientific Forming Technologies Corporation.Google Scholar
Sisson, R. Jr., Maniruzzaman, M., & Ma, S. (2004). Quenching: understanding, controlling and optimizing the process. Proc. Center for Heat Treating Excellence Fall Seminar, Columbus, OH.Google Scholar
Varde, A.S., Rundensteiner, E.A., Ruiz, C., Brown, D.C., Maniruzzaman, M., & Sisson, R.D. (2006 a). Effectiveness of domain-specific cluster representatives for graphical plots. Proc. SIGMOD IQIS Workshop, pp. 3136.Google Scholar
Varde, A.S., Rundensteiner, E.A., Ruiz, C., Brown, D.C., Maniruzzaman, M., & Sisson, R.D. Jr. (2006 b). Integrating clustering and classification for estimating process variables in materials science. AAAI Poster Track.Google Scholar
Varde, A.S., Rundensteiner, E.A., Ruiz, C., Maniruzzaman, M., & Sisson, R.D. Jr. (2005). Learning semantics-preserving distance metrics for graphical plots. Proc. SIGKDD MDM Workshop, pp. 107112.Google Scholar
Varde, A.S., Takahashi, M., Rundensteiner, E.A., Ward, M.O., Maniruzzaman, M., & Sisson, R.D. Jr. (2003). QuenchMiner™: decision support for optimization of heat treating processes. IICAI, pp. 9931003.Google Scholar
Varde, A.S., Takahashi, M., Rundensteiner, E.A., Ward, M.O., Maniruzzaman, M., & Sisson, R.D. Jr. (2004). A priori algorithm and game-of-life for predictive analysis in materials science. International Journal of Knowledge-Based & Intelligent Engineering Systems 8(4), 213228.CrossRefGoogle Scholar
Xing, E., Ng, A., Jordan, M., & Russell, S. (2003). Distance metric learning with application to clustering with side information. Proc. Neural Information Processing Systems, pp. 503512.Google Scholar