Abstract
Scientific datasets often consist of complex data types such as images. Mining such data presents interesting issues related to semantics. In this paper, we explore the research issues in mining data from the field of nanotechnology. More specifically, we focus on a problem that relates to image comparison of material nanostructures. A significant challenge here relates to the notion of similarity between the images. Features such as size and height of nano-particles and inter-particle distance are important in image similarity as conveyed by domain experts. However, there are no precise notions of similarity defined apriori. Hence there is a need for learning similarity measures. In this paper, we describe our proposed approach to learn similarity measures for graphical data. We discuss this with reference to nanostructure images. Other challenges in image comparison are also outlined. The use of this research is discussed with respect to targeted applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Chen, L., Ng, R.: On the Marriage of Lp-Norm and Edit Distance. In: VLDB, Toronto, Canada, August 2004, pp. 792–803 (2004)
Dougherty, S., Liang, J., Pins, G.: Preceision Nanostructure Fabrication for the Investigation of Cell Substrate Interactions. Technical Report, Worcester Polytechnic Institute, Worcester, Massachusetts (June 2006)
Fayyad, U., Haussler, D., Storoltz, P.: Mining Scientific Data. Communications of the ACM 39(11), 51–57 (1996)
Flemming, R., Murphy, C., Abrams, G., Goodman, S., Nealey, P.: Effects of Synthetic Micro and Nanostructured Surfaces on Cell Behavior. Biomaterials 20, 573–588 (1999)
Gao, L., Wang, X.: Continually Evaluating Similarity-Based Pattern Queries on a Streaming Time Series. In: SIGMOD, Madison, Wisconsin, pp. 370–381 (2002)
Han, J., Kamber, M.: Data Mining Concepts and Techniques. Morgan Kaufmann, California (2001)
Hinneburg, A., Aggarwal, C., Keim, D.: What is the Nearest Neighbor in High Dimensional Spaces. In: VLDB, Cairo, Egypt, August 2000, pp. 506–515 (2000)
Keim, D., Bustos, B.: Similarity Search in Multimedia Databases. In: ICDE 2004, Boston, Massachusetts, March 2004, pp. 873–874 (2004)
Kaufman, L., Rosseau, P.: Finding Groups in Data - An Introduction to Cluster Analysis. John Wiley, New York (1988)
Rissanen, J.: Stochastic Complexity and the MDL Principle. Econometric Reviews 6, 85–102 (1987)
Reich, Y., Barai, S.: Evaluating Machine Learning Models for Engineering Problems. Artificial Intelligence in Engineering 13, 257–272 (1999)
Traina, A., Traina, C., Papadimitriou, S., Faloutsos, C.: TriPlots - Scalable Tools for Multidimensional Data Mining. In: KDD, San Francisco, California, pp. 184–193 (2001)
United States National Nanotechnology Initiative. Technical Report (February 2006)
Varde, A., Rundensteiner, E., Ruiz, C., Maniruzzaman, M., Sisson Jr., R.: Learning Semantics-Preserving Distance Metrics for Clustering Graphical Data. In: KDD’s MDM, Chicago, Illinois, August 2005, pp. 107–112 (2005)
Varde, A., Rundensteiner, E., Ruiz, C., Brown, D., Maniruzzaman, M., Sisson, J.R.: Integrating Clustering and Classification for Estimating Process Variables in Materials Science. In: AAAI Poster Track, July 2006, Boston, Massachusetts (2006)
Ward, M.: XMDV Tool: Integrating Multiple Methods for Visualizing Multivariate Data. In: Visualization, Washington DC, October 1994, pp. 326–333 (1994)
Webster, T., Smith, T.: Increased Osteoblast Function on PLGA Composites Containing Nanophase Titania. Journal of Biomedical Materials Research 74A, 677–686 (2005)
Witten, I., Frank, E.: Data Mining: Practical Machine Learning Algorithms with Java Implementations. Morgan Kaufmann Publishers, California (2000)
Xing, E., Ng, A., Jordan, M., Russell, S.: Distance Metric Learning with Application to Clustering with Side Information. In: NIPS, December 2003, Vancouver, Canada, pp. 503–512 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Varde, A., Liang, J., Rundensteiner, E., Sisson, R. (2006). Mining Images of Material Nanostructure Data. In: Madria, S.K., Claypool, K.T., Kannan, R., Uppuluri, P., Gore, M.M. (eds) Distributed Computing and Internet Technology. ICDCIT 2006. Lecture Notes in Computer Science, vol 4317. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11951957_35
Download citation
DOI: https://doi.org/10.1007/11951957_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68379-7
Online ISBN: 978-3-540-68380-3
eBook Packages: Computer ScienceComputer Science (R0)