Abstract
Data extraction and information retrieval from a great volume of data set always is a tedious and difficult work. Therefore, an effective and efficient technology for searching for desired data becomes increasingly important. Since metadata with certain attributes may characterize data files, to extract data with the help of metadata can be expectably to simplify the work. Metadata Classification has been proposed to improve significantly the performance of scientific data extraction. In this paper, a scientific data extraction architecture based on the assistance of metadata classification mechanism is proposed. The architecture is built by utilizing mediator/wrapper architecture to develop a scientific data extracting system to help oceanographer analyzing ocean’s ecology. The result of performance evaluation shows that the architecture with the help of metadata classification can extract user’s desired data effectively and efficiently.
Similar content being viewed by others
References
Bouguettaya A, Benatallah B, Hendra L, Ouzzani M, Beard J (2000) Supporting dynamic interactions among web-based information sources. IEEE Trans Knowl Data Eng 12(5):779–801
Chang YS, Lai HJ, Cheng HT (2009) Improving scientific data extraction using metadata classification. In: 2009 international workshop on grid computing, applications, and technologies, Kaohsiung, Taiwan, December 14–16, pp 669–673
Chang YS, Liang KC, Cheng MC, Yuan SM (2004) Prototyping an integrated information gathering system on CORBA. J Syst Softw 72(2):281–294
Chang YS, Cheng HT, Lai HJ (2009) Metadata miner assisted integrated information retrieval for Argo ocean data. In: IEEE international conference on systems, man, and cybernetics, San Antonio, Texas, USA, October 11–14, pp 2930–2935
Huang YP, Hsu WT, Sandnes FE (2008) Association analysis of ocean salinity and temperature variations. In: 2008 Third international conference on convergence and hybrid information technology, vol 2, pp 680–685
Huang YP, Kao LJ, Sandnes FE (2008) Efficient mining of salinity and temperature association rules from ARGO data. Expert Syst Appl 35(1–2):59–68
Huang YP, Jau JS, Sandnes FE (2009) Temporal-spatial association analysis of ocean salinity and temperature variations. In: Proceedings of the 2nd international conference on interaction sciences: information technology, culture and human, Seoul, Korea, pp 1001–1006
Jones MB, Berkley C, Bojilova J, Schildhauer M (2001) Managing scientific metadata. IEEE Internet Comput 5(5):59–68
Korfhage RR (1997) Information storage and retrieval. Wiley, New York
Lagoze C, Krafft D, Cornwell T, Dushay N, Eckstrom D, Saylor J (2006) Metadata aggregation and “automated digital libraries”: a retrospective on the NSDL experience. In: 6th ACM/IEEE-CS joint conference on digital libraries, Chapel Hill, NC, USA June 11–15, pp 230–239
Mena E, Illarramendi A, Kashyap V, Seth AP (2000) OBSERVER: an approach for query processing in global information systems based on interoperation across pre-existing ontologies. Distrib Parallel Databases 8(2):223–271
Pereira F, Vetra A, Sikora T (2008) Multimedia retrieval and delivery: essential metadata challenges and standards. Proc IEEE 96(4):721–744
Plale B, Gannon D, Alameda J, Wilhelmson B, Hampton S, Rossi A, Droegemeier K (2005) Active management of scientific data. IEEE Internet Comput 9(1):27–34
Roemmich DH, Davis RE, Brechner Owens W, Molinari RL, Garzoli SL, Johnson GC (2009) The Argo project: global ocean observations for understanding and prediction of climate variability. http://www.coreocean.org/nopp/project-reports/reports/02roemmi.pdf
Song J, Yoo S, Park CS, Choi DH, Lee YJ (2006) The design of a grid-enabled information integration system based on mediator/wrapper architectures. In: International conference on grid computing & application, pp 114–120
Spertus E, Stein LA (2000) Squeal: a structured query language for the Web. Int J Comput Netw 33:95–103
Teng J, Liu Z, Sun M, Sun C, Xu J (2006) Development of online Argo data service platform based on GIS. In: IEEE international conference on geoscience and remote sensing symposium, July 31–Aug 4, pp 1316–1318
Thomson J, Adams D, Cowley PJ, Walker K (2003) Metadata’s role in a scientific achive. IEEE Comput 36(12):27–34
Wohrer A, Brezany P, Tjoa AM (2005) Novel mediator architecture for grid information systems. Future Gener Comput Syst 21:107–114
Yang R, Kafatos M, Wang XS (2002) Managing scientific metadata using XML. IEEE Internet Comput 6(4):52–59
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Chang, YS., Cheng, HT. A scientific data extraction architecture using classified metadata. J Supercomput 60, 338–359 (2012). https://doi.org/10.1007/s11227-010-0462-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-010-0462-7