Skip to main content
Log in

A scientific data extraction architecture using classified metadata

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

Data extraction and information retrieval from a great volume of data set always is a tedious and difficult work. Therefore, an effective and efficient technology for searching for desired data becomes increasingly important. Since metadata with certain attributes may characterize data files, to extract data with the help of metadata can be expectably to simplify the work. Metadata Classification has been proposed to improve significantly the performance of scientific data extraction. In this paper, a scientific data extraction architecture based on the assistance of metadata classification mechanism is proposed. The architecture is built by utilizing mediator/wrapper architecture to develop a scientific data extracting system to help oceanographer analyzing ocean’s ecology. The result of performance evaluation shows that the architecture with the help of metadata classification can extract user’s desired data effectively and efficiently.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bouguettaya A, Benatallah B, Hendra L, Ouzzani M, Beard J (2000) Supporting dynamic interactions among web-based information sources. IEEE Trans Knowl Data Eng 12(5):779–801

    Article  Google Scholar 

  2. Chang YS, Lai HJ, Cheng HT (2009) Improving scientific data extraction using metadata classification. In: 2009 international workshop on grid computing, applications, and technologies, Kaohsiung, Taiwan, December 14–16, pp 669–673

  3. Chang YS, Liang KC, Cheng MC, Yuan SM (2004) Prototyping an integrated information gathering system on CORBA. J Syst Softw 72(2):281–294

    Article  Google Scholar 

  4. Chang YS, Cheng HT, Lai HJ (2009) Metadata miner assisted integrated information retrieval for Argo ocean data. In: IEEE international conference on systems, man, and cybernetics, San Antonio, Texas, USA, October 11–14, pp 2930–2935

  5. Huang YP, Hsu WT, Sandnes FE (2008) Association analysis of ocean salinity and temperature variations. In: 2008 Third international conference on convergence and hybrid information technology, vol 2, pp 680–685

  6. Huang YP, Kao LJ, Sandnes FE (2008) Efficient mining of salinity and temperature association rules from ARGO data. Expert Syst Appl 35(1–2):59–68

    Article  Google Scholar 

  7. Huang YP, Jau JS, Sandnes FE (2009) Temporal-spatial association analysis of ocean salinity and temperature variations. In: Proceedings of the 2nd international conference on interaction sciences: information technology, culture and human, Seoul, Korea, pp 1001–1006

  8. Jones MB, Berkley C, Bojilova J, Schildhauer M (2001) Managing scientific metadata. IEEE Internet Comput 5(5):59–68

    Article  Google Scholar 

  9. Korfhage RR (1997) Information storage and retrieval. Wiley, New York

    Google Scholar 

  10. Lagoze C, Krafft D, Cornwell T, Dushay N, Eckstrom D, Saylor J (2006) Metadata aggregation and “automated digital libraries”: a retrospective on the NSDL experience. In: 6th ACM/IEEE-CS joint conference on digital libraries, Chapel Hill, NC, USA June 11–15, pp 230–239

  11. Mena E, Illarramendi A, Kashyap V, Seth AP (2000) OBSERVER: an approach for query processing in global information systems based on interoperation across pre-existing ontologies. Distrib Parallel Databases 8(2):223–271

    Article  Google Scholar 

  12. Pereira F, Vetra A, Sikora T (2008) Multimedia retrieval and delivery: essential metadata challenges and standards. Proc IEEE 96(4):721–744

    Article  Google Scholar 

  13. Plale B, Gannon D, Alameda J, Wilhelmson B, Hampton S, Rossi A, Droegemeier K (2005) Active management of scientific data. IEEE Internet Comput 9(1):27–34

    Article  Google Scholar 

  14. Roemmich DH, Davis RE, Brechner Owens W, Molinari RL, Garzoli SL, Johnson GC (2009) The Argo project: global ocean observations for understanding and prediction of climate variability. http://www.coreocean.org/nopp/project-reports/reports/02roemmi.pdf

  15. Song J, Yoo S, Park CS, Choi DH, Lee YJ (2006) The design of a grid-enabled information integration system based on mediator/wrapper architectures. In: International conference on grid computing & application, pp 114–120

  16. Spertus E, Stein LA (2000) Squeal: a structured query language for the Web. Int J Comput Netw 33:95–103

    Article  Google Scholar 

  17. Teng J, Liu Z, Sun M, Sun C, Xu J (2006) Development of online Argo data service platform based on GIS. In: IEEE international conference on geoscience and remote sensing symposium, July 31–Aug 4, pp 1316–1318

  18. Thomson J, Adams D, Cowley PJ, Walker K (2003) Metadata’s role in a scientific achive. IEEE Comput 36(12):27–34

    Article  Google Scholar 

  19. Wohrer A, Brezany P, Tjoa AM (2005) Novel mediator architecture for grid information systems. Future Gener Comput Syst 21:107–114

    Article  Google Scholar 

  20. Yang R, Kafatos M, Wang XS (2002) Managing scientific metadata using XML. IEEE Internet Comput 6(4):52–59

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yue-Shan Chang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chang, YS., Cheng, HT. A scientific data extraction architecture using classified metadata. J Supercomput 60, 338–359 (2012). https://doi.org/10.1007/s11227-010-0462-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-010-0462-7

Keywords

Navigation