Abstract
Metadata extraction is an important prerequisite for remote sensing images management and sharing. Business users urgently need to extract metadata automatically and quickly. However, existing metadata extraction applications extract only one single type of image metadata. In this paper, we proposed a generic and extensible metadata extraction approach based on Programming by Demonstration (PbD). Data owners can specify the metadata items to be extracted and the extraction methods in an interactive and visual user interface. Such knowledge will be stored in the rule base in the form of rules. The advantage of this approach is that users do not need any programming knowledge, but they can handle new types of images themselves.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Seymore, K., McCallum, A., Rosenreid, R.: Learning hidden Markov model structure for information extraction. In: Califf, M.E., Freitag, D., Kushmerick, N., Muslea, I. (eds.) Proc. of the AAAI 1999 Workshop on Machine Learning for Information Extraction, pp. 37–42. MIT Press, Cambridge (1999)
McCallum, A., Freitag, D., Pereira, F.: Maximum entropy Markov models for information extraction and segmentation. In: Langley, P. (ed.) Proc. of the Int’l Conf. on Machine Learning (ICML 2000), pp. 591–598. Morgan Kaufmann Publishers, San Francisco (2000)
Chidlovskii, B.: Wrapping Web Information Providers by Transducer Induction. In: De Raedt, L., Flach, P.A. (eds.) ECML 2001. LNCS (LNAI), vol. 2167, pp. 61–72. Springer, Heidelberg (2001)
Hitchcock, S., Carr, L., et al.: Developing services for open eprint archives: Globalisation, integration and the impact of links. In: Proc. of the 5th ACM Conf. on Digital Libraries (ACMDL 2000), pp. 143–151. ACM Press, New York (2000)
Klink, S., Dengel, A., Kieninger, T.: Rule-Based document structure understanding with a fuzzy combination of layout and textual features. Int’l Journal on Document Analysis and Recognition 4(1), 18–26 (2001)
Kim, J., Le, D.X., Thoma, G.R.: Automated labeling algorithms for biomedical document images. In: Proc. of the 7th World Multiconference on Systemics, Cybernetics and Informatics, pp. 352–357. IIIS, Orlando (2003)
Zhao, P.X., Zhang, M., et al.: Automatic extraction of metadata from digital documents. Computer Science 30(10), 217–204 (2003) (in Chinese with English abstract)
Leshed, G., Haber, E.M., et al.: CoScripter: Automating & Sharing How-To Knowledge in the Enterprise. In: Proc. of the CHI 2008 Proceedings · End-Users Sharing and Tailoring Software, pp. 1719–1728. ACM Press, New York (2008)
Tuchinda, R., Szekely, P., Knoblock, C.A.: Building Mashups by Example. In: Proc. of the 13th International Conference on Intelligent User Interfaces (IUI 2008), Gran Canaria, Spain, pp. 139–148 (2008)
Wang, G.L., Yang, S.H., Han, Y.B.: Mashroom: End-User Mashup Programming Using Nested Tables. In: Proc. of the 18th International World Wide Web Conference (WWW), Madrid, Spain, pp. 861–870 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cui, B., Zhang, J. (2012). An Intelligent Metadata Extraction Approach Based on Programming by Demonstration. In: Wang, F.L., Lei, J., Gong, Z., Luo, X. (eds) Web Information Systems and Mining. WISM 2012. Lecture Notes in Computer Science, vol 7529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33469-6_84
Download citation
DOI: https://doi.org/10.1007/978-3-642-33469-6_84
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33468-9
Online ISBN: 978-3-642-33469-6
eBook Packages: Computer ScienceComputer Science (R0)