Abstract
Annotating digital imagery of historical materials for the purpose of computer-based retrieval is a labor-intensive task for many historians and digital collection managers. We have explored the possibilities of automated annotation and retrieval of images from collections of art and cultural images. In this paper, we introduce the application of the ALIP (Automatic Linguistic Indexing of Pictures) system, developed at Penn State, to the problem of machine-assisted annotation of images of historical materials. The ALIP system learns the expertise of a human annotator on the basis of a small collection of annotated representative images. The learned knowledge about the domain-specific concepts is stored as a dictionary of statistical models in a computer-based knowledge base. When an un-annotated image is presented to ALIP, the system computes the statistical likelihood of the image resembling each of the learned statistical models and the best concept is selected to annotate the image. Experimental results, obtained using the Emperor image collection of the Chinese Memory Net project, are reported and discussed. The system has been trained using subsets of images and metadata from the Emperor collection. Finally, we introduce an integration of wavelet-based annotation and wavelet-based progressive displaying of very high resolution copyright-protected images.
Similar content being viewed by others
References
Chen, C.-c., The First Emperor of China: interactive videodisc, the Voyager Company, 1991. Multimedia CD-ROM published in 1993. Result of PROJECT EMPEROR-I, supported by the US National Endowment for the Humanities
Chen, C.-c.: Chinese Memory Net (CMNet): A model for collaborative global digital library development. In: Chen, C.-c. (ed.) Global Digital Library in the New Millennium: Fertile Ground for Distributed Cross-Disciplinary Collaboration, pp. 21–32. Tsinghua University Press, Beijing (2001)
Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1349–1380 (2000)
Wang, J.Z.: Integrated Region-Based Image Retrieval. Kluwer Academic Publishers, Dordrecht (2001)
Wang, J.Z., Li, J., Wiederhold, G.: SIMPLIcity: Semantics-sensitive Integrated Matching for Picture LIbraries. IEEE Trans. Pattern Anal. Mach. Intell. 23(9), 947–963 (2001)
Wang, J.Z., Li, J., Chen, C.-c.: Interdisciplinary research to advance digital imagery indexing and retrieval technologies for Asian art and cultural heritages. In: Proceeding of the 4th International Workshop on Multimedia Information Retrieval, in conjunction with ACM Multimedia, Juan Les Pins, France, ACM, 6 pp. (2002)
Chen, C.-c., Wactlar, H., Wang, J.Z., Kiernan, K.: Digital imagery for significant cultural and historical materials: an emerging research field bridging people, culture, and technologies. Int. J. Digital Libr. Special Issue: Towards the New Generation Digital Libraries: Recommendations of the US-NSF/EU-DELOS Working Groups (2005)
Chen, Y., Li, J., Wang, J.Z.: Machine Learning and Statistical Modeling Approaches to Image Retrieval. Kluwer Academic Publishers, Dordrecht (2004)
Kriegman, D., Ponce, J.: On recognizing and positioning curved 3D objects from image contours. IEEE Trans. Pattern Anal. Mach. Intell. 12(12), 1127–1137 (1990)
Dickinson, S., Pentland, A., Rosenfeld, A.: 3-D shape recovery using distributed aspect matching. IEEE Trans. Pattern Anal. Mach. Intell. 14(2), 174–198 (1992)
Wactlar, H.D., Kanade, T., Smith, M.A., Stevens, S.M.: Intelligent access to digital video: Informedia project. IEEE Comp. 29(3), 46–52 (1996)
Chandrasekaran, S., Manjunath, B.S., Wang, Y.F., Winkler, J., Zhang, H.: An eigenspace update algorithm for image analysis. Graph. Models Image Process. 59(5), 321–332 (1997)
Chu, W.W., Hsu, C.C., Cardenas, A.F., Taira, R.K.: A knowledge-based image retrieval with spatial and temporal constructs. IEEE Trans. Knowledge Data Eng. 10(6), 872–888 (1998)
Sheikholeslami, G., Chatterjee, S., Zhang, A.: WaveCluster: A multi-resolution clustering approach for very large spatial databases. In: Proceeding of the VLDB Conference, New York City, pp. 428–439 (1998)
Wang, J.Z., Wiederhold, G., Firschein, O., Sha, X.W.: Content-based image indexing and searching using Daubechies' wavelets. Int. J. Digital Libr.(IJODL) 1(4), 311–328 (1998)
Vailaya, A., Figueiredo, M., Jain, A., Zhang, H.J.: Content-based hierarchical classification of vacation images. In: Proceeding of the IEEE International Conference on Multimedia Computing and Systems, ICMCS, Amsterdam, The Netherlands (1999)
Chen, Y., Wang, J.Z.: A region-based fuzzy feature matching approach to content-based image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 24(9), 1252–1267 (2002)
Barnard, K., Duygulu, P., de Freitas, N., Forsyth, D.A., Blei, D.M., Jordan, M.I.: Matching words and pictures. J. Mach. Learn. Res. 3, 1107–1135 (2003)
Li, J., Wang, J.Z.: Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans. Pattern Anal. Mach. Intell. 25(9), 1075–1088 (2003)
Li, J., Gray, R.M.: Image Segmentation and Compression Using Hidden Markov Models. Kluwer Academic Publishers, Dordrecht (2000)
Cox, I.J., Miller, M.L., Bloom, J.A.: Digital Watermarking. Morgan Kaufmann, San Francisco, CA (2002)
Li, J., Joshi, D., Wang, J.Z.: Stochastic modeling of volume images with a 3-D hidden Markov model. In: Proceeding of the IEEE International Conference on Image Processing, Singapore, IEEE, pp. 2359–2362 (2004)
Author information
Authors and Affiliations
Corresponding author
Additional information
A preliminary version of this work has been presented at the DELOS-NSF Workshop on Multimedia in Digital Libraries, Crete, Greece, June 2003. The work was completed when Kurt Grieb and Ya Zhang were students of The Pennsylvania State University. James Z. Wang and Jia Li are also affiliated with Department of Computer Science and Engineering, The Pennsylvania State University. Yixin Chen is also with the Research Institute for Children, Children's Hospital, New Orleans.
Rights and permissions
About this article
Cite this article
Wang, J.Z., Grieb, K., Zhang, Y. et al. Machine annotation and retrieval for digital imagery of historical materials. Int J Digit Libr 6, 18–29 (2006). https://doi.org/10.1007/s00799-005-0121-4
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00799-005-0121-4