Skip to main content
Log in

Machine annotation and retrieval for digital imagery of historical materials

  • Regular Paper
  • Published:
International Journal on Digital Libraries Aims and scope Submit manuscript

Abstract

Annotating digital imagery of historical materials for the purpose of computer-based retrieval is a labor-intensive task for many historians and digital collection managers. We have explored the possibilities of automated annotation and retrieval of images from collections of art and cultural images. In this paper, we introduce the application of the ALIP (Automatic Linguistic Indexing of Pictures) system, developed at Penn State, to the problem of machine-assisted annotation of images of historical materials. The ALIP system learns the expertise of a human annotator on the basis of a small collection of annotated representative images. The learned knowledge about the domain-specific concepts is stored as a dictionary of statistical models in a computer-based knowledge base. When an un-annotated image is presented to ALIP, the system computes the statistical likelihood of the image resembling each of the learned statistical models and the best concept is selected to annotate the image. Experimental results, obtained using the Emperor image collection of the Chinese Memory Net project, are reported and discussed. The system has been trained using subsets of images and metadata from the Emperor collection. Finally, we introduce an integration of wavelet-based annotation and wavelet-based progressive displaying of very high resolution copyright-protected images.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Chen, C.-c., The First Emperor of China: interactive videodisc, the Voyager Company, 1991. Multimedia CD-ROM published in 1993. Result of PROJECT EMPEROR-I, supported by the US National Endowment for the Humanities

  2. Chen, C.-c.: Chinese Memory Net (CMNet): A model for collaborative global digital library development. In: Chen, C.-c. (ed.) Global Digital Library in the New Millennium: Fertile Ground for Distributed Cross-Disciplinary Collaboration, pp. 21–32. Tsinghua University Press, Beijing (2001)

  3. Smeulders, A.W.M., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1349–1380 (2000)

    Article  Google Scholar 

  4. Wang, J.Z.: Integrated Region-Based Image Retrieval. Kluwer Academic Publishers, Dordrecht (2001)

  5. Wang, J.Z., Li, J., Wiederhold, G.: SIMPLIcity: Semantics-sensitive Integrated Matching for Picture LIbraries. IEEE Trans. Pattern Anal. Mach. Intell. 23(9), 947–963 (2001)

    Article  Google Scholar 

  6. Wang, J.Z., Li, J., Chen, C.-c.: Interdisciplinary research to advance digital imagery indexing and retrieval technologies for Asian art and cultural heritages. In: Proceeding of the 4th International Workshop on Multimedia Information Retrieval, in conjunction with ACM Multimedia, Juan Les Pins, France, ACM, 6 pp. (2002)

  7. Chen, C.-c., Wactlar, H., Wang, J.Z., Kiernan, K.: Digital imagery for significant cultural and historical materials: an emerging research field bridging people, culture, and technologies. Int. J. Digital Libr. Special Issue: Towards the New Generation Digital Libraries: Recommendations of the US-NSF/EU-DELOS Working Groups (2005)

  8. Chen, Y., Li, J., Wang, J.Z.: Machine Learning and Statistical Modeling Approaches to Image Retrieval. Kluwer Academic Publishers, Dordrecht (2004)

  9. Kriegman, D., Ponce, J.: On recognizing and positioning curved 3D objects from image contours. IEEE Trans. Pattern Anal. Mach. Intell. 12(12), 1127–1137 (1990)

    Article  Google Scholar 

  10. Dickinson, S., Pentland, A., Rosenfeld, A.: 3-D shape recovery using distributed aspect matching. IEEE Trans. Pattern Anal. Mach. Intell. 14(2), 174–198 (1992)

    Article  Google Scholar 

  11. Wactlar, H.D., Kanade, T., Smith, M.A., Stevens, S.M.: Intelligent access to digital video: Informedia project. IEEE Comp. 29(3), 46–52 (1996)

    Google Scholar 

  12. Chandrasekaran, S., Manjunath, B.S., Wang, Y.F., Winkler, J., Zhang, H.: An eigenspace update algorithm for image analysis. Graph. Models Image Process. 59(5), 321–332 (1997)

    Article  Google Scholar 

  13. Chu, W.W., Hsu, C.C., Cardenas, A.F., Taira, R.K.: A knowledge-based image retrieval with spatial and temporal constructs. IEEE Trans. Knowledge Data Eng. 10(6), 872–888 (1998)

    Article  Google Scholar 

  14. Sheikholeslami, G., Chatterjee, S., Zhang, A.: WaveCluster: A multi-resolution clustering approach for very large spatial databases. In: Proceeding of the VLDB Conference, New York City, pp. 428–439 (1998)

  15. Wang, J.Z., Wiederhold, G., Firschein, O., Sha, X.W.: Content-based image indexing and searching using Daubechies' wavelets. Int. J. Digital Libr.(IJODL) 1(4), 311–328 (1998)

    Google Scholar 

  16. Vailaya, A., Figueiredo, M., Jain, A., Zhang, H.J.: Content-based hierarchical classification of vacation images. In: Proceeding of the IEEE International Conference on Multimedia Computing and Systems, ICMCS, Amsterdam, The Netherlands (1999)

  17. Chen, Y., Wang, J.Z.: A region-based fuzzy feature matching approach to content-based image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 24(9), 1252–1267 (2002)

    Google Scholar 

  18. Barnard, K., Duygulu, P., de Freitas, N., Forsyth, D.A., Blei, D.M., Jordan, M.I.: Matching words and pictures. J. Mach. Learn. Res. 3, 1107–1135 (2003)

    Google Scholar 

  19. Li, J., Wang, J.Z.: Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans. Pattern Anal. Mach. Intell. 25(9), 1075–1088 (2003)

    Google Scholar 

  20. Li, J., Gray, R.M.: Image Segmentation and Compression Using Hidden Markov Models. Kluwer Academic Publishers, Dordrecht (2000)

    Google Scholar 

  21. Cox, I.J., Miller, M.L., Bloom, J.A.: Digital Watermarking. Morgan Kaufmann, San Francisco, CA (2002)

    Google Scholar 

  22. Li, J., Joshi, D., Wang, J.Z.: Stochastic modeling of volume images with a 3-D hidden Markov model. In: Proceeding of the IEEE International Conference on Image Processing, Singapore, IEEE, pp. 2359–2362 (2004)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to James Z. Wang.

Additional information

A preliminary version of this work has been presented at the DELOS-NSF Workshop on Multimedia in Digital Libraries, Crete, Greece, June 2003. The work was completed when Kurt Grieb and Ya Zhang were students of The Pennsylvania State University. James Z. Wang and Jia Li are also affiliated with Department of Computer Science and Engineering, The Pennsylvania State University. Yixin Chen is also with the Research Institute for Children, Children's Hospital, New Orleans.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Wang, J.Z., Grieb, K., Zhang, Y. et al. Machine annotation and retrieval for digital imagery of historical materials. Int J Digit Libr 6, 18–29 (2006). https://doi.org/10.1007/s00799-005-0121-4

Download citation

  • Received:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00799-005-0121-4

Keywords

Navigation