ABSTRACT
The interpretation of natural scenes, generally so obvious and effortless for humans, still remains a challenge in computer vision. To allow the search of image-based documents in digital libraries, we propose to design classifiers able to annotate images with keywords. First, we propose an image representation appropriate for scene description. Images are segmented into regions, and then indexed according to the presence of given region types. Second, we propound a classification scheme designed to separate images in the descriptor space. This is achieved by combining feature selection and kernel-method-based classification
- Dublin Core Metadata Initiative. http://dublincore.org.Google Scholar
- G. Amato, F. Debole, F. Rabitti, and P. Zezula. YAPI: Yet another path index for XML searching. In European Conference on Digital Libraries, Trondheim, Norway, August 2003.Google ScholarCross Ref
- G. Amato, C. Gennaro, and P. Savino. Indexing and retrieving documentary films: managing metadata in the ECHO system. In 4th Intl. Workshop on Multimedia Information Retrieval, in conjunction with ACM Multimedia, Juan-les-Pins, France, December 2002.Google Scholar
- G. Amato, C. Gennaro, P. Savino, and F. Rabitti. Milos: a multimedia content management system for digital library applications. In European Conference on Digital Libraries, Bath, U.K., september 2004.Google ScholarCross Ref
- J. Anlauf and M. Biehl. The adatron: an adaptive perceptron algorithm. Neurophysics Letters, 10:687--692, 1989.Google ScholarCross Ref
- R. Battiti. Using mutual information for selecting features in supervised neural network learning. Neural Networks, 5(4):537--550, 1994.Google ScholarDigital Library
- S. Belongie, C. Carson, H. Greenspan, and J. Malik. Recognition of images in large databases using a learning framework. Technical report, University of California at Berkeley, 1997. Google ScholarDigital Library
- J. C. Bezdek. Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New-York, N.Y., 1981. Google ScholarDigital Library
- C. Bühm, S. Berchtold, and D. Keim. Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases. ACM Computing Surveys, 33(3):322--373, September 2001. Google ScholarDigital Library
- D. Castelli and P. Pagano. Opendlib: A digital library service system. In M. Agosti and C. Thanos, editors, European Conference on Digital Libraries, Rome, Italy, September 2002. Google ScholarDigital Library
- O. Chapelle, P. Haffner, and V. Vapnik. Svms for histogram-based image classification. IEEE Transactions on Neural Networks, 10:1055--1065, 1999. Google ScholarDigital Library
- N. Christianini and J. Shawe-Taylor. An Introduction to Support Vector Machines. Cambridge University Press, 2000. Google ScholarDigital Library
- D. Comaniciu and P. Meer. Robust analysis of feature spaces: Color image segmentation. In Conference on Computer Vision and Pattern Recognition, pages 750--755, San Juan, Porto Rico, June 1997. Google ScholarDigital Library
- W. W. W. Consortium. XQuery 1.0: An XML query language. W3C Working Draft, November 2002. http://www.w3.org/TR/xquery.Google Scholar
- A. Del Bimbo. Visual information retrieval. Morgan Kaufmann, San Francisco, CA, 1999. Google ScholarDigital Library
- P. Duygulu, K. Barnard, J. de Freitas, and D. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In European Conference on Computer Vision, volume 4, pages 97--112, Copenhagen, Denmark, May 2002. Google ScholarDigital Library
- T.-T. Friess, N. Christianini, and C. Campbell. The kernel-adatron algorithm: a fast and simple learning procedure for support vector machines. In International Conference on Machine Learning, Madison, Wisconsin, July 1998. Google ScholarDigital Library
- R. M. Gray. Entropy and Information Theory. Springer-Verlag, New York, N.Y., 1990. Google ScholarDigital Library
- I. Guyon and A. Elisseff. An introduction to variable and feature selection. Journal of Machine Learning Research, 3:1157--1182, 2003. Google ScholarDigital Library
- B. Le Saux and N. Boujemaa. Unsupervised robust clustering for image database categorization. In International Conference on Pattern Recognition, Quebec, Canada, August 2002. Google ScholarDigital Library
- M. Lew. Next generation web searches for visual content. IEEE Computer, pages 46--53, 2000. Google ScholarDigital Library
- M. Lew. Principles of Visual Information Retrieval. Springer-Verlag, London, U.K., 2001. Google ScholarDigital Library
- J. Li and J. Z. Wang. Automatic linguistic indexing of pictures by a statistic modeling approach. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(9):1075--1088, 2003. Google ScholarDigital Library
- B. Manjunath, P. Salembier, and T. Sikora. Introduction to MPEG-7: Multimedia Content Description Interface. John Wiley & Sons, New-York, N.Y., 2002. Google ScholarDigital Library
- T. Minka and R. Picard. Interactive learning using a society of models. Pattern Recognition, 30(4):565--581, 1997.Google Scholar
- V. Vapnik. The Nature of Statistical Learning Theory. Springer Verlag, New-York, N.Y., 1995. Google ScholarDigital Library
- P. Zezula, G. Amato, F. Debole, and F. Rabitti. Tree signatures for xml querying and navigation. In Database and XML Technologies, First International XML Database Symposium, pages 149--163, Berlin, Germany, September 2003.Google Scholar
Index Terms
- Image recognition for digital libraries
Recommendations
Integrated image representation based natural scene classification
Natural scene classification (NSC) is a challenging pattern classification problem. As one of state-of-the-art techniques, the bag-of-feature (BOF) model has received extensive considerations in characterizing the image. To boost the flexibility during ...
Early versus Late Dimensionality Reduction of Bag-of-Words Feature Representation for Image Classification
ICBRA '17: Proceedings of the 4th International Conference on Bioinformatics Research and ApplicationsExtracting the bag-of-words (BoW) feature from images has been widely used for image classification. In general, some local keypoints are first of all detected from each image and the keypoint descriptor, such as scale-invariant feature transform (SIFT),...
SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture LIbraries
The need for efficient content-based image retrieval has increased tremendously in many application areas such as biomedicine, military, commerce, education, and Web image classification and searching. We present here SIMPLIcity (Semantics-sensitive ...
Comments