Abstract
We have proposed the EXTENT system for automated photograph annotation using image content and context analysis. A key component of EXTENT is a landmark recognition system called LandMarker. In this paper, we present the architecture of LandMarker. LandMarker analyzes the content of a query photograph and compares it against a database of sample landmark images to recognize any landmarks the photograph contains. We present an algorithm for comparing a query image with a sample image, and describe how context information may be used to assist landmark recognition. We also show how LandMarker scales to recognize a large number of landmarks. We have implemented a prototype of the system and present empirical results on a large dataset.
Cite this article
Qamra, A., Chang, E.Y. Scalable landmark recognition using EXTENT. Multimed Tools Appl 38, 187–208 (2008). https://doi.org/10.1007/s11042-007-0178-8