ABSTRACT
Increasingly geographic properties are being associated with videos, especially those captured from mobile cameras. The meta data from camera-attached sensors can be used to model the coverage area of the scene as a spatial object such that videos can be organized, indexed and searched based on their field of views (FOV). The most accurate representation of an FOV is through the geometric shape of a circular sector. However, spatial search and indexing methods are traditionally optimized for rectilinear shapes because of their simplicity. Established methods often use an approximation shape, such as a minimum bounding rectangle (MBR), to efficiently filter a large archive for possibly matching candidates. A second, refinement step is then applied to perform the time-consuming, precise matching function. MBR estimation has been successful for general spatial overlap queries, however it provides limited flexibility for georeferenced video search. In this study we propose a novel vector-based model for FOV estimation which provides a more versatile basis for georeferenced video search while providing competitive performance for the filter step. We demonstrate how the vector model can provide a unified method to perform traditional overlap queries while also enabling searches that, for example, concentrate on the vicinity of the camera's position or harness its view direction. To the best of our knowledge no comparable technique exists today.
- Flickr. http://www.flickr.com.Google Scholar
- Woophy. http://www.woophy.com.Google Scholar
- Sakire Arslan Ay, Roger Zimmermann, and Seon Ho Kim. Viewable Scene Modeling for Geospatial Video Search. In ACM International Conference on Multimedia, pages 309---18, 2008. Google ScholarDigital Library
- N. Beckmann, H.-P. Kriegel, R. Schneider, and B. Seeger. The R*-Tree: An Efficient and Robust Access Method for Points and Rectangles. In ACM International Conference on Management of Data, 1990. Google ScholarDigital Library
- T. Brinkhoff, H.-P. Kriegel, R. Schneider, and B. Seeger. Multi-step Processing of Spatial Joins. In ACM International Conference on Management of Data, 1994. Google ScholarDigital Library
- Boris Epshtein, Eyal Ofek, Yonatan Wexler, and Pusheng Zhang. Hierarchical Photo Organization Using Geo-Relevance. In ACM Intl. Symposium on Advances in Geographic Information Systems, pages 1--7, 2007. Google ScholarDigital Library
- Shantanu Gautam, Gabi Sarkis, Edwin Tjandranegara, Evan Zelkowitz, Yung-Hsiang Lu, and Edward J. Delp. Multimedia for Mobile Environment: Image Enhanced Navigation. volume SPIE 6073, pages 1--11, 2006.Google Scholar
- Clarence H. Graham, Neil R. Bartlett, John Lott Brown, Yun Hsia, Conrad C. Mueller, and Lorrin A. Riggs. Vision and Visual Perception. John Wiley & Sons, Inc., 1965.Google Scholar
- Rieko Kadobayashi and Katsumi Tanaka. 3D Viewpoint-Based Photo Search and Information Browsing. In 28th Intl. ACM SIGIR Conference on Research and Development in Information Retrieval, pages 621--622, 2005. Google ScholarDigital Library
- Lyndon S. Kennedy and Mor Naaman. Generating Diverse and Representative Image Search Results for Landmarks. In International Conference on the World Wide Web, pages 297--306, 2008. Google ScholarDigital Library
- Xiaotao Liu, Mark Corner, and Prashant Shenoy. SEVA: Sensor-Enhanced Video Annotation. In ACM International Conference on Multimedia, pages 618--627, 2005. Google ScholarDigital Library
- Mor Naaman, Yee Jiun Song, Andreas Paepcke, and Hector Garcia-Molina. Automatic Organization for Digital Photographs with Geographic Coordinates. In 4th ACM/IEEE-CS Joint Conference on Digital Libraries, pages 53--62, 2004. Google ScholarDigital Library
- A. Orenstein. Spatial Query Processing in an Object-Oriented Database System. In ACM International Conference on Management of Data, pages 326--336, 1986. Google ScholarDigital Library
- A. Pigeau and M. Gelgon. Building and Tracking Hierarchical Geographical & Temporal Partitions for Image Collection Management on Mobile Devices. In ACM International Conference on Multimedia, pages 141--150, 2005. Google ScholarDigital Library
- Kerry Rodden and Kenneth R. Wood. How do People Manage their Digital Photographs? In SIGCHI Conference on Human Factors in Computing Systems, pages 409--416, 2003. Google ScholarDigital Library
- Ian Simon and Steven M. Seitz. Scene Segmentation Using the Wisdom of Crowds. In Proc. ECCV, pages 541--553, 2008. Google ScholarDigital Library
- Carlo Torniai, Steve Battle, and Steve Cayzer. Sharing, Discovering and Browsing Geotagged Pictures on the Web. Springer, 2006.Google Scholar
- Kentaro Toyama, Ron Logan, and Asta Roseway. Geographic Location Tags on Digital Images. In ACM International Conference on Multimedia, pages 156--166, 2003. Google ScholarDigital Library
Index Terms
- Vector model in support of versatile georeferenced video search
Recommendations
Viewable scene modeling for geospatial video search
MM '08: Proceedings of the 16th ACM international conference on MultimediaVideo sensors are becoming ubiquitous and the volume of captured video material is very large. Therefore, tools for searching video databases are indispensable. Current techniques that extract features purely based on the visual signals of a video are ...
GRVS: a georeferenced video search engine
MM '09: Proceedings of the 17th ACM international conference on MultimediaAn increasing number of recorded videos are being tagged with geographic properties of the camera scenes. This meta-data is of significant use for storing, indexing and searching large collections of videos. By considering video related meta-information,...
GeoTree: using spatial information for georeferenced video search
With the rapid popularization of video recording devices, more multimedia content is available to the public. However, current video search engines rely on textual data such as video titles, annotations, and text around the video. Video recording ...
Comments