Toward motion picture grammars

Bolle, Ruud; Aloimonos, Yiannis; Fermüller, Cornelia

doi:10.1007/3-540-63931-4_228

Session S1A: Recent Advances in Computer Vision
Conference paper
First Online: 01 January 2005

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1352)

Abstract

We are interested in processing video data to solve a variety of problems in video search, analysis, indexing, browsing and compression. Instead of concentrating on a particular problem, in this paper we present a framework for developing video applications. Our basic thesis is that video data can be represented at a higher level of abstraction as a string generated by a grammar, which we term a motion picture grammar. The rules of that grammar relate different spatiotemporal representations of the video content and, in particular, representations of action.
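As a rough illustration of the idea that a video can be treated as a string generated by a grammar, the sketch below checks whether a sequence of shot labels is derivable from a small context-free grammar. Everything in it is an assumption made for illustration: the nonterminals (`VIDEO`, `SCENE`, `SHOT`), the terminal labels (`static`, `pan`, `zoom`, `cut`) and the function name `parses` are hypothetical and are not taken from the paper, whose actual rules relate spatiotemporal representations and are not reproduced here.

```python
from functools import lru_cache

# Hypothetical toy grammar (not the paper's). Keys are nonterminals; any
# symbol that is not a key is a terminal label that some upstream shot or
# action detector is assumed to emit.
GRAMMAR = {
    "VIDEO": [["SCENE"], ["SCENE", "cut", "VIDEO"]],
    "SCENE": [["SHOT"], ["SHOT", "SCENE"]],
    "SHOT": [["static"], ["pan"], ["zoom"]],
}


def parses(tokens, start="VIDEO"):
    """Return True if `tokens` can be generated from `start` by GRAMMAR."""
    tokens = tuple(tokens)

    @lru_cache(maxsize=None)
    def derives(symbol, i, j):
        # True if `symbol` can generate tokens[i:j].
        if symbol not in GRAMMAR:  # terminal: must match exactly one token
            return j - i == 1 and tokens[i] == symbol
        return any(match(tuple(rhs), i, j) for rhs in GRAMMAR[symbol])

    @lru_cache(maxsize=None)
    def match(rhs, i, j):
        # True if the symbol sequence `rhs` can generate tokens[i:j].
        if not rhs:
            return i == j
        head, rest = rhs[0], rhs[1:]
        # The grammar has no empty productions, so every symbol covers at
        # least one token; leave room for the remaining symbols.
        return any(
            derives(head, i, k) and match(rest, k, j)
            for k in range(i + 1, j - len(rest) + 1)
        )

    return derives(start, 0, len(tokens))
```

For example, `parses(["pan", "zoom", "cut", "static"])` accepts a two-shot scene followed by a cut and a one-shot scene, while `parses(["cut", "pan"])` is rejected because a video cannot begin with a cut under these toy rules.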

Author information

Authors and Affiliations

Exploratory Computer Vision Group, IBM T.J. Watson Research Center, 10598, Yorktown Heights, NY, USA
Ruud Bolle
Computer Vision Laboratory, Center for Automation Research, Institute for Advanced Computer Studies, Computer Science Department, University of Maryland, 20742-3275, College Park, MD, USA
Yiannis Aloimonos & Cornelia Fermüller

Editor information

Roland Chin, Ting-Chuen Pong

About this paper

Cite this paper

Bolle, R., Aloimonos, Y., Fermüller, C. (1997). Toward motion picture grammars. In: Chin, R., Pong, TC. (eds) Computer Vision — ACCV'98. ACCV 1998. Lecture Notes in Computer Science, vol 1352. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63931-4_228

DOI: https://doi.org/10.1007/3-540-63931-4_228
Published: 29 July 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63931-2
Online ISBN: 978-3-540-69670-4
eBook Packages: Springer Book Archive
