Abstract
We are interested in processing video data for the purpose of solving a variety of problems in video search, analysis, indexing, browsing and compression. Instead of concentrating on a particular problem, in this paper we present a framework for developing video applications. Our basic thesis is that video data can be represented at a higher level of abstraction as a string generated by a grammar, termed motion picture grammar. The rules of that grammar relate different spatiotemporal representations of the video content and, in particular, representations of action.
This is a preview of subscription content, log in via an institution.
Preview
Unable to display preview. Download preview PDF.
References
Arman, F., Hsu, A., Chiu, M.: Feature management for large video databases. SPIE 1908, Storage and Retrieval for Image and Video Databases (1993) 2–12
Liu, H. C., Zick, G. L.: Scene decomposition of mpeg compressed video. SPIE 2419, Digital Video Compression: Algorithms and Technologies (1995) 26–37
Otsuji, K., Tonomura, Y., Ohba, Y.: Video browsing using brightness data. SPIE 1606, Visual Communications and Image Processing (1991) 980–989
Sethi, I. K., Patel, N.: A statistical approach to scene change detection. SPIE 2420, Storage and Retrieval for Image and Video Databases III (1995) 329–338
Shahraray, B.: Scene change detection and content-based sampling of video sequences. SPIE 2419, Digital Video Compression: Algorithms and Technologies (1995) 2–13
Swain, M. J., Ballard, D. H.: Color indexing. International Journal of Computer Vision 7 (1991) 11–32
Zhang, H., Kankanhalli, A., Smoliar, S. W.: Automatic partitioning of full motion video. Multimedia Systems 1 (1993) 10–28
Zhang, H. J., Low, C. Y., Smoliar, S. W.: Video parsing and browsing using compressed data. Multimedia Tools and Applications 1 (1995) 89–111
Mann, S., Picard, R. W.: Virtual bellows: Constructing high quality still from video. International Conference on Image Processing, volume 1 (1994) 363–367
Sawhney, H. S., Ayer, S., Gorkani, M.: Model based 2D & 3D dominant motion estimation for mosaicking and video representation. Technical report, IBM Almaden Research Laboratory (December 1994)
Szeliski, R.: Image mosaicking for telereality applications. Technical Report CRL 94/2, DEC Cambridge Research Laboratory (1994)
Teodosio, L., Bender, W.: Salient video stills: Content and context preserved. Proceedings, Multimedia '93, ACM (1993) 39–46
Tonomura, Y., Akutsu, A., Otsuji, K., Sadakata, T.: Video map and video space icon: Tools for anatomizing video content. INTERCHI '93 Conference on Human Factors in Computing Systems, ACM (1993) 131–136
Yeung, M. M., Yeo, B. L.: Data modelling of videos with temporal events and its applications. Technical Report TR-EE-ISS-YM9603, Princeton University (April 1996)
Yow, K. D., Yeo, B. L., Yeung, M. M., Liu, B.: Analysis and presentation of soccer highlights from digital video. Second Asian Conference on Computer Vision, volume 2 (1995) 499–503
Hibino, S., Steiner, E. A. R.: A visual query language for identifying temporal trends in video data. International Workshop on Multi-media Database Management Systems (1995) 74–81
Yeung, M., Yeo, B.: Time-constrained clustering for segmentation of video into story units. ICPR '96, volume 6 (August 1996) 375–380
Swanberg, D., Shu, C. F., Jain, R.: Knowledge-guided parsing in video databases. SPIE 1908, Storage and Retrieval for Image and Video Databases (1993) 13–25
Zhang, H. J., Gong, Y. H., Smoliar, S. W., Yan, S. Y.: Automatic parsing of news video. International Conference on Multimedia Computing and Systems (1994) 45–54
Dierckx, P.: Curve and Surface Fitting with Splines. Clarendon: Oxford (1993)
Tiller, W.: Rational b-splines for curve and surface representation. IEEE CGA 3 (1983) 61–69
Faugeras, O. D.: Three-Dimensional Computer Vision. Cambridge, MA: MIT Press (1992)
Shulman, D., Aloimonos, J. Y.: (non-)rigid motion interpretation: a regularized approach. Proc. Royal Society, London B 233 (1988) 217–234
Fu, K. S.: Syntactic Pattern Recognition and Applications. Englewood Cliffs, NJ: Prentice Hall (1982)
Lee, K. H., Eom, K. B., Kashyap, R. L.: Character recognition based on attribute-dependent programmed grammar. IEEE Transactions on Pattern Analysis and Machine Intelligence 14 (1992)
Zhao, M.: Two-dimensional extended attribute grammar method for the recognition of hand-printed chinese characters. Pattern Recognition 23 1990
Charniak, E.: Statistical Language Learning. MIT Press (1993)
Huang, X. D., Ariki, Y., Jack, M. A.: Hidden Markov models for Speech Recognition. Edinburgh University Press (1990)
Grenander, U.: Elements of pattern theory. Johns Hopkins, Baltimore (1996)
Abney, S.: Stochastic attribute valued grammars. Currently working at AT & T Labs Research in Florsham Park, NJ
Keller, B., Lutz, R.: Learning stochastic context-free grammars from corpora using a genetic algorithm. ICANNGA (1997)
Johnson, M.: Attribute valued logic and the theory of grammar. CSLI Lecture Notes, volume 16. CSLI (1988)
Torenvliet, L., Trautwein, M.: A note on the complexity of restricted attribute valued grammars. Computational Linguistics in the Netherlands, Meeting at Twente (1995)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bolle, R., Aloimonos, Y., Fermüller, C. (1997). Toward motion picture grammars. In: Chin, R., Pong, TC. (eds) Computer Vision — ACCV'98. ACCV 1998. Lecture Notes in Computer Science, vol 1352. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63931-4_228
Download citation
DOI: https://doi.org/10.1007/3-540-63931-4_228
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-63931-2
Online ISBN: 978-3-540-69670-4
eBook Packages: Springer Book Archive