Skip to main content

Toward motion picture grammars

  • Session S1A: Recent Advances in Computer Vision
  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1352))

Abstract

We are interested in processing video data for the purpose of solving a variety of problems in video search, analysis, indexing, browsing and compression. Instead of concentrating on a particular problem, in this paper we present a framework for developing video applications. Our basic thesis is that video data can be represented at a higher level of abstraction as a string generated by a grammar, termed motion picture grammar. The rules of that grammar relate different spatiotemporal representations of the video content and, in particular, representations of action.

This is a preview of subscription content, log in via an institution.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Arman, F., Hsu, A., Chiu, M.: Feature management for large video databases. SPIE 1908, Storage and Retrieval for Image and Video Databases (1993) 2–12

    Google Scholar 

  2. Liu, H. C., Zick, G. L.: Scene decomposition of mpeg compressed video. SPIE 2419, Digital Video Compression: Algorithms and Technologies (1995) 26–37

    Google Scholar 

  3. Otsuji, K., Tonomura, Y., Ohba, Y.: Video browsing using brightness data. SPIE 1606, Visual Communications and Image Processing (1991) 980–989

    Google Scholar 

  4. Sethi, I. K., Patel, N.: A statistical approach to scene change detection. SPIE 2420, Storage and Retrieval for Image and Video Databases III (1995) 329–338

    Google Scholar 

  5. Shahraray, B.: Scene change detection and content-based sampling of video sequences. SPIE 2419, Digital Video Compression: Algorithms and Technologies (1995) 2–13

    Google Scholar 

  6. Swain, M. J., Ballard, D. H.: Color indexing. International Journal of Computer Vision 7 (1991) 11–32

    Google Scholar 

  7. Zhang, H., Kankanhalli, A., Smoliar, S. W.: Automatic partitioning of full motion video. Multimedia Systems 1 (1993) 10–28

    Google Scholar 

  8. Zhang, H. J., Low, C. Y., Smoliar, S. W.: Video parsing and browsing using compressed data. Multimedia Tools and Applications 1 (1995) 89–111

    Google Scholar 

  9. Mann, S., Picard, R. W.: Virtual bellows: Constructing high quality still from video. International Conference on Image Processing, volume 1 (1994) 363–367

    Google Scholar 

  10. Sawhney, H. S., Ayer, S., Gorkani, M.: Model based 2D & 3D dominant motion estimation for mosaicking and video representation. Technical report, IBM Almaden Research Laboratory (December 1994)

    Google Scholar 

  11. Szeliski, R.: Image mosaicking for telereality applications. Technical Report CRL 94/2, DEC Cambridge Research Laboratory (1994)

    Google Scholar 

  12. Teodosio, L., Bender, W.: Salient video stills: Content and context preserved. Proceedings, Multimedia '93, ACM (1993) 39–46

    Google Scholar 

  13. Tonomura, Y., Akutsu, A., Otsuji, K., Sadakata, T.: Video map and video space icon: Tools for anatomizing video content. INTERCHI '93 Conference on Human Factors in Computing Systems, ACM (1993) 131–136

    Google Scholar 

  14. Yeung, M. M., Yeo, B. L.: Data modelling of videos with temporal events and its applications. Technical Report TR-EE-ISS-YM9603, Princeton University (April 1996)

    Google Scholar 

  15. Yow, K. D., Yeo, B. L., Yeung, M. M., Liu, B.: Analysis and presentation of soccer highlights from digital video. Second Asian Conference on Computer Vision, volume 2 (1995) 499–503

    Google Scholar 

  16. Hibino, S., Steiner, E. A. R.: A visual query language for identifying temporal trends in video data. International Workshop on Multi-media Database Management Systems (1995) 74–81

    Google Scholar 

  17. Yeung, M., Yeo, B.: Time-constrained clustering for segmentation of video into story units. ICPR '96, volume 6 (August 1996) 375–380

    Google Scholar 

  18. Swanberg, D., Shu, C. F., Jain, R.: Knowledge-guided parsing in video databases. SPIE 1908, Storage and Retrieval for Image and Video Databases (1993) 13–25

    Google Scholar 

  19. Zhang, H. J., Gong, Y. H., Smoliar, S. W., Yan, S. Y.: Automatic parsing of news video. International Conference on Multimedia Computing and Systems (1994) 45–54

    Google Scholar 

  20. Dierckx, P.: Curve and Surface Fitting with Splines. Clarendon: Oxford (1993)

    Google Scholar 

  21. Tiller, W.: Rational b-splines for curve and surface representation. IEEE CGA 3 (1983) 61–69

    Google Scholar 

  22. Faugeras, O. D.: Three-Dimensional Computer Vision. Cambridge, MA: MIT Press (1992)

    Google Scholar 

  23. Shulman, D., Aloimonos, J. Y.: (non-)rigid motion interpretation: a regularized approach. Proc. Royal Society, London B 233 (1988) 217–234

    Google Scholar 

  24. Fu, K. S.: Syntactic Pattern Recognition and Applications. Englewood Cliffs, NJ: Prentice Hall (1982)

    Google Scholar 

  25. Lee, K. H., Eom, K. B., Kashyap, R. L.: Character recognition based on attribute-dependent programmed grammar. IEEE Transactions on Pattern Analysis and Machine Intelligence 14 (1992)

    Google Scholar 

  26. Zhao, M.: Two-dimensional extended attribute grammar method for the recognition of hand-printed chinese characters. Pattern Recognition 23 1990

    Google Scholar 

  27. Charniak, E.: Statistical Language Learning. MIT Press (1993)

    Google Scholar 

  28. Huang, X. D., Ariki, Y., Jack, M. A.: Hidden Markov models for Speech Recognition. Edinburgh University Press (1990)

    Google Scholar 

  29. Grenander, U.: Elements of pattern theory. Johns Hopkins, Baltimore (1996)

    Google Scholar 

  30. Abney, S.: Stochastic attribute valued grammars. Currently working at AT & T Labs Research in Florsham Park, NJ

    Google Scholar 

  31. Keller, B., Lutz, R.: Learning stochastic context-free grammars from corpora using a genetic algorithm. ICANNGA (1997)

    Google Scholar 

  32. Johnson, M.: Attribute valued logic and the theory of grammar. CSLI Lecture Notes, volume 16. CSLI (1988)

    Google Scholar 

  33. Torenvliet, L., Trautwein, M.: A note on the complexity of restricted attribute valued grammars. Computational Linguistics in the Netherlands, Meeting at Twente (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Roland Chin Ting-Chuen Pong

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Bolle, R., Aloimonos, Y., Fermüller, C. (1997). Toward motion picture grammars. In: Chin, R., Pong, TC. (eds) Computer Vision — ACCV'98. ACCV 1998. Lecture Notes in Computer Science, vol 1352. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-63931-4_228

Download citation

  • DOI: https://doi.org/10.1007/3-540-63931-4_228

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-63931-2

  • Online ISBN: 978-3-540-69670-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics