skip to main content
10.1145/354384.354433acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article
Free Access

Detecting topical events in digital video

Authors Info & Claims
Published:30 October 2000Publication History

ABSTRACT

The detection of events is essential to high-level semantic querying of video databases. It is also a very challenging problem requiring the detection and integration of evidence for an event available in multiple information modalities, such as audio, video and language. This paper focuses on the detection of specific types of events, namely, topic of discussion events that occur in classroom/lecture environments. Specifically, we present a query-driven approach to the detection of topic of discussion events with foils used in a lecture as a way to convey a topic. In particular, we use the image content of foils to detect visual events in which the foil is displayed and captured in the video stream. The recognition of a foil in video frames exploits the color and spatial layout of regions on foils using a technique called region hashing. Next, we use the textual phrases listed on a foil as an indication of a topic, and detect topical audio events as places in the audio track where the best evidence for the topical phrases was heard. Finally, we use a probabilistic model of event likelihood to combine the results of visual and audio avent detection that exploits their time cooccurrence. The resulting identification of topical events is evaluated in the domain of classroom lectures and talks.

References

  1. 1.G. Abowd et al. Teaching and learning as multimedia authoring: The classroom 2000 project. In Proc. ACM Multimedia, pages 104-111, 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.D.E. Appelt et al. Maestro: Conductor of multimedia analysis technologies, caem, 43:57-63, February 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.K. Bharat and M. Henzinger. Improved algorithms for topic distillation in a hypexlinked environment. In Proc. 22nd Annual SIGIR Conference, pages 326--327, 1999.]]Google ScholarGoogle Scholar
  4. 4.J.C. Clark and N. Ferrier. Modal control of an attentive vision system. In Proceedings of the International Conference on Computer Vision, pages 514-523. 1988.]]Google ScholarGoogle ScholarCross RefCross Ref
  5. 5.G. Hauptmann, D. Lee, and P.E. Kennedy. Topic labeling of multilingual broadcast news in the informedia digital video library. In Proc. A CM Digital Libraries/SIGIR MIDAS Workshop, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. 6.S. Jones and G. Paynter. Topic-based browsing within a digital library using keyphrases. In Proc. 4th ACM Conference on Digital Libraries, pages 114-121, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.C. Koch and S. Ullman. Selecting one among the many: A simple network implementing shifts in selective visual attention. Technical report, Artificial Intelligence Lab, M.I.T., AI-Memo-770, Januaxy 1984.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.Y. Lamdan and H.J. Wolfish. Geometric hashing: A general and efficient model-based recognition scheme. In Proceedings of the International Conference on Computer Vision, pages 218-249, 1988.]]Google ScholarGoogle ScholarCross RefCross Ref
  9. 9.S. Mukhopadhyay and B. Smith. Passive capturing and structuring of lectures. In Proc. A CM Multimedia, pages 477-488, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. 10.W. Niblack. Slidefinder: A tool for browsing presentation graphics using content-based retrieval. In Proc. IEEE Workshop on Content-based Access of Image and Video Libraries, pages 114-118, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. 11.R. Schwartz et al. A maximum likelihood model for topic classification in broadcast news. In Proc. European Conf. on Speech Communication and Technology, 1997.]]Google ScholarGoogle Scholar
  12. 12.J.M. Siskind and Q. Morris. A maximum likelihood approach to visual event classification. In European Conf. Computer Vision, pages 347-362, 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. 13.S. Srinivasan et al. Query expansion for imperfect speech: Applications in distributed learning. In Proc. IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL-2000), 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. 14.S. Srinivasan and D. Petkovic. Phonetic confusion matrix-based spoken document retrieval. In Proc. Special Interest Group on Information Retrieval (SIGIR) 2000, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. 15.S. Srinivasan, D. Petkovic, and D. Ponceleon. Towards robust features for classifying audio in the cuevideo system. In Proc. A CM Multimedia, pages 393-400, 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. 16.T. Syeda-Mahmood. Indexing of topics using foils. In IEEE Conf. on Computer Vision and Pattern Recognition, 2000.]]Google ScholarGoogle Scholar
  17. 17.T. Syeda-Mahmood, P. Raghavan, and N. Megiddo. Interval hash trees: An efficient index structure for searching object queries in large image databases. In IEEE Workshop on Content-based Access of Image and Video Libraries, 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. 18.T.F. Syeda-Mahmood and Y-Q. Cheng. Indexing colored surfaces in images. In Proceedings Int. Conf., on Pattern Recognition, 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Detecting topical events in digital video

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              MULTIMEDIA '00: Proceedings of the eighth ACM international conference on Multimedia
              October 2000
              523 pages
              ISBN:1581131984
              DOI:10.1145/354384

              Copyright © 2000 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 30 October 2000

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • Article

              Acceptance Rates

              Overall Acceptance Rate995of4,171submissions,24%

              Upcoming Conference

              MM '24
              MM '24: The 32nd ACM International Conference on Multimedia
              October 28 - November 1, 2024
              Melbourne , VIC , Australia

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader