ABSTRACT
The detection of events is essential to high-level semantic querying of video databases. It is also a very challenging problem requiring the detection and integration of evidence for an event available in multiple information modalities, such as audio, video and language. This paper focuses on the detection of specific types of events, namely, topic of discussion events that occur in classroom/lecture environments. Specifically, we present a query-driven approach to the detection of topic of discussion events with foils used in a lecture as a way to convey a topic. In particular, we use the image content of foils to detect visual events in which the foil is displayed and captured in the video stream. The recognition of a foil in video frames exploits the color and spatial layout of regions on foils using a technique called region hashing. Next, we use the textual phrases listed on a foil as an indication of a topic, and detect topical audio events as places in the audio track where the best evidence for the topical phrases was heard. Finally, we use a probabilistic model of event likelihood to combine the results of visual and audio avent detection that exploits their time cooccurrence. The resulting identification of topical events is evaluated in the domain of classroom lectures and talks.
- 1.G. Abowd et al. Teaching and learning as multimedia authoring: The classroom 2000 project. In Proc. ACM Multimedia, pages 104-111, 1996.]] Google ScholarDigital Library
- 2.D.E. Appelt et al. Maestro: Conductor of multimedia analysis technologies, caem, 43:57-63, February 2000.]] Google ScholarDigital Library
- 3.K. Bharat and M. Henzinger. Improved algorithms for topic distillation in a hypexlinked environment. In Proc. 22nd Annual SIGIR Conference, pages 326--327, 1999.]]Google Scholar
- 4.J.C. Clark and N. Ferrier. Modal control of an attentive vision system. In Proceedings of the International Conference on Computer Vision, pages 514-523. 1988.]]Google ScholarCross Ref
- 5.G. Hauptmann, D. Lee, and P.E. Kennedy. Topic labeling of multilingual broadcast news in the informedia digital video library. In Proc. A CM Digital Libraries/SIGIR MIDAS Workshop, 1999.]] Google ScholarDigital Library
- 6.S. Jones and G. Paynter. Topic-based browsing within a digital library using keyphrases. In Proc. 4th ACM Conference on Digital Libraries, pages 114-121, 1999.]] Google ScholarDigital Library
- 7.C. Koch and S. Ullman. Selecting one among the many: A simple network implementing shifts in selective visual attention. Technical report, Artificial Intelligence Lab, M.I.T., AI-Memo-770, Januaxy 1984.]] Google ScholarDigital Library
- 8.Y. Lamdan and H.J. Wolfish. Geometric hashing: A general and efficient model-based recognition scheme. In Proceedings of the International Conference on Computer Vision, pages 218-249, 1988.]]Google ScholarCross Ref
- 9.S. Mukhopadhyay and B. Smith. Passive capturing and structuring of lectures. In Proc. A CM Multimedia, pages 477-488, 1999.]] Google ScholarDigital Library
- 10.W. Niblack. Slidefinder: A tool for browsing presentation graphics using content-based retrieval. In Proc. IEEE Workshop on Content-based Access of Image and Video Libraries, pages 114-118, 1999.]] Google ScholarDigital Library
- 11.R. Schwartz et al. A maximum likelihood model for topic classification in broadcast news. In Proc. European Conf. on Speech Communication and Technology, 1997.]]Google Scholar
- 12.J.M. Siskind and Q. Morris. A maximum likelihood approach to visual event classification. In European Conf. Computer Vision, pages 347-362, 1996.]] Google ScholarDigital Library
- 13.S. Srinivasan et al. Query expansion for imperfect speech: Applications in distributed learning. In Proc. IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL-2000), 2000.]] Google ScholarDigital Library
- 14.S. Srinivasan and D. Petkovic. Phonetic confusion matrix-based spoken document retrieval. In Proc. Special Interest Group on Information Retrieval (SIGIR) 2000, 2000.]] Google ScholarDigital Library
- 15.S. Srinivasan, D. Petkovic, and D. Ponceleon. Towards robust features for classifying audio in the cuevideo system. In Proc. A CM Multimedia, pages 393-400, 1999.]] Google ScholarDigital Library
- 16.T. Syeda-Mahmood. Indexing of topics using foils. In IEEE Conf. on Computer Vision and Pattern Recognition, 2000.]]Google Scholar
- 17.T. Syeda-Mahmood, P. Raghavan, and N. Megiddo. Interval hash trees: An efficient index structure for searching object queries in large image databases. In IEEE Workshop on Content-based Access of Image and Video Libraries, 2000.]] Google ScholarDigital Library
- 18.T.F. Syeda-Mahmood and Y-Q. Cheng. Indexing colored surfaces in images. In Proceedings Int. Conf., on Pattern Recognition, 1996.]] Google ScholarDigital Library
Index Terms
- Detecting topical events in digital video
Recommendations
Detecting hot events from web search logs
WAIM'10: Proceedings of the 11th international conference on Web-age information managementDetecting events from web resources is a challenging task, attracting many attentions in recent years. Web search log is an important data source for event detection because the information it contains reflects users' activities and interestingness to ...
Sentiment classification of blog posts using topical extracts
ADC '12: Proceedings of the Twenty-Third Australasian Database Conference - Volume 124Unlike news stories and product reviews which usually have a strong focus on a single topic, blog posts are often unstructured, and opinions expressed in blog posts do not necessarily correspond to a specific topic. This can lead to unsatisfactory ...
Detecting Video Anomalous Events with an Enhanced Abnormality Score
PRICAI 2022: Trends in Artificial IntelligenceAbstractDetecting video anomalous events is vital for human monitoring. Anomalous events usually contain abnormal actions with exaggerated motion and little motion. We define the former and the latter as dynamic anomalies and static anomalies, ...
Comments