Detecting topical events in digital video

Authors:
Tanveer Syeda-Mahmood

IBM Almaden Research Center, K57/B2, 650 Harry Road, San Jose, CA

IBM Almaden Research Center, K57/B2, 650 Harry Road, San Jose, CA
View Profile

,
S. Srinivasan

IBM Almaden Research Center, K57/B2, 650 Harry Road, San Jose, CA

IBM Almaden Research Center, K57/B2, 650 Harry Road, San Jose, CA
View Profile

MULTIMEDIA '00: Proceedings of the eighth ACM international conference on MultimediaOctober 2000Pages 85–94https://doi.org/10.1145/354384.354433

Published:30 October 2000Publication History

MULTIMEDIA '00: Proceedings of the eighth ACM international conference on Multimedia

Pages 85–94

ABSTRACT

The detection of events is essential to high-level semantic querying of video databases. It is also a very challenging problem requiring the detection and integration of evidence for an event available in multiple information modalities, such as audio, video and language. This paper focuses on the detection of specific types of events, namely, topic of discussion events that occur in classroom/lecture environments. Specifically, we present a query-driven approach to the detection of topic of discussion events with foils used in a lecture as a way to convey a topic. In particular, we use the image content of foils to detect visual events in which the foil is displayed and captured in the video stream. The recognition of a foil in video frames exploits the color and spatial layout of regions on foils using a technique called region hashing. Next, we use the textual phrases listed on a foil as an indication of a topic, and detect topical audio events as places in the audio track where the best evidence for the topical phrases was heard. Finally, we use a probabilistic model of event likelihood to combine the results of visual and audio avent detection that exploits their time cooccurrence. The resulting identification of topical events is evaluated in the domain of classroom lectures and talks.

References

1.G. Abowd et al. Teaching and learning as multimedia authoring: The classroom 2000 project. In Proc. ACM Multimedia, pages 104-111, 1996.]] Google ScholarDigital Library
2.D.E. Appelt et al. Maestro: Conductor of multimedia analysis technologies, caem, 43:57-63, February 2000.]] Google ScholarDigital Library
3.K. Bharat and M. Henzinger. Improved algorithms for topic distillation in a hypexlinked environment. In Proc. 22nd Annual SIGIR Conference, pages 326--327, 1999.]]Google Scholar
4.J.C. Clark and N. Ferrier. Modal control of an attentive vision system. In Proceedings of the International Conference on Computer Vision, pages 514-523. 1988.]]Google ScholarCross Ref
5.G. Hauptmann, D. Lee, and P.E. Kennedy. Topic labeling of multilingual broadcast news in the informedia digital video library. In Proc. A CM Digital Libraries/SIGIR MIDAS Workshop, 1999.]] Google ScholarDigital Library
6.S. Jones and G. Paynter. Topic-based browsing within a digital library using keyphrases. In Proc. 4th ACM Conference on Digital Libraries, pages 114-121, 1999.]] Google ScholarDigital Library
7.C. Koch and S. Ullman. Selecting one among the many: A simple network implementing shifts in selective visual attention. Technical report, Artificial Intelligence Lab, M.I.T., AI-Memo-770, Januaxy 1984.]] Google ScholarDigital Library
8.Y. Lamdan and H.J. Wolfish. Geometric hashing: A general and efficient model-based recognition scheme. In Proceedings of the International Conference on Computer Vision, pages 218-249, 1988.]]Google ScholarCross Ref
9.S. Mukhopadhyay and B. Smith. Passive capturing and structuring of lectures. In Proc. A CM Multimedia, pages 477-488, 1999.]] Google ScholarDigital Library
10.W. Niblack. Slidefinder: A tool for browsing presentation graphics using content-based retrieval. In Proc. IEEE Workshop on Content-based Access of Image and Video Libraries, pages 114-118, 1999.]] Google ScholarDigital Library
11.R. Schwartz et al. A maximum likelihood model for topic classification in broadcast news. In Proc. European Conf. on Speech Communication and Technology, 1997.]]Google Scholar
12.J.M. Siskind and Q. Morris. A maximum likelihood approach to visual event classification. In European Conf. Computer Vision, pages 347-362, 1996.]] Google ScholarDigital Library
13.S. Srinivasan et al. Query expansion for imperfect speech: Applications in distributed learning. In Proc. IEEE Workshop on Content-based Access of Image and Video Libraries (CBAIVL-2000), 2000.]] Google ScholarDigital Library
14.S. Srinivasan and D. Petkovic. Phonetic confusion matrix-based spoken document retrieval. In Proc. Special Interest Group on Information Retrieval (SIGIR) 2000, 2000.]] Google ScholarDigital Library
15.S. Srinivasan, D. Petkovic, and D. Ponceleon. Towards robust features for classifying audio in the cuevideo system. In Proc. A CM Multimedia, pages 393-400, 1999.]] Google ScholarDigital Library
16.T. Syeda-Mahmood. Indexing of topics using foils. In IEEE Conf. on Computer Vision and Pattern Recognition, 2000.]]Google Scholar
17.T. Syeda-Mahmood, P. Raghavan, and N. Megiddo. Interval hash trees: An efficient index structure for searching object queries in large image databases. In IEEE Workshop on Content-based Access of Image and Video Libraries, 2000.]] Google ScholarDigital Library
18.T.F. Syeda-Mahmood and Y-Q. Cheng. Indexing colored surfaces in images. In Proceedings Int. Conf., on Pattern Recognition, 1996.]] Google ScholarDigital Library

Index Terms

Detecting topical events in digital video
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
2. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia databases

Recommendations

Detecting hot events from web search logs
WAIM'10: Proceedings of the 11th international conference on Web-age information management

Detecting events from web resources is a challenging task, attracting many attentions in recent years. Web search log is an important data source for event detection because the information it contains reflects users' activities and interestingness to ...
Read More
Sentiment classification of blog posts using topical extracts
ADC '12: Proceedings of the Twenty-Third Australasian Database Conference - Volume 124

Unlike news stories and product reviews which usually have a strong focus on a single topic, blog posts are often unstructured, and opinions expressed in blog posts do not necessarily correspond to a specific topic. This can lead to unsatisfactory ...
Read More
Detecting Video Anomalous Events with an Enhanced Abnormality Score
PRICAI 2022: Trends in Artificial Intelligence
Abstract
Detecting video anomalous events is vital for human monitoring. Anomalous events usually contain abnormal actions with exaggerated motion and little motion. We define the former and the latter as dynamic anomalies and static anomalies, ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MULTIMEDIA '00: Proceedings of the eighth ACM international conference on Multimedia
October 2000
523 pages
ISBN:1581131984
DOI:10.1145/354384
Chairmen:
Shahram Ghandeharizadeh
USC
,
Shih-Fu Chang
Columbia Univ., New York, NY
,
Stephen Fischer
GMD-IPSI
,
Joseph A. Konstan
Univ. of Minnesota
,
Klara Nahrstedt
Univ. of Illinois, Urbana-Champaign
Copyright © 2000 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 30 October 2000
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
multi-modal fusion
query-driven topic detection
slide detection
topic of discussion events
topical audio events
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate995of4,171submissions,24%
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 26
  Total Citations
  View Citations
- 673
  Total Downloads
- Downloads (Last 12 months)15
- Downloads (Last 6 weeks)5
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Detecting topical events in digital video

MULTIMEDIA '00: Proceedings of the eighth ACM international conference on Multimedia

ABSTRACT

References

Cited By

Index Terms

Recommendations

Detecting hot events from web search logs

Sentiment classification of blog posts using topical extracts

Detecting Video Anomalous Events with an Enhanced Abnormality Score