research-article

Content-enriched classifier for web video classification

Authors:

Gao CongAuthors Info & Claims

SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval

Pages 619 - 626

https://doi.org/10.1145/1835449.1835553

Published: 19 July 2010 Publication History

Abstract

With the explosive growth of online videos, automatic real-time categorization of Web videos plays a key role for organizing, browsing and retrieving the huge amount of videos on the Web. Previous work shows that, in addition to text features, content features of videos are also useful for Web video classification. Unfortunately, extracting content features is computationally prohibitive for real-time video classification. In this paper we propose a novel video classification framework that is able to exploit both content and text features for video classification while avoiding the expensive computation of extracting content features at classification time. The main idea of our approach is to utilize the content features extracted from training data to enrich the text based semantic kernels, yielding content-enriched semantic kernels. The content-enriched semantic kernels enable to utilize both content and text features for classifying new videos without extracting their content features. The experimental results show that our approach significantly outperforms the state-of-the-art video classification methods.

References

[1]

F. Bach, G. Lanckriet and M. Jordan. Multiple kernel learning, conic duality, and the SMO algorithm. In Proc. of ICML Conference, 2004.

Digital Library

[2]

S. Bloehdorn and A. Moschitti. Structure and semantics for expressive text kernels. In Proc. of CIKM conference, 2007.

Digital Library

[3]

M. Cammisa, S. Bloehdorn, R. Basili and A. Moschitti. Semantic kernels for text classification based on topological measures of feature similarity. In Proc. of IEEE ICDM Conference, 2006.

Digital Library

[4]

J. H. Chow, W. Dai, R. F. Zhang, R. Sarukkai and Z. F. Zhang. Joint categorization of queries and clips for Web--based video search. In Proc. of ACM MM Workshop on MIR, 2006.

Digital Library

[5]

K. W. Church and P. Hanks. Word association norms, mutual information, and lexicography. In Computational Linguistics 16, 1990.

Digital Library

[6]

C. Ciro, B. Dominik, H. Andreas and S. Gerd. Semantic Grounding of Tag Relatedness in Social Bookmarking Systems. In Proc. of ISWC Conference, 2008.

Digital Library

[7]

N. Cristianini and J. S. Taylor. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Cambridge University Press, 2000.

[8]

F. J. Damerau, C. Apte and S. M. Weiss. Automated learning of decision rules for text categorization. In ACM Trans. Information Systems, vol. 12, no. 3, pp. 233--251, 1994.

Digital Library

[9]

Z. Dong, G. Zhang, J. Jia and H. Bao. Keyframe-Based Real-Time Camera Tracking. In Proc. of IEEE ICCV Conference, 2009.

[10]

C. D. Fellbaum. WordNet: An Electronic Lexical Database. MIT Press, 1998.

[11]

C.W. Hsu and C. J. Lin A comparison of methods for multiclass support vector machines. In IEEE Trans. on Neural Networks, vol. 13, no. 2, pp. 415--425, 2002.

Digital Library

[12]

A. B. A. Graf and S. Borer. Normalization in Support Vector Machines. In Proc. of DAGM-Symposium on Pattern Recognition, 2001.

Digital Library

[13]

G. Lanckriet, N. Cristianini, P. Bartlett, L. Ghaoui and M. Jordan. Learning the Kernel Matrix with Semidefinite Programming. In Journal of Machine Learning Research, Vol. 5, pp 27--72, 2004.

Digital Library

[14]

C. Leacock and M. Chodorow. Combining local context and wordnet similarity for word sense identification. MITPress, 1998.

[15]

R. Lienhart, S. Fischer and W. Effelsberg. Automatic recogition of film genres. In Proc. of ACM MM Conference, 1995.

Digital Library

[16]

W. H. Lin and A. Hauptmann. News video classification using svm-based multimodal classifiers and combination strategies. In Proc. of ACM MM Conference, 2002.

Digital Library

[17]

Y. Liu and Y. F. Zheng. One-against-all multi-class svm classification using reliability measures. In Proc. of IJCNN Conference, 2005.

[18]

A. Moschitti. Efficient convolution kernels for dependency and constituent syntactic trees. In Proc. of ECML Conference, 2006.

Digital Library

[19]

T. Mei, X. S. Hua, X. Yuan, W. Lai and X. Q. Wu. Automatic video genre categorization using hierarchical svm. In Proc. of ICIP Confernece, 2006.

[20]

A. Rakotomamonjy, F. Bach, Y. Grandvalet and S. Canu. SimpleMKL. In Journal of Machine Learning Research, Vol. 9, pp 2491--2521, 2008

[21]

F. Sebastiani. Machine learning in automated text categorization. In ACM Computing Surveys, 2002.

Digital Library

[22]

J. Shawe-Taylor, N. Cristianini and H. Lodhi. Latent semantic kernels. In Journal of Intelligent Information Systems, 18(2-3):127--152, 2002.

Digital Library

[23]

B. T. Truong, S. Venkatesh and C. Dorai. Automatic Genre Indentification for Content-based Video Categorizaion. In Proc. of ICPR Conference, 2000.

Digital Library

[24]

P. Wang and C. Domeniconi. Building Semantic Kernels for Text Classification using Wikipedia. In Proc. of SIGKDD conference, 2008.

Digital Library

[25]

Z. Wu and M. Palmer. Verb semantic and lexical selection. In Proc. of ACL Conference, 1994.

Digital Library

[26]

X. Yang, X.--S. Hua, L. Yang and J. Liu. Multi-modality Web video categorization. In Proc. of ACM MM Workshop on MIR, 2007.

Digital Library

[27]

Y. Yang and X. Liu. A re-examination of text categorization methods. In Proc. of SIGIR Conference, 1998.

Digital Library

Cited By

Afzal MShah NMuhammad T(2019)Web video classification with visual and contextual semanticsInternational Journal of Communication Systems10.1002/dac.399432:13Online publication date: 23-Jun-2019
https://doi.org/10.1002/dac.3994
Xia SLi TGe SDong Z(2016)Efficient Web Video Classification via Cross-modality Knowledge TransferringProceedings of the International Conference on Internet Multimedia Computing and Service10.1145/3007669.3007677(211-216)Online publication date: 19-Aug-2016
https://dl.acm.org/doi/10.1145/3007669.3007677
Wang ZCui PXie LZhu WRui YYang S(2014)Bilateral Correspondence Model for Words-and-Pictures Association in Multimedia-Rich MicroblogsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/261138810:4(1-21)Online publication date: 4-Jul-2014
https://dl.acm.org/doi/10.1145/2611388
Show More Cited By

Index Terms

Content-enriched classifier for web video classification
1. Information systems
  1. Information retrieval

Recommendations

Text-based video content classification for online video-sharing sites

With the emergence of Web 2.0, sharing personal content, communicating ideas, and interacting with other online users in Web 2.0 communities have become daily routines for online users. User-generated data from Web 2.0 sites provide rich personal ...
Classification of Faults in Web Applications using Machine Learning
ISMSI '17: Proceedings of the 2017 International Conference on Intelligent Systems, Metaheuristics & Swarm Intelligence

Web is huge, abundant and heterogeneous and so are the challenges that arise due to this versatility. Web Applications as the new task-centric and action-oriented facilities have assumed a distinguished role in today's Web. At the same time, faults in ...
Classifier and feature set ensembles for web page classification

Web page classification is an important research direction on web mining. The abundant amount of data available on the web makes it essential to develop efficient and robust models for web mining tasks. Web page classification is the process of ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

SIGIR '10: Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval

July 2010

944 pages

ISBN:9781450301534

DOI:10.1145/1835449

General Chairs:
Fabio Crestani
University of Lugano, CH
,
Stéphane Marchand-Maillet
University of Geneva, CH
,
Program Chairs:
Hsin-Hsi Chen
National Taiwan University, TW
,
Efthimis N. Efthimiadis
University of Washington, USA
,
Jacques Savoy
University of Neuchatel, CH

Copyright © 2010 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2010

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

SIGIR '10

Sponsor:

SIGIR

SIGIR '10: The 33rd International ACM SIGIR conference on research and development in Information Retrieval

July 19 - 23, 2010

Geneva, Switzerland

Acceptance Rates

SIGIR '10 Paper Acceptance Rate 87 of 520 submissions, 17%;

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
663
Total Downloads

Downloads (Last 12 months)8
Downloads (Last 6 weeks)0

Reflects downloads up to 20 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Afzal MShah NMuhammad T(2019)Web video classification with visual and contextual semanticsInternational Journal of Communication Systems10.1002/dac.399432:13Online publication date: 23-Jun-2019
https://doi.org/10.1002/dac.3994
Xia SLi TGe SDong Z(2016)Efficient Web Video Classification via Cross-modality Knowledge TransferringProceedings of the International Conference on Internet Multimedia Computing and Service10.1145/3007669.3007677(211-216)Online publication date: 19-Aug-2016
https://dl.acm.org/doi/10.1145/3007669.3007677
Wang ZCui PXie LZhu WRui YYang S(2014)Bilateral Correspondence Model for Words-and-Pictures Association in Multimedia-Rich MicroblogsACM Transactions on Multimedia Computing, Communications, and Applications10.1145/261138810:4(1-21)Online publication date: 4-Jul-2014
https://dl.acm.org/doi/10.1145/2611388
Filippova KHall KMa WNie JBaeza-Yates RChua TCroft W(2011)Improved video categorization from text metadata and user commentsProceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval10.1145/2009916.2010028(835-842)Online publication date: 24-Jul-2011
https://dl.acm.org/doi/10.1145/2009916.2010028
Kamie MHashimoto TKitagawa H(2010)Topic-based awareness computing model for video-sharing service2010 2nd International Symposium on Aware Computing10.1109/ISAC.2010.5670453(44-50)Online publication date: Nov-2010
https://doi.org/10.1109/ISAC.2010.5670453

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents