Massive-scale multimedia semantic modeling

Authors:
John R. Smith, IBM T. J. Watson Research Center, Yorktown Heights, NY, USA
Liangliang Cao, IBM T. J. Watson Research Center, Yorktown Heights, NY, USA

MM '13: Proceedings of the 21st ACM international conference on Multimedia, October 2013, Pages 1113–1114
https://doi.org/10.1145/2502081.2502235

Published: 21 October 2013

ABSTRACT

Visual data is exploding! 500 billion consumer photos are taken each year worldwide, 633 million of them in NYC alone, and 120 new video-hours are uploaded to YouTube every minute. This explosion of digital multimedia data is creating a valuable open source of insights. However, the unconstrained nature of 'image/video in the wild' makes automated computer-based analysis very challenging. Furthermore, the most interesting content in multimedia files is often complex, reflecting a diversity of human behaviors, scenes, activities, and events. To address these challenges, this tutorial will provide a unified overview of two emerging techniques, semantic modeling and massive-scale visual recognition, with the goal of both introducing people from different backgrounds to this exciting field and reviewing state-of-the-art research in the new computational era.

Index Terms

Massive-scale multimedia semantic modeling
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision tasks
        Video summarization
  2. Machine learning
2. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia databases

Published in
MM '13: Proceedings of the 21st ACM international conference on Multimedia
October 2013
1166 pages
ISBN: 9781450324045
DOI: 10.1145/2502081
General Chairs: Alejandro (Alex) Jaimes (Yahoo!, Spain), Nicu Sebe (University of Trento, Italy), Nozha Boujemaa (INRIA, France)
Program Chairs: Daniel Gatica-Perez (IDIAP & EPFL, Switzerland), David A. Shamma (Yahoo!, USA), Marcel Worring (University of Amsterdam, The Netherlands), Roger Zimmermann (National University of Singapore, Singapore)
Copyright © 2013 Owner/Author
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
content-based search
machine learning
multimedia information retrieval
semantic modeling
video analysis
Qualifiers
- technical-note
Conference

Acceptance Rates
MM '13 Paper Acceptance Rate: 47 of 235 submissions, 20%
Overall Acceptance Rate: 995 of 4,171 submissions, 24%