Proceedings of the 2016 ACM Workshop on Multimedia COMMONS

MMCommons '16: Proceedings of the 2016 ACM Workshop on Multimedia COMMONS

October 2016

General Chairs:
Bart Thomee, Yahoo Labs, USA
Damian Borth, German Research Center for Artificial Intelligence (DFKI), Germany
Julia Bernd, International Computer Science Institute, USA

Publisher:

Association for Computing Machinery, New York, NY, United States

Conference:

MM '16: ACM Multimedia Conference, Amsterdam, The Netherlands, 16 October 2016

ISBN:

978-1-4503-4515-6

Published:

16 October 2016

Sponsors:

SIGMM

Abstract

Leveraged wisely, new datasets can inspire new multimedia methods and algorithms, as well as catalyze innovations in how their efficacy, efficiency, and generalizability can be evaluated. The availability of very large multimedia datasets like the Yahoo-Flickr Creative Commons 100 Million (YFCC100M), which spans 99.2 million images and 0.8 million videos, has offered unique opportunities for advancing the state of the art in multimedia processing, analysis, search, and visualization.

The Multimedia Commons Initiative has been developing a community around the YFCC100M, including associated annotation and evaluation efforts. Computed features, human-generated annotations, and analysis tools have been released into the public domain, hosted via Amazon's Public Data Sets program. In addition to research in several multimedia subfields, including computer vision, image processing, and video content analysis, the YFCC100M and Multimedia Commons resources have been used in various competitions and benchmarks, such as the MediaEval Placing Task and the ACM Multimedia Grand Challenge competition.

As use of the YFCC100M and the Multimedia Commons resources broadens across the multimedia community, the MMCommons'16 workshop offers an opportunity for participants to share new research results, compare approaches, and coordinate efforts to maximize the scientific benefit of the initiative. In particular, this massive, open dataset challenges us to pursue some important "meta-research" questions, such as how to measure the scalability, generalizability, and reproducibility of methods across datasets; whether we need to rethink our evaluation paradigms as the field moves in new directions, in particular to better approximate "in the wild" conditions; and how annotation strategies affect the impact of benchmarks and data challenges using that data.

Participants in MMCommons'16 will share novel research using the YFCC100M dataset, particularly focusing on solving multimedia problems in ways that were not possible with previous data collections. Themes that will receive particular focus in the paper sessions include improving the understanding and representation of multimedia content; leveraging user-supplied metadata to bootstrap analysis and benchmarking; enabling web-scale distributed search and indexing; and defining strategies for performance evaluation, with an eye towards maximizing generalizability.

These themes will also be explored in special sessions and discussions on dataset bias, reproducibility, and task-driven annotation. The workshop will kick off with a keynote by Roeland Ordelman on the importance of the benchmark development process in shaping our understanding of the research problems being addressed, with examples from audiovisual search evaluations.

Proceeding Downloads

PDF(Title Page, Copyright, Multimedia COMMONS (MMCommons) 2016 Workshop Chairs' Welcome, Contents, Organization)

PDF(Author Index)

SESSION: Keynote Address

invited-talk

Developing Benchmarks: The Importance of the Process and New Paradigms

Roeland J.F. Ordelman

Page 1, https://doi.org/10.1145/2983554.2983562

The value and importance of Benchmark Evaluations is widely acknowledged. Benchmarks play a key role in many research projects. It takes time, a well-balanced team of domain specialists preferably with links to the user community and industry, and a ...

SESSION: Paper Session 1: Retrieval at Scale

research-article

In-depth Exploration of Geotagging Performance using Sampling Strategies on YFCC100M

Pages 3–10, https://doi.org/10.1145/2983554.2983558

Evaluating multimedia analysis and retrieval systems is a highly challenging task, of which the outcomes can be highly volatile depending on the selected test collection. In this paper, we focus on the problem of multimedia geotagging, i.e. estimating ...

research-article

YFCC100M HybridNet fc6 Deep Features for Content-Based Image Retrieval

Pages 11–18, https://doi.org/10.1145/2983554.2983557

This paper presents a corpus of deep features extracted from the YFCC100M images considering the fc6 hidden layer activation of the HybridNet deep convolutional neural network. For a set of randomly selected queries we made available k-NN results obtained ...

research-article

Concept-Level Multimodal Ranking of Flickr Photo Tags via Recall Based Weighting

Pages 19–26, https://doi.org/10.1145/2983554.2983555

Social media platforms allow users to annotate photos with tags that significantly facilitate effective semantic understanding, search, and retrieval of photos. However, due to the manual, ambiguous, and personalized nature of user tagging, many ...

SESSION: Paper Session 2: Exploring the YFCC100M

research-article

Analysis of Spatial, Temporal, and Content Characteristics of Videos in the YFCC100M Dataset

Pages 27–34, https://doi.org/10.1145/2983554.2983559

The Yahoo Flickr Creative Commons 100 Million dataset (YFCC100M) is one of the largest public databases containing images and videos and their annotations for research on multimedia analysis. In this paper, we present our study on analysis of ...

research-article

Which Languages do People Speak on Flickr?: A Language and Geo-Location Study of the YFCC100m Dataset

Pages 35–42, https://doi.org/10.1145/2983554.2983560

Recently, the Yahoo Flickr Creative Commons 100 Million (YFCC100m) dataset was introduced to the computer vision and multimedia research community. This dataset consists of millions of images and videos spread over the globe. This geo-distribution hints ...

Proceedings of the 2016 ACM Workshop on Multimedia COMMONS
1. Information systems
  1. Information systems applications
