Leveraged wisely, new datasets can inspire new multimedia methods and algorithms, as well as catalyze innovations in how their efficacy, efficiency, and generalizability can be evaluated. The availability of very large multimedia datasets like the Yahoo-Flickr Creative Commons 100 Million (YFCC100M)---which spans 99.2 million images and 0.8 million videos---has offered unique opportunities for advancing the state of the art in multimedia processing, analysis, search, and visualization.
The Multimedia Commons Initiative has been developing a community around the YFCC100M, including associated annotation and evaluation efforts. Computed features, human-generated annotations, and analysis tools have been released into the public domain, hosted via Amazon's Public Data Sets program. In addition to research in several multimedia subfields, including computer vision, image processing, and video content analysis, the YFCC100M and Multimedia Commons resources have been used in various competitions and benchmarks, such as the MediaEval Placing Task and the ACM Multimedia Grand Challenge competition.
As use of the YFCC100M and the Multimedia Commons resources broadens across the multimedia community, the MMCommons'16 workshop offers an opportunity for participants to share new research results, compare approaches, and coordinate efforts to maximize the scientific benefit of the initiative. In particular, this massive, open dataset challenges us to pursue some important "meta-research" questions, such as how to measure the scalability, generalizability, and reproducibility of methods across datasets; whether we need to rethink our evaluation paradigms as the field moves in new directions, in particular to better approximate "in the wild" conditions; and how annotation strategies affect the impact of benchmarks and data challenges using that data.
Participants in MMCommons'16 will share novel research using the YFCC100M dataset, particularly focusing on solving multimedia problems in ways that were not possible with previous data collections. Themes that will receive particular focus in the paper sessions include improving the understanding and representation of multimedia content; leveraging user-supplied metadata to bootstrap analysis and benchmarking; enabling web-scale distributed search and indexing; and defining strategies for performance evaluation, with an eye towards maximizing generalizability.
These themes will also be explored in special sessions and discussions on dataset bias, reproducibility, and task-driven annotation. The workshop will kick off with a keynote by Roeland Ordelman on the importance of the benchmark development process in shaping our understanding of the research problems being addressed, with examples from audiovisual search evaluations.
Proceeding Downloads
Developing Benchmarks: The Importance of the Process and New Paradigms
The value and importance of Benchmark Evaluations is widely acknowledged. Benchmarks play a key role in many research projects. It takes time, a well-balanced team of domain specialists preferably with links to the user community and industry, and a ...
In-depth Exploration of Geotagging Performance using Sampling Strategies on YFCC100M
Evaluating multimedia analysis and retrieval systems is a highly challenging task, of which the outcomes can be highly volatile depending on the selected test collection. In this paper, we focus on the problem of multimedia geotagging, i.e. estimating ...
YFCC100M HybridNet fc6 Deep Features for Content-Based Image Retrieval
This paper presents a corpus of deep features extracted from the YFCC100M images considering the fc6 hidden layer activation of the HybridNet deep convolutional neural network. For a set of random selected queries we made available k-NN results obtained ...
Concept-Level Multimodal Ranking of Flickr Photo Tags via Recall Based Weighting
Social media platforms allow users to annotate photos with tags that significantly facilitate an effective semantics understanding, search, and retrieval of photos. However, due to the manual, ambiguous, and personalized nature of user tagging, many ...
Analysis of Spatial, Temporal, and Content Characteristics of Videos in the YFCC100M Dataset
The Yahoo Flickr Creative Commons 100 Million dataset (YFCC100M) is one of the largest public databases containing images and videos and their annotations for research on multimedia analysis. In this paper, we present our study on analysis of ...
Which Languages do People Speak on Flickr?: A Language and Geo-Location Study of the YFCC100m Dataset
Recently, the Yahoo Flickr Creative Commons 100 Million (YFCC100m) dataset was introduced to the computer vision and multimedia research community. This dataset consists of millions of images and videos spread over the globe. This geo-distribution hints ...
Cited By
- Friedler S, Scheidegger C and Venkatasubramanian S (2021). The (Im)possibility of fairness, Communications of the ACM, 64:4, (136-143), Online publication date: 1-Apr-2021.
- Aboulnaga A, Abouzied A, Echihabi K and Ouzzani M (2021). Database systems research in the Arab world, Communications of the ACM, 64:4, (120-123), Online publication date: 1-Apr-2021.
- Lazem S, Saleh M and Alabdulqader E (2021). ArabHCI, Communications of the ACM, 64:4, (69-71), Online publication date: 1-Apr-2021.
- Darwish K, Habash N, Abbas M, Al-Khalifa H, Al-Natsheh H, Bouamor H, Bouzoubaa K, Cavalli-Sforza V, El-Beltagy S, El-Hajj W, Jarrar M and Mubarak H (2021). A panoramic survey of natural language processing in the Arab world, Communications of the ACM, 64:4, (72-81), Online publication date: 1-Apr-2021.
- Abbar S, Stanojevic R, Mustafa S and Mokbel M (2021). Traffic routing in the ever-changing city of Doha, Communications of the ACM, 64:4, (67-68), Online publication date: 1-Apr-2021.
- Zhang H, Lim H, Leis V, Andersen D, Kaminsky M, Keeton K and Pavlo A (2021). Succinct range filters, Communications of the ACM, 64:4, (166-173), Online publication date: 1-Apr-2021.
- Keyes D (2021). The Arab world prepares the exascale workforce, Communications of the ACM, 64:4, (82-87), Online publication date: 1-Apr-2021.
- Idreos S (2021). Technical perspective: The strength of SuRF, Communications of the ACM, 64:4, (165-165), Online publication date: 1-Apr-2021.
- Weber I, Imran M, Ofli F, Mrad F, Colville J, Fathallah M, Chaker A and Ahmed W (2021). Non-traditional data sources, Communications of the ACM, 64:4, (88-95), Online publication date: 1-Apr-2021.
- Sako M (2021). From remote work to working from anywhere, Communications of the ACM, 64:4, (20-22), Online publication date: 1-Apr-2021.
- De Mol L and Bullynck M (2021). Roots of 'program' revisited, Communications of the ACM, 64:4, (35-37), Online publication date: 1-Apr-2021.
- Vrandečić D (2021). Building a multilingual Wikipedia, Communications of the ACM, 64:4, (38-41), Online publication date: 1-Apr-2021.
- Reis E, Costa C, Silveira D, Bavaresco R, Righi R, Barbosa J, Antunes R, Gomes M and Federizzi G (2021). Transformers aftermath, Communications of the ACM, 64:4, (154-163), Online publication date: 1-Apr-2021.
- Pöpper C, Maniatakos M and Di Pietro R (2021). Cyber security research in the Arab region, Communications of the ACM, 64:4, (96-101), Online publication date: 1-Apr-2021.
- Jung R, Jourdan J, Krebbers R and Dreyer D (2021). Safe systems programming in Rust, Communications of the ACM, 64:4, (144-152), Online publication date: 1-Apr-2021.
- Proceedings of the 2016 ACM Workshop on Multimedia COMMONS
Recommendations
ACM CoNEXT 2016 Student Workshop
CoNEXT '16: Proceedings of the 12th International on Conference on emerging Networking EXperiments and TechnologiesThe ACM CoNEXT 2016 Student Workshop is held in Irvine, California, USA on December 12, 2016 and co-located with the ACM 12th International Conference on emerging Networking Experiments and Technologies (CoNEXT 2016). The main objective of the workshop ...