skip to main content
10.1145/3323873.3326921acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article
Best Demo

Multimodal Multimedia Retrieval with vitrivr

Published:05 June 2019Publication History

ABSTRACT

The steady growth of multimedia collections - both in terms of size and heterogeneity - necessitates systems that are able to conjointly deal with several types of media as well as large volumes of data. This is especially true when it comes to satisfying a particular information need, i.e., retrieving a particular object of interest from a large collection. Nevertheless, existing multimedia management and retrieval systems are mostly organized in silos and treat different media types separately. Hence, they are limited when it comes to crossing these silos for accessing objects. In this paper, we present vitrivr, a general-purpose content-based multimedia retrieval stack. In addition to the keyword search provided by most media management systems, vitrivr also exploits the object's content in order to facilitate different types of similarity search. This can be done within and, most importantly, across different media types giving rise to new, interesting use cases. To the best of our knowledge, the full vitrivr stack is unique in that it seamlessly integrates support for four different types of media, namely images, audio, videos, and 3D models.

References

  1. George Awad, Asad Butt, Keith Curtis, Yooyoung Lee, Jonathan Fiscus, Afzal Godil, David Joy, Andrew Delgado, Alan F. Smeaton, Yvette Graham, Wessel Kraaij, Georges Quénot, Joao Magalhaes, David Semedo, and Saverio Blasi. 2018. TRECVID 2018: Benchmarking Video Activity Detection, Video Captioning and Matching, Video Storytelling Linking and Video Search. In Proceedings of TRECVID 2018 . NIST, USA.Google ScholarGoogle Scholar
  2. Ding-Yun Chen, Xiao-Pei Tian, Yu-Te Shen, and Ming Ouhyoung. 2003. On Visual Similarity based 3D Model Retrieval. In Computer Graphics Forum, Vol. 22. Wiley Online Library, 223--232.Google ScholarGoogle Scholar
  3. Claudiu Cobârzan, Klaus Schoeffmann, Werner Bailer, Wolfgang Hürst, Adam Blavz ek, Jakub Lokovc, Stefanos Vrochidis, Kai Uwe Barthel, and Luca Rossetto. 2017. Interactive video Search Tools: a Detailed Analysis of the Video Browser Showdown 2015. Multimedia Tools and Applications, Vol. 76, 4 (2017), 5539--5571. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Myron Flickner, Harpreet Sawhney, Wayne Niblack, Jonathan Ashley, Qian Huang, Byron Dom, Monika Gorkani, Jim Hafner, Denis Lee, Dragutin Petkovic, et almbox. 1995. Query by Image and Video Content: The QBIC System. Computer, Vol. 28 (1995), 23--32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Jonathan T Foote. 1997. Content-based Retrieval of Music and Audio. In Proc. SPIE 3229, Multimedia Storage and Archiving Systems II. 138--147.Google ScholarGoogle Scholar
  6. Ralph Gasser, Luca Rossetto, and Heiko Schuldt. 2019. Towards an All-Purpose Content-Based Multimedia Information Retrieval System. arXiv preprint arXiv:1902.03878 (2019).Google ScholarGoogle Scholar
  7. Ivan Giangreco and Heiko Schuldt. 2016. ADAM$_pro$: Database Support for Big Multimedia Retrieval . Datenbank-Spektrum, Vol. 16, 1 (2016), 17--26.Google ScholarGoogle ScholarCross RefCross Ref
  8. Emilia Gó mez. 2006. Tonal Description of Music Audio Signals. Doctoral Dissertation. Universitat Pompeu Fabra, Barcelona.Google ScholarGoogle Scholar
  9. Michael Kazhdan, Thomas Funkhouser, and Szymon Rusinkiewicz. 2003. Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors. In Eurographics Symposium on Geometry Processing, Vol. 43. 156--164.Google ScholarGoogle Scholar
  10. Patrick M. Kelly, Michael Cannon, and Donald R. Hush. 1995. Query by image example: the CANDID approach.Google ScholarGoogle Scholar
  11. Serkan Kiranyaz, Kerem Caglar, Esin Guldogan, Olcay Guldogan, and Moncef Gabbouj. 2003. MUVIS: a Content-based Multimedia Indexing and Retrieval Framework. In Proceedings of the Seventh International Symposium on Signal Processing and Its Applications (ISSPA), Vol. 1. 1--8.Google ScholarGoogle ScholarCross RefCross Ref
  12. Goujun Lu. 2001. Indexing and retrieval of Audio: A Survey. Multimedia Tools and Applications, Vol. 15, 3 (2001), 269--290. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Meinard Muller, Frank Kurth, and Michael Clausen. 2005. Chroma-based Statistical Audio Features for Audio Matching. In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005. IEEE, New Paltz, NY, USA, 275--278.Google ScholarGoogle Scholar
  14. Luca Rossetto, Ivan Giangreco, and Heiko Schuldt. 2014. Cineast: a Multi-feature Sketch-based Video Retrieval Engine. In 2014 IEEE International Symposium on Multimedia. IEEE, Taichung, Taiwan, 18--23.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Luca Rossetto, Ivan Giangreco, and Heiko Schuldt. 2015a. OSVC-Open Short Video Collection 1.0. Technical Report CS-2015-002 (2015).Google ScholarGoogle Scholar
  16. Luca Rossetto, Ivan Giangreco, Heiko Schuldt, Stéphane Dupont, Omar Seddati, Metin Sezgin, and Yusuf Sahillioug lu. 2015b. IMOTION -- a Content-based Video Retrieval Engine. In International Conference on Multimedia Modeling. Springer, 255--260.Google ScholarGoogle ScholarCross RefCross Ref
  17. Luca Rossetto, Ivan Giangreco, Claudiu Tua nase, and Heiko Schuldt. 2016. vitrivr: A Flexible Retrieval Stack Supporting Multiple Query Modes for Searching in Multimedia Collections. In Proceedings of the 2016 ACM on Multimedia Conference. ACM, Amsterdam, the Netherlands, 1183--1186.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Luca Rossetto, Ivan Giangreco, Claudiu Tua nase, Heiko Schuldt, Stéphane Dupont, and Omar Seddati. 2017. Enhanced Retrieval and Browsing in the IMOTION System. In International Conference on Multimedia Modeling. Springer, 469--474.Google ScholarGoogle Scholar
  19. Luca Rossetto, Mahnaz Amiri Parian, Ralph Gasser, Ivan Giangreco, Silvan Heller, and Heiko Schuldt. 2019 a. Deep Learning-Based Concept Detection in vitrivr. In International Conference on Multimedia Modeling. Springer, 616--621.Google ScholarGoogle ScholarCross RefCross Ref
  20. Luca Rossetto, Heiko Schuldt, George Awad, and Asad A Butt. 2019 b. V3C--A Research Video Collection. In International Conference on Multimedia Modeling . Springer, 349--360.Google ScholarGoogle ScholarCross RefCross Ref
  21. Dietmar Saupe and Dejan V. Vranić. 2001. 3D Model Retrieval with Spherical Harmonics and Moments. In Proceedings of the 23rd DAGM-Symposium on Pattern Recognition, Vol. 2191. Springer, 392--397. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Klaus Schoeffmann, David Ahlström, Werner Bailer, Claudiu Cobârzan, Frank Hopfgartner, Kevin McGuinness, Cathal Gurrin, Christian Frisson, Duy-Dinh Le, Manfred Del Fabro, Hongliang Bai, and Wolfgang Weiss. 2014. The Video Browser Showdown: a Live Evaluation of Interactive Video Search Tools. International Journal of Multimedia Information Retrieval, Vol. 3, 2 (2014), 113--127.Google ScholarGoogle Scholar
  23. Johan WH Tangelder and Remco C Veltkamp. 2004. A survey of Content Based 3D Shape Retrieval Methods. In Shape Modeling Applications, 2004. Proceedings. IEEE, 145--156. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Rainer Typke, Frans Wiering, and Remco C Veltkamp. 2005. A Survey of Music Information Retrieval Systems. In Proceedings of the 6th International Conference on Music Information Retrieval. Queen Mary, University of London, 153--160.Google ScholarGoogle Scholar
  25. Avery Wang. 2006. The Shazam Music Recognition Service. Commun. ACM, Vol. 49, 8 (2006), 44--48. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Multimodal Multimedia Retrieval with vitrivr

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader