Skip to main content

METU-MMDS: An Intelligent Multimedia Database System for Multimodal Content Extraction and Querying

  • Conference paper
  • First Online:
Book cover MultiMedia Modeling (MMM 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9517))

Included in the following conference series:

Abstract

Managing a large volume of multimedia data, which contain various modalities (visual, audio, and text), reveals the need for a specialized multimedia database system (MMDS) to efficiently model, process, store and retrieve video shots based on their semantic content. This demo introduces METU-MMDS, an intelligent MMDS which employs both machine learning and database techniques. The system extracts semantic content automatically by using visual, audio and textual data, stores the extracted content in an appropriate format and uses this content to efficiently retrieve video shots. The system architecture supports various multimedia query types including unimodal querying, multimodal querying, query-by-concept, query-by-example, and utilizes a multimedia index structure for efficiently querying multi-dimensional multimedia data. We demonstrate METU-MMDS for semantic data extraction from videos and complex multimedia querying by considering content and concept-based queries containing all modalities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Rashid, U., Bhatti, M.A.: Exploration and management of web based multimedia information resources. In: Elleithy, K. (ed.) Innovations and Advanced Techniques in Systems, Computing Sciences and Software Engineering, pp. 500–506. Springer, The Netherlands (2008)

    Chapter  Google Scholar 

  2. Brendan, J., Hongzhi, L., et al.: Structured exploration of who, what, when, and where in heterogeneous multimedia news sources. In: ACM MM, pp. 357–360 (2013)

    Google Scholar 

  3. Stefanidis, K., Koutrika, G., Pitoura, E.: A survey on representation, composition and application of preferences in database systems. J. TODS 36, 19–45 (2011). ACM

    Google Scholar 

  4. Meng, T., Shyu, M.L.: Leveraging concept association network for multimedia rare concept mining and retrieval. In: ICME, pp. 860–865. IEEE, Melbourne (2012)

    Google Scholar 

  5. Smith, J.R.: Riding the multimedia big data wave. In: SIGIR, pp. 1–2. ACM (2013)

    Google Scholar 

  6. Aydinlilar, M., Yazici, A.: Semi-automatic semantic video annotation tool. In: Gelenbe, E., Lent, R. (eds.) International Symposium on Computer and Information Sciences, pp. 303–310. Springer, Paris (2012)

    Google Scholar 

  7. Yilmaz, T., Yazici, A., Yildirim, Y.: Exploiting class-specific features in multi-feature dissimilarity space for efficient querying of images. In: Christiansen, H., De Tré, G., Yazici, A., Zadrozny, S., Andreasen, T., Larsen, H.L. (eds.) FQAS 2011. LNCS, vol. 7022, pp. 149–161. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  8. Deng, Y., Manjunath, B.S.: Unsupervised segmentation of color-texture regions in images and video. IEEE J. TPAMI 23(8), 800–810 (2001)

    Article  Google Scholar 

  9. Okuyucu, C., Sert, M., Yazici, A.: Audio feature and classifier analysis for efficient recognition of environmental sounds. In: ISM, pp. 125–132. IEEE, USA (2013)

    Google Scholar 

  10. Kucuk, D., Yazici, A.: Exploiting information extraction techniques for automatic semantic video indexing with an application to Turkish news videos. J. Knowl.-Based Sys. 25(6), 844–857 (2011)

    Article  Google Scholar 

  11. Gulen, E., Yilmaz, T., Yazici, A.: Multimodal information fusion for semantic video analysis. J. IJMDEM 3(4), 52–74 (2012)

    Google Scholar 

  12. Kucuk, D., Ozgur, N.B., Yazici, A., Koyuncu, M.: A fuzzy conceptual model for multimedia data with a text-based automatic annotation scheme. J. IJUFKS 17(1), 135–152 (2009)

    Google Scholar 

  13. Yazici, A., Ince, C., Koyuncu, M.: Food index: a multidimensional index structure for similarity-based fuzzy object-oriented database models. J. IEEE Trans. Fuzzy Sys. 16(4), 942–957 (2008). IEEE

    Article  Google Scholar 

  14. Arslan, S., Yazici, A., Sacan, A., Toroslu, I.H., Acar, E.: Comparison of feature-based and image registration-based retrieval of image data using multidimensional data access methods. J. TKDE 86, 124–145 (2013). Elsevier

    Google Scholar 

  15. Safadi, B., Sahuguet, M., Huet, B.: When textual and visual information join forces for multimedia retrieval. In: ICMR, pp. 265–272. ACM (2014)

    Google Scholar 

  16. Yu, J., Cong, Y., Qin, Z., Wan, T.: Cross-modal topic correlations for multimedia retrieval. In: International Conference on Pattern Recognition, pp. 246–249. IEEE, Japan (2012)

    Google Scholar 

Download references

Acknowledgments

This work is supported by the research grant from TUBITAK with the grant number 114R0182. We also thank to all of the previous researchers of Multimedia Db. Lab. at METU who have contributed to this study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Adnan Yazici .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Yazici, A., Sattari, S., Yilmaz, T., Sert, M., Koyuncu, M., Gulen, E. (2016). METU-MMDS: An Intelligent Multimedia Database System for Multimodal Content Extraction and Querying. In: Tian, Q., Sebe, N., Qi, GJ., Huet, B., Hong, R., Liu, X. (eds) MultiMedia Modeling. MMM 2016. Lecture Notes in Computer Science(), vol 9517. Springer, Cham. https://doi.org/10.1007/978-3-319-27674-8_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-27674-8_33

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-27673-1

  • Online ISBN: 978-3-319-27674-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics