VIRET Tool Meets NasNet

Lokoč, Jakub; Kovalčík, Gregor; Souček, Tomáš; Moravec, Jaroslav; Bodnár, Jan; Čech, Přemysl

doi:10.1007/978-3-030-05716-9_52

Jakub Lokoč¹⁹,
Gregor Kovalčík¹⁹,
Tomáš Souček¹⁹,
Jaroslav Moravec¹⁹,
Jan Bodnár¹⁹ &
…
Přemysl Čech¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11296))

Included in the following conference series:

International Conference on Multimedia Modeling

2325 Accesses
1 Citations

Abstract

The results of the last Video Browser Showdown in Bangkok 2018 show that multimodal search with interactive query reformulation represents a competitive search strategy for all the evaluated task categories. Therefore, we plan to target the effectiveness of involved retrieval models by making use of the most recent deep network architectures in the new version of our interactive video retrieval VIRET tool. Specifically, we apply the NasNet deep convolutional neural network architecture for automatic annotation and similarity search in the set of selected frames from the provided video collection. In addition, we implement temporal sequence queries and subimage similarity search to provide higher query formulation flexibility for users.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The representative was selected as a mean descriptor of images in one category. The original GoogLeNet was used to extract descriptors.

References

Barthel, K.U., Hezel, N., Mackowiak, R.: Navigating a graph of scenes for exploring large video collections. In: Tian, Q., Sebe, N., Qi, G.-J., Huet, B., Hong, R., Liu, X. (eds.) MMM 2016. LNCS, vol. 9517, pp. 418–423. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-27674-8_43
Chapter Google Scholar
Blazek, A., Lokoc, J., Kubon, D.: Video hunter at VBS 2017. In: MultiMedia Modeling - 23rd International Conference, MMM 2017, Proceedings, Part II, Reykjavik, Iceland, 4–6 January 2017, pp. 493–498 (2017)
Google Scholar
Čech, P., Maroušek, J., Lokoč, J., Silva, Y.N., Starks, J.: Comparing MapReduce-based k-NN similarity joins on hadoop for high-dimensional data. In: Cong, G., Peng, W.-C., Zhang, W.E., Li, C., Sun, A. (eds.) ADMA 2017. LNCS (LNAI), vol. 10604, pp. 63–75. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69179-4_5
Chapter Google Scholar
Cobârzan, C., et al.: Interactive video search tools: a detailed analysis of the video browser showdown 2015. Multimedia Tools Appl. 76(4), 5539–5571 (2017)
Article Google Scholar
Hu, P., Ramanan, D.: Finding tiny faces. CoRR abs/1612.04402 (2016)
Google Scholar
Lokoc, J., Bailer, W., Schoeffmann, K., Muenzer, B., Awad, G.: On influential trends in interactive video retrieval: video browser showdown 2015–2017. IEEE Trans. Multimedia 20(12), 3361–3376 (2018). https://ieeexplore.ieee.org/document/8352047
Article Google Scholar
Lokoč, J., Blažek, A., Skopal, T.: Signature-based video browser. In: Gurrin, C., Hopfgartner, F., Hurst, W., Johansen, H., Lee, H., O’Connor, N. (eds.) MMM 2014. LNCS, vol. 8326, pp. 415–418. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-04117-9_49
Chapter Google Scholar
Lokoč, J., Kovalčík, G., Souček, T.: Revisiting SIRET video retrieval tool. In: MultiMedia Modeling - 24th International Conference, MMM 2018, Bangkok, Thailand, Proceedings, Part II, 5–7 February 2018, pp. 419–424 (2018)
Google Scholar
Lokoč, J., Souček, T., Kovalčík, G.: Using an interactive video retrieval tool for lifelog data. In: Proceedings of the 2018 ACM Workshop on the Lifelog Search Challenge, LSC 2018, pp. 15–19. ACM, New York (2018)
Google Scholar
Nguyen, P.A., Lu, Y.-J., Zhang, H., Ngo, C.-W.: Enhanced VIREO KIS at VBS 2018. In: Schoeffmann, K., et al. (eds.) MMM 2018. LNCS, vol. 10705, pp. 407–412. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73600-6_42
Chapter Google Scholar
Primus, M.J., Münzer, B., Leibetseder, A., Schoeffmann, K.: The ITEC collaborative video search system at the video browser showdown 2018. In: Schoeffmann, K., et al. (eds.) MMM 2018. LNCS, vol. 10705, pp. 438–443. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-73600-6_47
Chapter Google Scholar
Rossetto, L., Giangreco, I., Tănase, C., Schuldt, H., Dupont, S., Seddati, O.: Enhanced retrieval and browsing in the IMOTION system. In: Amsaleg, L., Guðmundsson, G.Þ., Gurrin, C., Jónsson, B.Þ., Satoh, S. (eds.) MMM 2017. LNCS, vol. 10133, pp. 469–474. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-51814-5_43
Chapter Google Scholar
Szegedy, C., et al.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, USA, 7–12 June 2015, pp. 1–9 (2015)
Google Scholar
Zhou, X., et al.: EAST: an efficient and accurate scene text detector. CoRR abs/1704.03155 (2017)
Google Scholar
Zoph, B., Vasudevan, V., Shlens, J., Le, Q.V.: Learning transferable architectures for scalable image recognition. CoRR abs/1707.07012 (2017)
Google Scholar

Download references

Acknowledgments

This paper has been supported in part by Czech Science Foundation (GAČR) project Nr. 17-22224S and by Charles University grant SVV-260451.

Author information

Authors and Affiliations

SIRET Research Group, Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic
Jakub Lokoč, Gregor Kovalčík, Tomáš Souček, Jaroslav Moravec, Jan Bodnár & Přemysl Čech

Authors

Jakub Lokoč
View author publications
You can also search for this author in PubMed Google Scholar
Gregor Kovalčík
View author publications
You can also search for this author in PubMed Google Scholar
Tomáš Souček
View author publications
You can also search for this author in PubMed Google Scholar
Jaroslav Moravec
View author publications
You can also search for this author in PubMed Google Scholar
Jan Bodnár
View author publications
You can also search for this author in PubMed Google Scholar
Přemysl Čech
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Přemysl Čech .

Editor information

Editors and Affiliations

Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
Ioannis Kompatsiaris
EURECOM, Sophia Antipolis, France
Benoit Huet
Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
Vasileios Mezaris
Dublin City University, Dublin, Ireland
Cathal Gurrin
National Chiao Tung University, Hsinchu, Taiwan
Wen-Huang Cheng
Information Technologies Institute, Centre for Research and Technology Hellas, Thessaloniki, Greece
Stefanos Vrochidis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lokoč, J., Kovalčík, G., Souček, T., Moravec, J., Bodnár, J., Čech, P. (2019). VIRET Tool Meets NasNet. In: Kompatsiaris, I., Huet, B., Mezaris, V., Gurrin, C., Cheng, WH., Vrochidis, S. (eds) MultiMedia Modeling. MMM 2019. Lecture Notes in Computer Science(), vol 11296. Springer, Cham. https://doi.org/10.1007/978-3-030-05716-9_52

Download citation

DOI: https://doi.org/10.1007/978-3-030-05716-9_52
Published: 11 December 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-05715-2
Online ISBN: 978-3-030-05716-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics