PraK Tool: An Interactive Search Tool Based on Video Data Services

Lokoč, Jakub; Vopálková, Zuzana; Stroh, Michael; Buchmueller, Raphael; Schlegel, Udo

doi:10.1007/978-3-031-53302-0_30

Jakub Lokoč¹⁴,
Zuzana Vopálková¹⁴,
Michael Stroh¹⁵,
Raphael Buchmueller¹⁶ &
…
Udo Schlegel¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14557))

Included in the following conference series:

International Conference on Multimedia Modeling

Abstract

This paper presents a tool relying on data service architecture, where technical details of all VBS datasets are completely hidden behind an abstract stateless data layer. The data services allow independent development of interactive search interfaces and refinement techniques, which is demonstrated by a smart front-end component. The component supports common search features and allows users to exploit content-based statistics for effective filtering. We believe that video data services might be a valuable addition to the open-source VBS toolkit, especially when available for the competition on a shared server with all VBS datasets, extracted features, and meta-data behind.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 74.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

An Interactive Video Search Tool: A Case Study Using the V3C1 Dataset

VERGE in VBS 2021

VERGE in VBS 2022

References

Amato, G.: VISIONE at video browser showdown 2023. In: Dang-Nguyen, D.T., et al. (eds.) MMM 2023. LNCS, vol. 13833, pp. 615–621. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-27077-2_48
Chapter Google Scholar
Amato, G., et al.: VISIONE Feature Repository for VBS: Multi-Modal Features and Detected Objects from MVK Dataset (2023). https://doi.org/10.5281/zenodo.8355037
Amato, G., et al.: VISIONE Feature Repository for VBS: Multi-Modal Features and Detected Objects from V3C1+V3C2 Dataset (2023). https://doi.org/10.5281/zenodo.8188570
Chernoff, H.: The use of faces to represent points in k-dimensional space graphically. J. Am. Stat. Assoc. 68(342), 361–368 (1973). https://doi.org/10.1080/01621459.1973.10482434
Article Google Scholar
Cox, I.J., Miller, M.L., Minka, T.P., Papathomas, T.V., Yianilos, P.N.: The Bayesian image retrieval system, PicHunter: theory, implementation, and psychophysical experiments. IEEE Trans. Image Process. 9(1), 20–37 (2000)
Article Google Scholar
Heller, S., et al.: Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th video browser showdown. Int. J. Multim. Inf. Retr. 11(1), 1–18 (2022). https://doi.org/10.1007/s13735-021-00225-2
Article Google Scholar
Ilharco, G., et al.: Openclip (2021). https://doi.org/10.5281/zenodo.5143773, if you use this software, please cite it as below
Kratochvíl, M., Mejzlík, F., Veselý, P., Souček, T., Lokoč, J.: SOMHunter: lightweight video search system with SOM-guided relevance feedback. In: Proceedings of the 28th ACM International Conference on Multimedia, MM 2020. ACM (2020, in press)
Google Scholar
Lokoč, J., Mejzlík, F., Souček, T., Dokoupil, P., Peška, L.: Video search with context-aware ranker and relevance feedback. In: Þór Jónsson, B., et al. (eds.) MMM 2022. LNCS, vol. 13142, pp. 505–510. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-98355-0_46
Chapter Google Scholar
Lokoč, J., Kovalčík, G., Souček, T., Moravec, J., Čech, P.: A framework for effective known-item search in video. In: Proceedings of the 27th ACM International Conference on Multimedia (MM 2019), Nice, France, 21–25 October 2019, pp. 1–9 (2019). https://doi.org/10.1145/3343031.3351046
Ma, Z., Wu, J., Loo, W., Ngo, C.W.: Reinforcement learning enhanced PicHunter for interactive search. In: Conference on Multimedia Modeling (2023)
Google Scholar
Pantelidis, N., et al.: VERGE in VBS 2023. In: Dang-Nguyen, D.T., et al. (eds.) MMM 2023. LNCS, vol. 13833, pp. 658–664. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-27077-2_55
Chapter Google Scholar
Radford, A., et al.: Learning transferable visual models from natural language supervision. CoRR abs/2103.00020 (2021). https://arxiv.org/abs/2103.00020
Rossetto, L., et al.: Interactive video retrieval in the age of deep learning–detailed evaluation of VBS 2019. IEEE Trans. Multimed. 23, 243–256 (2020). https://doi.org/10.1109/TMM.2020.2980944
Article Google Scholar
Rossetto, L., Schuldt, H., Awad, G., Butt, A.A.: V3C – a research video collection. In: Kompatsiaris, I., Huet, B., Mezaris, V., Gurrin, C., Cheng, W.-H., Vrochidis, S. (eds.) MMM 2019. LNCS, vol. 11295, pp. 349–360. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-05710-7_29
Chapter Google Scholar
Sauter, L., et al.: Exploring effective interactive text-based video search in vitrivr. In: Dang-Nguyen, D.T., et al. (eds.) MMM 2023. LNCS, vol. 13833, pp. 646–651. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-27077-2_53
Chapter Google Scholar
Schall, K., Hezel, N., Jung, K., Barthel, K.U.: Vibro: video browsing with semantic and visual image embeddings. In: Dang-Nguyen, D.T., et al. (eds.) MMM 2023. LNCS, pp. 665–670. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-27077-2_56
Chapter Google Scholar
Truong, Q.T., et al.: Marine video kit: a new marine video dataset for content-based analysis and retrieval. In: Dang-Nguyen, D.T., et al. (eds.) MMM 2023. LNCS, vol. 13833, pp. 539–550. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-27077-2_42
Chapter Google Scholar

Download references

Acknowledgment

This work has been supported by Czech Science Foundation (GAČR) project 22-21696S. We would like to thank the authors of the discussed VBS systems for clarification of architecture details.

Author information

Authors and Affiliations

SIRET Research Group, Department of Software Engineering Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic
Jakub Lokoč & Zuzana Vopálková
Visual Computing, Department of Computer Science Faculty of Natural Sciences, Konstanz University, Konstanz, Germany
Michael Stroh
Data Analysis and Visualization, Department of Computer Science Faculty of Natural Sciences, Konstanz University, Konstanz, Germany
Raphael Buchmueller & Udo Schlegel

Authors

Jakub Lokoč
View author publications
You can also search for this author in PubMed Google Scholar
Zuzana Vopálková
View author publications
You can also search for this author in PubMed Google Scholar
Michael Stroh
View author publications
You can also search for this author in PubMed Google Scholar
Raphael Buchmueller
View author publications
You can also search for this author in PubMed Google Scholar
Udo Schlegel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jakub Lokoč .

Editor information

Editors and Affiliations

University of Amsterdam, Amsterdam, The Netherlands
Stevan Rudinac
Delft University of Technology, Delft, The Netherlands
Alan Hanjalic
Delft University of Technology, Delft, The Netherlands
Cynthia Liem
University of Amsterdam, Amsterdam, The Netherlands
Marcel Worring
Reykjavik University, Reykjavik, Iceland
Björn Þór Jónsson
Microsoft Research Lab – Asia, Beijing, China
Bei Liu
The University of Tokyo, Tokyo, Japan
Yoko Yamakata

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lokoč, J., Vopálková, Z., Stroh, M., Buchmueller, R., Schlegel, U. (2024). PraK Tool: An Interactive Search Tool Based on Video Data Services. In: Rudinac, S., et al. MultiMedia Modeling. MMM 2024. Lecture Notes in Computer Science, vol 14557. Springer, Cham. https://doi.org/10.1007/978-3-031-53302-0_30

Download citation

DOI: https://doi.org/10.1007/978-3-031-53302-0_30
Published: 29 January 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53301-3
Online ISBN: 978-3-031-53302-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

PraK Tool: An Interactive Search Tool Based on Video Data Services