Abstract
This paper presents a tool relying on data service architecture, where technical details of all VBS datasets are completely hidden behind an abstract stateless data layer. The data services allow independent development of interactive search interfaces and refinement techniques, which is demonstrated by a smart front-end component. The component supports common search features and allows users to exploit content-based statistics for effective filtering. We believe that video data services might be a valuable addition to the open-source VBS toolkit, especially when available for the competition on a shared server with all VBS datasets, extracted features, and meta-data behind.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Amato, G.: VISIONE at video browser showdown 2023. In: Dang-Nguyen, D.T., et al. (eds.) MMM 2023. LNCS, vol. 13833, pp. 615–621. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-27077-2_48
Amato, G., et al.: VISIONE Feature Repository for VBS: Multi-Modal Features and Detected Objects from MVK Dataset (2023). https://doi.org/10.5281/zenodo.8355037
Amato, G., et al.: VISIONE Feature Repository for VBS: Multi-Modal Features and Detected Objects from V3C1+V3C2 Dataset (2023). https://doi.org/10.5281/zenodo.8188570
Chernoff, H.: The use of faces to represent points in k-dimensional space graphically. J. Am. Stat. Assoc. 68(342), 361–368 (1973). https://doi.org/10.1080/01621459.1973.10482434
Cox, I.J., Miller, M.L., Minka, T.P., Papathomas, T.V., Yianilos, P.N.: The Bayesian image retrieval system, PicHunter: theory, implementation, and psychophysical experiments. IEEE Trans. Image Process. 9(1), 20–37 (2000)
Heller, S., et al.: Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th video browser showdown. Int. J. Multim. Inf. Retr. 11(1), 1–18 (2022). https://doi.org/10.1007/s13735-021-00225-2
Ilharco, G., et al.: Openclip (2021). https://doi.org/10.5281/zenodo.5143773, if you use this software, please cite it as below
Kratochvíl, M., Mejzlík, F., Veselý, P., Souček, T., Lokoč, J.: SOMHunter: lightweight video search system with SOM-guided relevance feedback. In: Proceedings of the 28th ACM International Conference on Multimedia, MM 2020. ACM (2020, in press)
Lokoč, J., Mejzlík, F., Souček, T., Dokoupil, P., Peška, L.: Video search with context-aware ranker and relevance feedback. In: Þór Jónsson, B., et al. (eds.) MMM 2022. LNCS, vol. 13142, pp. 505–510. Springer, Cham (2022). https://doi.org/10.1007/978-3-030-98355-0_46
Lokoč, J., Kovalčík, G., Souček, T., Moravec, J., Čech, P.: A framework for effective known-item search in video. In: Proceedings of the 27th ACM International Conference on Multimedia (MM 2019), Nice, France, 21–25 October 2019, pp. 1–9 (2019). https://doi.org/10.1145/3343031.3351046
Ma, Z., Wu, J., Loo, W., Ngo, C.W.: Reinforcement learning enhanced PicHunter for interactive search. In: Conference on Multimedia Modeling (2023)
Pantelidis, N., et al.: VERGE in VBS 2023. In: Dang-Nguyen, D.T., et al. (eds.) MMM 2023. LNCS, vol. 13833, pp. 658–664. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-27077-2_55
Radford, A., et al.: Learning transferable visual models from natural language supervision. CoRR abs/2103.00020 (2021). https://arxiv.org/abs/2103.00020
Rossetto, L., et al.: Interactive video retrieval in the age of deep learning–detailed evaluation of VBS 2019. IEEE Trans. Multimed. 23, 243–256 (2020). https://doi.org/10.1109/TMM.2020.2980944
Rossetto, L., Schuldt, H., Awad, G., Butt, A.A.: V3C – a research video collection. In: Kompatsiaris, I., Huet, B., Mezaris, V., Gurrin, C., Cheng, W.-H., Vrochidis, S. (eds.) MMM 2019. LNCS, vol. 11295, pp. 349–360. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-05710-7_29
Sauter, L., et al.: Exploring effective interactive text-based video search in vitrivr. In: Dang-Nguyen, D.T., et al. (eds.) MMM 2023. LNCS, vol. 13833, pp. 646–651. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-27077-2_53
Schall, K., Hezel, N., Jung, K., Barthel, K.U.: Vibro: video browsing with semantic and visual image embeddings. In: Dang-Nguyen, D.T., et al. (eds.) MMM 2023. LNCS, pp. 665–670. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-27077-2_56
Truong, Q.T., et al.: Marine video kit: a new marine video dataset for content-based analysis and retrieval. In: Dang-Nguyen, D.T., et al. (eds.) MMM 2023. LNCS, vol. 13833, pp. 539–550. Springer, Cham (2023). https://doi.org/10.1007/978-3-031-27077-2_42
Acknowledgment
This work has been supported by Czech Science Foundation (GAČR) project 22-21696S. We would like to thank the authors of the discussed VBS systems for clarification of architecture details.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Lokoč, J., Vopálková, Z., Stroh, M., Buchmueller, R., Schlegel, U. (2024). PraK Tool: An Interactive Search Tool Based on Video Data Services. In: Rudinac, S., et al. MultiMedia Modeling. MMM 2024. Lecture Notes in Computer Science, vol 14557. Springer, Cham. https://doi.org/10.1007/978-3-031-53302-0_30
Download citation
DOI: https://doi.org/10.1007/978-3-031-53302-0_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53301-3
Online ISBN: 978-3-031-53302-0
eBook Packages: Computer ScienceComputer Science (R0)