Abstract
Conversational search system seek to support users in their search activities to improve the effectiveness and efficiency of search while reducing their cognitive load. The challenges of multimedia search mean that search supports provided by conversational search have the potential to improve the user search experience. For example, by assisting users in constructing better queries and making more informed decisions in relevance feedback stages whilst searching. However, previous research on conversational search has been focused almost exclusively on text archives. This demonstration illustrates the potential for the application of conversational methods in multimedia search. We describe a framework to enable multimodal conversational search for use with multimedia archives. Our current prototype demonstrates the use of an conversational AI assistant during the multimedia information retrieval process for both image and video collections.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hanjalic, A., Lienhart, R., Ma, W.Y., Smith, J.R.: The holy grail of multimedia information retrieval: so close or yet so far away? Proc. IEEE 96(4), 541–547 (2008). https://doi.org/10.1109/JPROC.2008.916338
Hodosh, M., Young, P., Hockenmaier, J.: Framing image description as a ranking task: data, models and evaluation metrics. J. Artif. Intell. Res. 47, 853–899 (2013)
Kaushik, A., Jacob, B., Velavan, P.: An exploratory study on a reinforcement learning prototype for multimodal image retrieval using a conversational search interface. Knowledge 2(1), 116–138 (2022)
Kim, H., Kim, D., Yoon, S., Dernoncourt, F., Bui, T., Bansal, M.: Caise: conversational agent for image search and editing. In: Proceedings of the The Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI-2022) (2022)
Nie, L., Jiao, F., Wang, W., Wang, Y., Tian, Q.: Conversational image search. IEEE Trans. Image Process. 30, 7732–7743 (2021)
Radlinski, F., Craswell, N.: A theoretical framework for conversational search. In: Proceedings of the 2017 Conference on Conference Human Information Interaction and Retrieval, pp. 117–126 (2017)
Xu, J., Mei, T., Yao, T., Rui, Y.: MSR-VTT: a large video description dataset for bridging video and language. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5288–5296 (2016)
Zamani, H., Trippas, J.R., Dalton, J., Radlinski, F.: Conversational information seeking. arXiv preprint arXiv:2201.08808 (2022)
Acknowledgments
This work was conducted with the financial support of the Science Foundation Ireland Centre for Research Training in Digitally-Enhanced Reality (d-real) under Grant No. 18/CRT/6224, and partially as part of the ADAPT Centre at DCU (Grant No. 13/RC/2106_P2) (www.adaptcentre.ie). For the purpose of Open Access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Potyagalova, A., Jones, G.J.F. (2024). A Conversational Search Framework for Multimedia Archives. In: Goharian, N., et al. Advances in Information Retrieval. ECIR 2024. Lecture Notes in Computer Science, vol 14612. Springer, Cham. https://doi.org/10.1007/978-3-031-56069-9_25
Download citation
DOI: https://doi.org/10.1007/978-3-031-56069-9_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-56068-2
Online ISBN: 978-3-031-56069-9
eBook Packages: Computer ScienceComputer Science (R0)