Abstract
The paper presents a novel approach in which images are integrated with a dialogue interface that enables them to communicate with the user. The structure of the corresponding dialogue system is supported by graphical ontologies and enables the system to learn from the dialogues. The Internet is used to retrieve additional information about the images and to solve more complex tasks that draw on other relevant knowledge. The paper also addresses some problems that arise from the system-initiative dialogue mode and discusses the structure and algorithms of the dialogue system. Examples and applications of the approach are presented as well.
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kopeček, I., Ošlejšek, R., Plhák, J. (2012). Integrating Dialogue Systems with Images. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2012. Lecture Notes in Computer Science, vol 7499. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32790-2_77
DOI: https://doi.org/10.1007/978-3-642-32790-2_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32789-6
Online ISBN: 978-3-642-32790-2
eBook Packages: Computer Science, Computer Science (R0)