Abstract
Automated document classification process extracts information with a systematic analysis of the content of documents.
This is an active research field of growing importance due to the large amount of electronic documents produced in the world wide web and available thanks to diffused technologies including mobile ones.
Several application areas benefit from automated document classification, including document archiving, invoice processing in business environments, press releases and research engines.
Current tools classify or ”tag” either text or images separately.In this paper we show how, by linking image and text-based contents together, a technology improves fundamental document management tasks like retrieving information from a database or automated documents.
We present an investigation of a model of conceptual spaces for investigation using joint information sources from the text and the images forming complex documents. We present a formal model and the computable algorithms and the dataset from which we took a subset to make experiments and relative tests and results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ye, Q., Huang, Q., Gao, W., Zhao, D.: Fast and robust text detection in images and video frames. Image and Vision Computing 23, 565–576 (2005)
Ah-Pine, J., Bressan, M., Clinchant, S., Csurka, G., Hoppenot, Y., Renders, J.-M.: Crossing textual and visual content in different application scenarios. Multimedia Tools and Applications 42, 31–56 (2009)
Qi, C., Aggarwal, G., Tian, Q., Ji, H., Huang, T.: Exploring context and content links in social media: A latent space method. IEEE Transactions Pattern Analysis and Machine Intelligence (August 2011)
Kesorn, S., Poslad, K.: An enhanced bag of visual word vector space model to represent visual content in athletics images. IEEE Transactions on Multimedia (October 2011)
Denoyer, L., Gallinari, P.: Bayesian network model for semi-structured document classification. Information Processing and Management 40, 807–827 (2004)
Bouguila, N., ElGuebaly, W.: Discrete data clustering using finite mixture models. Pattern Recognition 42, 33–42 (2009)
Mikhailov, D.V., Emelyanov, G.M.: Semantic clustering and affinity measure of subject-oriented language texts. Pattern Recognition and Image Analysis 20, 376–385 (2010)
Yang, L., Geng, Y., Cai, B., Hanjalic, A.: Object retrieval using visual query context. IEEE Transactions on Multimedia (July 2011)
Qin, J., Yung, N.H.C.: Scene categorization via contextual visual words. Pattern Recognition 43, 1874–1888 (2010)
Park, G., Baek, Y., Lee, H.-K.: Web image retrieval using majority-based ranking approach. Multimedia Tools and Applications 31, 195–219 (2006)
Chan, W., Coghill, G.: Text analysis using local energy. Pattern Recognition 34, 2523–2532 (2001)
Aronovich, L., Spiegler, I.: Cm-tree: A dynamic clustered index for similarity search in metric databases. Data & Knowledge Engineering 63, 919–946 (2007)
Sable, C.L., Hatzivassiloglou, V.: Text-based approaches for non-topical image categorization. International Journal on Digital Libraries 3, 261–275 (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Cristani, M., Tomazzoli, C. (2014). A Multimodal Approach to Exploit Similarity in Documents. In: Ali, M., Pan, JS., Chen, SM., Horng, MF. (eds) Modern Advances in Applied Intelligence. IEA/AIE 2014. Lecture Notes in Computer Science(), vol 8481. Springer, Cham. https://doi.org/10.1007/978-3-319-07455-9_51
Download citation
DOI: https://doi.org/10.1007/978-3-319-07455-9_51
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07454-2
Online ISBN: 978-3-319-07455-9
eBook Packages: Computer ScienceComputer Science (R0)