Abstract
Information retrieval systems that search within multimedia artifacts face inter-modality fusion problem. Fuzzy logic, sequential and linear combinational techniques are used for inter-modality fusion. We explore asymptotically that Fuzzy logic and sequential techniques have limitations. One major limitation is that they only address fusion of documents coming from different modalities, not document relevancies distributed in different modality relevancy spaces. Linear combinational techniques fuse document relevancies distributed within different relevancy spaces using inter-modality weights and calculate composite relevancies. Inter-modality weights can be calculated using several offline and online techniques. Offline techniques mostly use machine learning techniques and adjust weights before search process. Online techniques calculate inter-modality weights within search process and are satisfactory for general purpose information retrieval systems. We investigate asymptotically that linear combination technique for inter-modality fusion outperforms fuzzy logic techniques and workflows. We explore a variation of linear combination technique based on ratio of average arithmetic means of document relevancies. Our proposed technique smoothes the effect of inter-modality weights and provides a moderate mechanism of inter-modality fusion.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Marchionini, G. & Geisler, G. 2002. “The Open Video Digital Library”. D-Lib Magazine, Volume 8 Number 12.
Wilfred Owen digital archive to expand. (1993, Feb 22). “BLUEPRINT the newsletter of the University of Oxford”, Oxford Press, pp. 4.
Mahmood, T.S., Srinivasan, S., Ami, A., Ponceleon, D., Blanchard, B., and Petkovicstf. D. (2000). “CueVideo: A System for Cross-Modal Search and Browse of Video”. In CVPR’00, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 2786.
Bruno, E., Loccoz, N.M., & Maillet, S.M. (2005). “Interactive video retrieval based on multimodal dissimilarity Interactive video retrieval based on multimodal dissimilarity”. In MLMM’05, workshop Machine Learning Techniques for Processing Multimedia Content.
[5] Westerveld, T., Vries, A.P.D., A.R.V., Ballegooij, A.R.V., Jong, F.M.G.D., & Hiemstra, D.2003. “A Probabilistic Multimedia Retrieval Model and its Evaluation”. Journal on Applied Signal Processing, special issue on Unstructured Information Management from Multimedia Data Source, 2003(3), pp. 186-198.
[6] Chen, N. (2006). A Survey of Indexing and Retrieval of Multimodal Documents: Text and Images (2006-505). Canada, Queen’s University.
Srihari, R.K, Rao, A., Han, B., Munirathnam, S., & Wu, X.2000. “A Model for Multimodal Information Rettrieval”. In ICME’00, IEEE International Conference on Multimedia and Expo, pp. 701-704.
James L. Hein. Discrete Structures, Logic, and Computability. 2 nd Edition, Portland State University.
Thomas H. Cormen, Charles E. Leiserson, Ronald L. R. Introduction to Algorithms. Second Edition, MIT press.
[10] R.B. Yates & B.R. Neto. Modern Information Retrieval. New York, 1999, ACM Press.
Chen, M., Christel, M., Hauptmann, A., & Wactlar. H. (2005). “Putting Active Learning into Multimedia Applications: Dynamic Definition and Refinement of Concept Classifiers”. In MM’05, 5 th ACM International Conference on Multimedia pp. 902-911
Christel, M.G. & and Conescu, R.M.(2005). “Addressing the Challenge of Visual Information Access from Digital Image and Video Libraries”. In JCDL’05, ACM/IEEE-CS Joint Conference on Digital Libraries. pp. 69-78.
Chen, J.Y, Carlis, J.V, Gao,N. (2005). “A Complex Biological DataBase Quering Method”. In SAC’05, Symposium on Applied Computing, pp. 110-114.
Bruno_b, E., Loccoz, N.M., & Maillet, S.M. (2005). Learning User Queries in Multimodal Dissimilarity Spaces. In AMR’05, 3rd International Workshop on Adaptive Multimedia Retrieval.
Bushman, F., Menuier, R., Rohnert, H., Sommerland, P., & Stal, M. (1996). PATTERN-ORIENTED SOFTWARE ARCHITECTURE A System of Patterns. Singapore, Wiley, volume 1.
Li, C.S., Chang, Y.C, Smith, J.R. & Hill, M. (2001). SPIRE/EPI-SPIRE Model-Based Multi-modal Information Retrieval. Retrieved form IBM T. J. Watson Research Centre Website: www.rocq.inria.fr/imedia/mmcbir2001/FinalpaperLi.pdf
Chen, M., Christel, M., Hauptmann, A., & Wactlar. H. (2005). Putting Active Learning into Multimedia Applications: Dynamic Definition and Refinement of Concept Classifiers. In MM’05, 5th ACM International Conference on Multimedia, pp. 902-911.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media B.V.
About this paper
Cite this paper
Rashid, U., Niaz, I.A., Bhatti, M.A. (2010). Fusion of Multimedia Document Intra-Modality Relevancies using Linear Combination Model. In: Elleithy, K. (eds) Advanced Techniques in Computing Sciences and Software Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-3660-5_98
Download citation
DOI: https://doi.org/10.1007/978-90-481-3660-5_98
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-3659-9
Online ISBN: 978-90-481-3660-5
eBook Packages: Computer ScienceComputer Science (R0)