Fusion of Multimedia Document Intra-Modality Relevancies using Linear Combination Model

Rashid, Umer; Niaz, Iftikhar Azim; Bhatti, Muhammad Afzal

doi:10.1007/978-90-481-3660-5_98

Umer Rashid²,
Iftikhar Azim Niaz² &
Muhammad Afzal Bhatti³

2418 Accesses

Abstract

Information retrieval systems that search within multimedia artifacts face inter-modality fusion problem. Fuzzy logic, sequential and linear combinational techniques are used for inter-modality fusion. We explore asymptotically that Fuzzy logic and sequential techniques have limitations. One major limitation is that they only address fusion of documents coming from different modalities, not document relevancies distributed in different modality relevancy spaces. Linear combinational techniques fuse document relevancies distributed within different relevancy spaces using inter-modality weights and calculate composite relevancies. Inter-modality weights can be calculated using several offline and online techniques. Offline techniques mostly use machine learning techniques and adjust weights before search process. Online techniques calculate inter-modality weights within search process and are satisfactory for general purpose information retrieval systems. We investigate asymptotically that linear combination technique for inter-modality fusion outperforms fuzzy logic techniques and workflows. We explore a variation of linear combination technique based on ratio of average arithmetic means of document relevancies. Our proposed technique smoothes the effect of inter-modality weights and provides a moderate mechanism of inter-modality fusion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Investigating Non-classical Correlations Between Decision Fused Multi-modal Documents

Fusion Strategies for Large-Scale Multi-modal Image Retrieval

Multimodal recommendation algorithm based on Dempster-Shafer evidence theory

Article 07 September 2023

References

Marchionini, G. & Geisler, G. 2002. “The Open Video Digital Library”. D-Lib Magazine, Volume 8 Number 12.
Google Scholar
Wilfred Owen digital archive to expand. (1993, Feb 22). “BLUEPRINT the newsletter of the University of Oxford”, Oxford Press, pp. 4.
Google Scholar
Mahmood, T.S., Srinivasan, S., Ami, A., Ponceleon, D., Blanchard, B., and Petkovicstf. D. (2000). “CueVideo: A System for Cross-Modal Search and Browse of Video”. In CVPR’00, IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 2786.
Google Scholar
Bruno, E., Loccoz, N.M., & Maillet, S.M. (2005). “Interactive video retrieval based on multimodal dissimilarity Interactive video retrieval based on multimodal dissimilarity”. In MLMM’05, workshop Machine Learning Techniques for Processing Multimedia Content.
Google Scholar
[5] Westerveld, T., Vries, A.P.D., A.R.V., Ballegooij, A.R.V., Jong, F.M.G.D., & Hiemstra, D.2003. “A Probabilistic Multimedia Retrieval Model and its Evaluation”. Journal on Applied Signal Processing, special issue on Unstructured Information Management from Multimedia Data Source, 2003(3), pp. 186-198.
Google Scholar
[6] Chen, N. (2006). A Survey of Indexing and Retrieval of Multimodal Documents: Text and Images (2006-505). Canada, Queen’s University.
Google Scholar
Srihari, R.K, Rao, A., Han, B., Munirathnam, S., & Wu, X.2000. “A Model for Multimodal Information Rettrieval”. In ICME’00, IEEE International Conference on Multimedia and Expo, pp. 701-704.
Google Scholar
James L. Hein. Discrete Structures, Logic, and Computability. 2 ^nd Edition, Portland State University.
Google Scholar
Thomas H. Cormen, Charles E. Leiserson, Ronald L. R. Introduction to Algorithms. Second Edition, MIT press.
Google Scholar
[10] R.B. Yates & B.R. Neto. Modern Information Retrieval. New York, 1999, ACM Press.
Google Scholar
Chen, M., Christel, M., Hauptmann, A., & Wactlar. H. (2005). “Putting Active Learning into Multimedia Applications: Dynamic Definition and Refinement of Concept Classifiers”. In MM’05, 5 ^th ACM International Conference on Multimedia pp. 902-911
Google Scholar
Christel, M.G. & and Conescu, R.M.(2005). “Addressing the Challenge of Visual Information Access from Digital Image and Video Libraries”. In JCDL’05, ACM/IEEE-CS Joint Conference on Digital Libraries. pp. 69-78.
Google Scholar
Chen, J.Y, Carlis, J.V, Gao,N. (2005). “A Complex Biological DataBase Quering Method”. In SAC’05, Symposium on Applied Computing, pp. 110-114.
Google Scholar
Bruno_b, E., Loccoz, N.M., & Maillet, S.M. (2005). Learning User Queries in Multimodal Dissimilarity Spaces. In AMR’05, 3rd International Workshop on Adaptive Multimedia Retrieval.
Google Scholar
Bushman, F., Menuier, R., Rohnert, H., Sommerland, P., & Stal, M. (1996). PATTERN-ORIENTED SOFTWARE ARCHITECTURE A System of Patterns. Singapore, Wiley, volume 1.
Google Scholar
Li, C.S., Chang, Y.C, Smith, J.R. & Hill, M. (2001). SPIRE/EPI-SPIRE Model-Based Multi-modal Information Retrieval. Retrieved form IBM T. J. Watson Research Centre Website: www.rocq.inria.fr/imedia/mmcbir2001/FinalpaperLi.pdf
Chen, M., Christel, M., Hauptmann, A., & Wactlar. H. (2005). Putting Active Learning into Multimedia Applications: Dynamic Definition and Refinement of Concept Classifiers. In MM’05, 5th ACM International Conference on Multimedia, pp. 902-911.
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Computing, Riphah International University, Islamabad, Pakistan
Umer Rashid & Iftikhar Azim Niaz
Department of Computer Science, Quaid-i-Azam University, Islamabad, Pakistan
Muhammad Afzal Bhatti

Authors

Umer Rashid
View author publications
You can also search for this author in PubMed Google Scholar
Iftikhar Azim Niaz
View author publications
You can also search for this author in PubMed Google Scholar
Muhammad Afzal Bhatti
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Umer Rashid .

Editor information

Editors and Affiliations

School of Engineering, University of Bridgeport, University Avenue 221, Bridgeport, 06604, U.S.A.
Khaled Elleithy

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rashid, U., Niaz, I.A., Bhatti, M.A. (2010). Fusion of Multimedia Document Intra-Modality Relevancies using Linear Combination Model. In: Elleithy, K. (eds) Advanced Techniques in Computing Sciences and Software Engineering. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-3660-5_98

Download citation

DOI: https://doi.org/10.1007/978-90-481-3660-5_98
Published: 15 December 2009
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-3659-9
Online ISBN: 978-90-481-3660-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics