Abstract
In this paper, we propose an improved information retrieval model, where the integration of modification-words and head-words is introduced into the representation of user queries and the traditional vector space model. We show how to calculate the weights of combined terms in vectors. We also propose a new strategy to construct the thesaurus in a fuzzy way for query expansion. Through the developed information retrieval system, we can retrieve documents in a relatively narrow search space and meanwhile extend the coverage of the retrieval to the related documents that do not necessarily contain the same terms as the given query. Experiments for testing the retrieval effectiveness have been implemented by using benchmark corpora. Experimental results show that the improved information retrieval system is capable of improving the retrieval performance both in precision and recall rates.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kraft, D.H., Petry, F.E.: Fuzzy information systems: managing uncertainty in databases and information retrieval systems. Fuzzy Sets and Systems 90, 183–191 (1997)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM 18, 613–620 (1975)
Salton, G., Buckley, C.: Term-weighting in information retrieval using the term precision model. Journal of the Association for Computing Machinery 29, 152–170 (1982)
Papadimitriou, C.H., Raghavan, P., Tamaki, H., Vempala, S.: Latent semantic indexing: A probabilistic analysis. Journal of Computer and System Sciences 61, 217–235 (2000)
Letsche, T.A., Berry, M.W.: Large-scale information retrieval with latent semantic indexing. Information Sciences 100, 105–137 (1997)
Chandren-Muniyandi, R., Komputer, J.S., Maklumat, F.T.: dan S.: Neural network: An exploration in document retrieval system. Proceedings of TENCON 2000 1, 156–160 (2000)
Ramirez, C., Cooley, R.: Case-based reasoning model applied to information retrieval. In: IEE Colloquium on Case Based Reasoning: Prospects for Applications, pp. 9/1 -9/3 (1995)
Liu, G.: The semantic vector space model (SVSM)(A text representation and searching technique. In: Proceedings of the Twenty-Seventh Hawaii International Conference on System Sciences, Vol. IV: Information Systems: Collaboration Technology Organizational Systems and Technology, vol. 4, pp. 928–937 (1994)
Mandala, R., Tokunaga, T., Tanaka, H.: Query expansion using heterogeneous thesauri. Information Processing and Management 36, 361–378 (2000)
Kim, M.C., Choi, K.S.: A comparison of collocation-based similarity measures in query expansion. Information Processing and Management 35, 19–30 (1999)
Qiu, Y., Frei, H.: Concept based query expansion. In: Proceedings of the 16th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, pp. 160–169 (1993)
Nie, J.Y., Jin, F.: Integrating logical operators in query expansion in vector space model. In: Workshop on Mathematical/Formal Methods in Information Retrieval, 25th ACM-SIGIR. (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, J., Tanioka, H., Wang, S., Pan, D., Yamamoto, K., Wang, Z. (2005). An Improved VSM Based Information Retrieval System and Fuzzy Query Expansion. In: Wang, L., Jin, Y. (eds) Fuzzy Systems and Knowledge Discovery. FSKD 2005. Lecture Notes in Computer Science(), vol 3613. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11539506_68
Download citation
DOI: https://doi.org/10.1007/11539506_68
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28312-6
Online ISBN: 978-3-540-31830-9
eBook Packages: Computer ScienceComputer Science (R0)