Abstract
This paper describes an Indonesian-English cross-language information system and investigates the problem of cross-language document retrieval for this language pair. Our work is based on an Indonesian-English parallel corpus and applies a mutual information technique. The parallel corpus was built using a machine translation tool. Indonesian queries were translated into English by means of a bilingual word list that was created using the mutual information technique. Experimental results show that translating Indonesian queries into English with this bilingual word list achieved 41.9% of the performance of the monolingual queries.
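The abstract does not say how the mutual information scores were computed or how the word list was applied. As a rough illustration only, the sketch below derives a bilingual word list from sentence-aligned pairs using pointwise mutual information, then uses it to translate a query word by word. The function names, the toy corpus, the `min_count` threshold, and the choice to keep unknown words unchanged are all assumptions made for this example, not details taken from the paper.

```python
import math
from collections import Counter
from itertools import product


def build_word_list(pairs, min_count=2):
    """Map each source word to the target word with the highest PMI.

    pairs: list of (source_sentence, target_sentence) strings (hypothetical input format).
    min_count: assumed threshold; word pairs co-occurring in fewer aligned
               sentences are ignored, since PMI overrates rare events.
    """
    n = len(pairs)
    src_count, tgt_count, joint_count = Counter(), Counter(), Counter()
    for src, tgt in pairs:
        src_words = set(src.lower().split())
        tgt_words = set(tgt.lower().split())
        src_count.update(src_words)   # sentences containing each source word
        tgt_count.update(tgt_words)   # sentences containing each target word
        joint_count.update(product(src_words, tgt_words))  # co-occurrences

    best = {}
    for (s, t), c in joint_count.items():
        if c < min_count:
            continue
        # PMI = log2( P(s, t) / (P(s) * P(t)) ), probabilities over sentence pairs
        pmi = math.log2(c * n / (src_count[s] * tgt_count[t]))
        if s not in best or pmi > best[s][1]:
            best[s] = (t, pmi)
    return {s: t for s, (t, _) in best.items()}


def translate_query(query, word_list):
    """Translate word by word; words missing from the list are kept as-is (assumption)."""
    return " ".join(word_list.get(w, w) for w in query.lower().split())


# Toy sentence-aligned corpus, invented for illustration.
corpus = [
    ("saya makan nasi", "i eat rice"),
    ("saya minum air", "i drink water"),
    ("mereka makan roti", "they eat bread"),
    ("mereka minum susu", "they drink milk"),
]

word_list = build_word_list(corpus)
print(translate_query("saya makan", word_list))
```

With such a small corpus, only words that occur in at least two sentence pairs receive a translation; a real system would need a much larger corpus and some way to resolve ties and multi-word expressions.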
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Sari, S., Adriani, M. (2008). Using Mutual Information Technique in Cross-Language Information Retrieval. In: Buchanan, G., Masoodian, M., Cunningham, S.J. (eds) Digital Libraries: Universal and Ubiquitous Access to Information. ICADL 2008. Lecture Notes in Computer Science, vol 5362. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-89533-6_28
DOI: https://doi.org/10.1007/978-3-540-89533-6_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-89532-9
Online ISBN: 978-3-540-89533-6
eBook Packages: Computer Science (R0)