A Novel Method for Cross-Language Retrieval of Chunks Using Monolingual and Bilingual Corpora

Miangah, Tayebeh Mosavi; Nezarat, Amin

doi:10.1007/978-3-642-15766-0_45

Tayebeh Mosavi Miangah³ &
Amin Nezarat⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 101))

Included in the following conference series:

International Conference on Advances in Information and Communication Technologies

1239 Accesses

Abstract

Information retrieval (IR) is a crucial area of natural language processing (NLP). One of the fundamental issues in bilingual retrieving of information in search engines seems to be the way and the extent users call for phrases and chunks. The main problem arises when the existing bilingual dictionaries are not able to meet the users’ actual needs for translating such phrases and chunks into an alternative language and the results often are not reliable. In this project a heuristic method for extracting the correct equivalents of source language chunks using monolingual and bilingual linguistic corpora as well as text classification algorithms is to be introduced. Experimental results revealed that our method gained the accuracy rate of 86.13% which seems very encouraging.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Alizade, H., et al.: Studying the efficiency of the existing methods in cross-language information retrieval using a machine-readable bilingual dictionary. Iranian Information and Documentation Centre 25(1), 53–70 (2009)
Google Scholar
Chen, H.: Chinese information extraction Techniques. Presented at the SSIMIP, Singapore (2002)
Google Scholar
Hull, D., Grefenstette, G.: Querying Across Languages; A Dictionary – Based Approach to Multilingual Information Retrieval. In: Proceedings of the 19th Annual International ACM Sigir, Zurich, Switzerland, pp. 49–57 (1996)
Google Scholar
Mosavi Miangah, T.: Automatic term extraction for cross-language information retrieval using a bilingual parallel corpus. In: Proceedings of the 6th International Conference on Informatics and Systems (INFOS 2008), Cairo, Egypt, pp. 81–84 (2008)
Google Scholar
Mosavi Miangah, T.: Constructing a large-scale English-Persian Parallel Corpus. META 54(1), 181–188 (2009)
Google Scholar
Manning, C.D., Raghavan, P., Schütze, H.: An Introduction to Information Retrieval. Cambridge University Press, Cambridge (2009)
Google Scholar
Shams, M., Pourmahmoud, S.: A linguistic-conceptual approach for cross-language information retrieval. In: Proceedings of the 13th National Conference of Computer Society of Iran, pp. 1–8. Kish Island, Iran (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

English Language Department, Payame Noor University, Yazd, Iran
Tayebeh Mosavi Miangah
Information Technology Department, Shiraz University, Shiraz, Iran
Amin Nezarat

Authors

Tayebeh Mosavi Miangah
View author publications
You can also search for this author in PubMed Google Scholar
Amin Nezarat
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Engineers Network, Trivandrum, Kerala, India
Vinu V Das
NSS College of Engineering, Palakkadu, India
R. Vijaykumar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Miangah, T.M., Nezarat, A. (2010). A Novel Method for Cross-Language Retrieval of Chunks Using Monolingual and Bilingual Corpora. In: Das, V.V., Vijaykumar, R. (eds) Information and Communication Technologies. ICT 2010. Communications in Computer and Information Science, vol 101. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15766-0_45

Download citation

DOI: https://doi.org/10.1007/978-3-642-15766-0_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15765-3
Online ISBN: 978-3-642-15766-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics