Abstract
Traditional bilingual snippets retrieval method is to select the top N snippets returned from web search engine as bilingual corpora. In this paper, an improved bilingual snippets retrieval method is proposed. It combines term expansion, the surface pattern matching model and top N bilingual snippets selection to retrieve bilingual snippets. More relative bilingual snippets can be found by using term expansion, and surface pattern matching is useful to select bilingual snippets according to the constitution of unknown term translation. The top 100 bilingual snippets and those satisfy the surface pattern matching model serve as the last bilingual corpora for term translation mining. Experimental results show that the improved bilingual snippets retrieving method improves the top 100 inclusion rate by 2.3% than baseline system, which verified that the improved bilingual snippets retrieving method is effective.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ballesteros, L., Croft, W.B.: Phrasal Translation and Query Expansion Techniques for Cross-Language Information Retrieval. In: Proc. of 20th International ACM SIGIR Conference on Research and Development in Information Retrieval, Philadelphia, USA, pp. 84–91 (1997)
Gamallo Otero, P.: Using Natural Alignment to Extract Translation Equivalents. In: Vieira, R., Quaresma, P., das Graças Volpe Nunes, M., Mamede, N.J., Oliveira, C., Dias, M.C. (eds.) PROPOR 2006. LNCS (LNAI), vol. 3960, pp. 41–49. Springer, Heidelberg (2006)
Nagata, M., Saito, T., Suzuki, K.: Using the web as a bilingual dictionary. In: Proc. ACL 2001 Workshop Data-Driven Methods in Machine Translation, pp. 95–102 (2001)
Cheng, P.J., Teng, J.W., Chen, R.C., Wang, J.H., Lu, W.H., Chien, L.F.: Translating un-known queries with web corpora for cross-language information retrieval. In: The Proceedings of 27th ACM SIGIR, pp. 146–153. ACM Press (2004)
Lu, C.Y., Xu,Y., Geva, S.: Web-Based Query Translation fro English-Chinese CLIR. Computational Linguistics and Chinese Language Processing 13(1), 61–90 (2008)
Huang, F., Zhang, Y., Vogel, S.: Mining Key Phrase Transla-tion from Web Corpora. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), pp. 483–490 (2005)
Fang, G.L., Yu, H., Nishino, F.: Chinese-English Term Translation Mining Based on Semantic Prediction. In: Proceedings of the COLING/ACL 2006 Main Conference Poster Session, pp. 199–206 (2006)
Sun, J., Yao, J.M., Zhang, J., Zhu, Q.M.: Web Mining of OOV translations. Journal of Information & Computational Science 5(1), 1–6 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Liang, YH., Li, JX., Ye, L., Chen, K., Guo, CZ. (2012). The English Unknown Term Translation Mining with Improved Bilingual Snippets Collection Strategy. In: Huang, DS., Ma, J., Jo, KH., Gromiha, M.M. (eds) Intelligent Computing Theories and Applications. ICIC 2012. Lecture Notes in Computer Science(), vol 7390. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31576-3_52
Download citation
DOI: https://doi.org/10.1007/978-3-642-31576-3_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31575-6
Online ISBN: 978-3-642-31576-3
eBook Packages: Computer ScienceComputer Science (R0)