Abstract
Vector-space word representations have recently achieved notable success in improving performance across a variety of NLP tasks. However, existing methods for learning word embeddings rely only on monolingual corpora. Inspired by transfer learning, we propose a novel method for obtaining word embeddings via language transfer: to obtain word embeddings for one language (the target language), we instead train models on a corpus of a different language (the source language), and then use the resulting source-language embeddings to represent target-language words. We evaluate the embeddings produced by the proposed method on word similarity tasks across several benchmark datasets. The results show that our method is surprisingly effective, outperforming competitive baselines by a large margin. A further benefit of our method is that collecting a new corpus for the target language may be unnecessary.
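The abstract does not specify how target-language words are mapped onto source-language embeddings, so the following is only a minimal sketch of one plausible reading of the transfer scheme: train simple co-occurrence embeddings on a toy source-language corpus, then let target-language words inherit the embedding of a source-language translation via a hypothetical bilingual dictionary (`dictionary` below is an illustrative assumption, not the paper's method).

```python
import numpy as np

def train_embeddings(corpus, window=2):
    """Build toy co-occurrence count embeddings from a tokenized corpus."""
    vocab = sorted({w for sent in corpus for w in sent})
    idx = {w: i for i, w in enumerate(vocab)}
    M = np.zeros((len(vocab), len(vocab)))
    for sent in corpus:
        for i, w in enumerate(sent):
            for j in range(max(0, i - window), min(len(sent), i + window + 1)):
                if i != j:
                    M[idx[w], idx[sent[j]]] += 1.0
    # L2-normalize rows so cosine similarity reduces to a dot product
    norms = np.linalg.norm(M, axis=1, keepdims=True)
    norms[norms == 0] = 1.0
    return {w: M[idx[w]] / norms[idx[w]] for w in vocab}

# Toy English "source language" corpus
source_corpus = [
    "the cat sat on the mat".split(),
    "the dog sat on the rug".split(),
]
source_emb = train_embeddings(source_corpus)

# Hypothetical bilingual dictionary: Spanish (target) -> English (source)
dictionary = {"gato": "cat", "perro": "dog"}

# Target-language words inherit the embedding of their source translation
target_emb = {t: source_emb[s] for t, s in dictionary.items()}

sim = float(target_emb["gato"] @ target_emb["perro"])
print(f"cosine(gato, perro) = {sim:.3f}")
```

Because "cat" and "dog" occur in near-identical contexts in this toy corpus, the transferred embeddings for "gato" and "perro" come out highly similar, which is the behavior a word similarity benchmark would reward.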
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
Cite this paper
Li, C., Xu, B., Wu, G., Wang, X., Ge, W., Li, Y. (2014). Obtaining Better Word Representations via Language Transfer. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2014. Lecture Notes in Computer Science, vol 8403. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54906-9_11
DOI: https://doi.org/10.1007/978-3-642-54906-9_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54905-2
Online ISBN: 978-3-642-54906-9