Assisting Text Localization and Transcreation Tasks Using AI-Based Masked Language Modeling

  • Conference paper
Artificial Intelligence in HCI (HCII 2021)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 12797)

Abstract

Localization refers to the adaptation of a document’s content to meet the linguistic, cultural, and other requirements of a specific target market, known as a locale. Transcreation describes the process of adapting a message from one language to another while maintaining its intent, style, tone, and context. In recent years, pre-trained language models have pushed the limits of natural language understanding and generation and have driven much of the progress in NLP. We foresee that AI-based pre-trained language models (e.g., those trained with masked language modeling) and other existing and upcoming language modeling techniques will be integrated as effective tools to support localization and transcreation efforts in the coming years. To support these tasks, we use AI-based Masked Language Modeling (MLM) to provide a human-machine teaming tool that queries language models for the words and phrases that best reflect the linguistic and cultural characteristics of the target language. For linguistic applications, we list examples involving logical connectives, pronouns and antecedents, and unnecessary or redundant nouns and verbs. For intercultural conceptualization applications, we list examples of cultural event schemas, role schemas, emotional schemas, and propositional schemas. There are two possible approaches to deciding where to place masks: a human-based approach and an algorithm-based approach. In the algorithm-based approach, constituency parsing can be used to break a text into sub-phrases, or constituents; typical linguistic patterns can then be detected, and masking tasks can be attempted on the related text.
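
As a rough illustration of the algorithm-based approach to mask placement, the following sketch detects one linguistic pattern (logical connectives) and produces one masked variant of the sentence per match. It is not the authors' implementation: the constituency-parsing step is replaced here by a simple word-list match, and the connective list, the function name `mask_candidates`, and the BERT-style `[MASK]` token are all assumptions made for the example. Each masked sentence could then be passed to a masked language model to suggest replacement words.

```python
import re

# Hypothetical, illustrative list of logical connectives (not taken from the paper).
CONNECTIVES = ("however", "therefore", "because", "although", "moreover")

# BERT-style mask token (an assumption; other models use a different token).
MASK_TOKEN = "[MASK]"


def mask_candidates(sentence, targets=CONNECTIVES, mask_token=MASK_TOKEN):
    """Return (original_word, masked_sentence) pairs, one per matched target word.

    Each masked sentence hides exactly one occurrence, so a masked language
    model can be asked about one position at a time.
    """
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(t) for t in targets) + r")\b",
        re.IGNORECASE,
    )
    results = []
    for match in pattern.finditer(sentence):
        masked = sentence[:match.start()] + mask_token + sentence[match.end():]
        results.append((match.group(0), masked))
    return results


if __name__ == "__main__":
    text = "The plan was costly. However, it was approved because demand was high."
    for word, masked in mask_candidates(text):
        print(f"{word!r}: {masked}")
```

In the human-based approach, a translator would choose the mask positions directly instead of relying on a pattern list; the masked sentences would be used in the same way.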

Author information

Corresponding author

Correspondence to Ming Qian.

Copyright information

© 2021 Springer Nature Switzerland AG

About this paper

Cite this paper

Qian, M., Liu, J. (2021). Assisting Text Localization and Transcreation Tasks Using AI-Based Masked Language Modeling. In: Degen, H., Ntoa, S. (eds) Artificial Intelligence in HCI. HCII 2021. Lecture Notes in Computer Science, vol 12797. Springer, Cham. https://doi.org/10.1007/978-3-030-77772-2_28

  • DOI: https://doi.org/10.1007/978-3-030-77772-2_28

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-77771-5

  • Online ISBN: 978-3-030-77772-2

  • eBook Packages: Computer Science, Computer Science (R0)
