Abstract
Machine translation (MT) has already been widely applied in various fields, such as businesses, health, communication, and international relations. However, its performance is not satisfactory in specific domains. Researchers have devised different solutions, such as controlled languages (CLs). Nonetheless, as the previous CLs have placed too much emphasis on words or sentences, the author believes it is important to consider contextual factors. Therefore, the author created a controlled document authoring system that integrates document formalization, CLs, MT, and terminology management. Evaluation work was then conducted to assess whether the system was effective and satisfactory. This research provides fascinating insights into CLs, MT, technical writing, terminology management, translation technology, and natural language processing applications.
References
Bernth, A. (2006). EasyEnglishAnalyzer: Taking controlled language from sentence to discourse level, Proceedings of the 5th Conference on Applied Natural Language Processing (ANLP), pp. 159–165.
Crabbe, S. (2017). Controlling Language in Industry. Springer International Publishing. https://doi.org/10.1007/978-3-319-52745-1
Kando, N. (1997). Text-level structure of research papers: Implications for text-based information processing systems. Proceedings of the 19th Annual BCS-IRSG Colloquium on IR Research, pp. 1–14.
Marzouk, S. (2021). An in-depth analysis of the individual impact of controlled language rules on machine translation output: A mixed-methods approach. Machine Translation, 35(2), 167–203. https://doi.org/10.1007/s10590-021-09266-0
OASIS. (2010). Darwin Information Typing Architecture (DITA) Version 1.2. http://docs.oasis-open.org/dita/v1.2/os/spec/DITA1.2-spec.html
O'Brien, S. (2003). Controlling Controlled English: An Analysis of Several Controlled Language Rule Sets. Proceedings of the Joint Conference Combining the 8th International Workshop of the European Association for Machine Translation and the 4th Controlled Language Applications Workshop (EAMT/CLAW), pp. 105–114.
O'Brien, S. (2010). Controlled language and readability. In G. M. Shreve & E. Angelone (Eds.), American Translators Association Scholarly Monograph Series (pp. 143–165). John Benjamins Publishing Company. https://doi.org/10.1075/ata.xv.08obr
Swales, J. (1990). Genre analysis: English in academic and research settings. Cambridge University Press.
Warburton, K. (2014). Developing lexical resources for controlled authoring purposes. Proceedings of LREC 2014 Workshop: Controlled Natural Language Simplifying Language Use, pp. 90–103.
Funding
The authors did not receive support from any organization for the submitted work. No funding was received to assist with the preparation of this manuscript. No funding was received for conducting this study. No funds, grants, or other support were received.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Huang, X., Xu, H. Rei Miyata: controlled document authoring in a machine translation age. Lang Resources & Evaluation 57, 1423–1430 (2023). https://doi.org/10.1007/s10579-022-09598-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10579-022-09598-0