Abstract
Internet of Things (IoT) networks have been widely deployed to enable communication among machines and humans. Machine translation can enable human-machine interaction for IoT equipment. In this paper, we propose combining neural machine translation (NMT) and statistical machine translation (SMT) to improve translation precision. Our design is a hybrid deep learning (DL) network that uses statistical features extracted from the words as its data set. Specifically, we use the SMT model to score the words generated at each decoding step of the NMT model, rather than processing the models' outputs directly. Classifiers convert these scores into generation probabilities for the corresponding words, which are then used to generate the output of the hybrid MT system. For the NMT, the DL network consists of an input layer, an embedding layer, a recurrent layer, a hidden layer, and an output layer. At the offline training stage, the NMT network is trained jointly with the SMT models. At the online deployment stage, we load the trained models and parameters to generate the outputs. Experimental results on French-to-English translation tasks show that the proposed scheme takes advantage of both NMT and SMT methods and thereby achieves higher translation precision.
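
The per-step combination described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the softmax used as the "classifier", the fixed interpolation weight `gate`, and the names `softmax` and `combine_step` are all assumptions made for illustration, and the probabilities and scores are made-up toy values.

```python
import math


def softmax(scores):
    """Turn raw scores (word -> float) into a probability distribution."""
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {word: math.exp(s - top) for word, s in scores.items()}
    total = sum(exps.values())
    return {word: e / total for word, e in exps.items()}


def combine_step(nmt_probs, smt_scores, gate=0.5):
    """Mix NMT and SMT word probabilities for a single decoding step.

    nmt_probs:  word -> probability from the NMT decoder (toy input).
    smt_scores: word -> raw SMT score for candidate words (toy input).
    gate:       weight given to the SMT distribution. A fixed constant is
                an assumption here; the abstract does not say how the two
                models are weighted.
    """
    if not smt_scores:
        # No SMT candidates at this step: fall back to the NMT distribution.
        return dict(nmt_probs)
    smt_probs = softmax(smt_scores)  # stand-in for the paper's classifier
    vocab = set(nmt_probs) | set(smt_probs)
    return {
        word: (1.0 - gate) * nmt_probs.get(word, 0.0)
        + gate * smt_probs.get(word, 0.0)
        for word in vocab
    }


# Toy decoding step: NMT prefers "cat", SMT strongly prefers "dog".
nmt_probs = {"cat": 0.5, "dog": 0.3, "house": 0.2}
smt_scores = {"cat": 1.0, "dog": 3.0}
mixed = combine_step(nmt_probs, smt_scores, gate=0.5)
best_word = max(mixed, key=mixed.get)
```

Because both inputs are proper distributions, the interpolated result also sums to one, so it can be used directly to pick or sample the next word. A learned, context-dependent gate would be a natural replacement for the constant, but that choice is outside what the abstract specifies.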







Acknowledgements
This work was supported in part by the State Key Program of National Social Science of China (No. 18AZD035), the Key Research & Development and Transformation Plan of Science and Technology Program for Tibet Autonomous Region (No. XZ201901-GB-16), the Special Fund from the Central Finance to Support the Development of Local Universities (No. ZFYJY201902001), and the National Natural Science Foundation of China (No. 71964030).
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Zhang, Y., Zhang, L., Lan, P. et al. Statistical Feature Aided Intelligent Deep Learning Machine Translation in Internet of Things. Mobile Netw Appl 28, 325–333 (2023). https://doi.org/10.1007/s11036-022-01936-4
DOI: https://doi.org/10.1007/s11036-022-01936-4