Generating Chinese Poems from Images Based on Neural Network

Xing, Shuo; Liu, Xueliang; Hong, Richang; Zhao, Ye

doi:10.1007/978-3-319-77380-3_52

Shuo Xing¹⁹,
Xueliang Liu¹⁹,
Richang Hong¹⁹ &
…
Ye Zhao¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10735))

Included in the following conference series:

Pacific Rim Conference on Multimedia

2821 Accesses
1 Citations

Abstract

Chinese classical poetry generation from images is an overwhelmingly challenging work in the field of artificial intelligence. Inspired by recent advances in automatically generating description of an image and Chinese poem generation, in this paper, we present a generative model based on deep recurrent framework that describes images in the form of poems. Our model consists of two parts, one is to extract information according to the semantics presented in images, and the other is to generate each line of the poem incrementally according to the extracted semantic information from the images by a recurrent neural network. Experimental results thoroughly demonstrate the effectiveness of our approach by manual evaluation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 119.00; Price excludes VAT (USA)

Softcover Book: USD 155.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Bernstein, M., Khosla, A., Berg, A.C., Fei-Fei, L.: ImageNet large scale visual recognition challenge. IJCV 115, 211–252 (2015)
Article MathSciNet Google Scholar
Farhadi, A., Endres, I., Hoiem, D., Forsyth, D.: Describing objects by their attributes. In: CVPR, pp. 1602–1605 (2009)
Google Scholar
Kulkarni, G., Premraj, V., Dhar, S., Li, S., Berg, A., Choi, Y., Berg, T.: Baby talk: understanding and generating simple image descriptions. In: CVPR (2011)
Google Scholar
Farhadi, A., Hejrati, M., Sadeghi, M.A., Young, P., Rashtchian, C., Hockenmaier, J., Forsyth, D.: Every picture tells a story: generating sentences for images. In: ECCV (2010)
Chapter Google Scholar
Yao, B.Z., Yang, X., Lin, L., Lee, M.W., Zhu, S.C.: I2T: image parsing to text description. Proc. IEEE 98(8), 1485–1508 (2010)
Article Google Scholar
Elliott, D., Keller, F.: Image description using visual dependency representations. In: EMNLP, pp. 1292–1302 (2013)
Google Scholar
Li, S., Kulkarni, G., Berg, T.L., Berg, A.C., Choi, Y.: Composing simple image descriptions using web-scale n-grams. In: CoNLL (2011)
Google Scholar
Hodosh, M., Young, P., Hockenmaier, J.: Framing image description as a ranking task: data, models and evaluation metrics. J. Artif. Intell. Res. 47(1), 853–899 (2013)
MathSciNet MATH Google Scholar
Vinyals, O., Toshev, A., Bengio, S., Erhan, D.: Show and tell: a neural image caption generator. arXiv preprint arXiv:1411.4555 (2014)
Karpathy, A., Fei-Fei, L.: Deep visual-semantic alignments for generating image descriptions. In: CVPR (2015)
Google Scholar
Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A.C., Salakhutdinov, R., Zemel, R.S., Bengio, Y.: Show, attend and tell: neural image caption generation with visual attention. arXiv preprint arXiv:1502.03044 (2015)
Zhang, X., Lapata, M.: Chinese poetry generation with recurrent neural networks. In: EMNLP, pp. 670–680 (2014)
Google Scholar
Wang, Q., Luo, T., Wang, D., Xing, C.: Chinese song iambics generation with neural attention-based model. CoRR, abs/1604.06274 (2016)
Google Scholar
Yi, X., Li, R., Sun, M.: Generating Chinese classical poems with RNN encoder-decoder. CoRR, abs/1604.01537 (2016)
Google Scholar
Colton, S., Goodwin, J., Veale, T.: Full FACE poetry generation. In: ICCC, pp. 95–102 (2012)
Google Scholar
Oliveira, H.: Automatic generation of poetry: an overview. Universidade de Coimbra (2009)
Google Scholar
Oliveira, H.: Poetryme: a versatile platform for poetry generation. Comput. Creat. Concept Inven. Gen. Intell. 1, 21 (2012)
Google Scholar
Jiang, L., Zhou, M.: Generating Chinese couplets using a statistical MT approach. In: Proceedings of the 22nd International Conference on Computational Linguistics, pp. 377–384 (2008)
Google Scholar
He, J., Zhou, M., Jiang, L.: Generating Chinese classical poems with statistical machine translation models. In: AAAI, pp. 1650–1656 (2012)
Google Scholar
Zhou, C.L., You, W., Ding, X.: Genetic algorithm and its implementation of automatic generation of Chinese songci. J. Softw. 21(3), 427–437 (2010)
Article Google Scholar
Wang, L.: A Summary of Rhyming Constraints of Chinese Poems (in Chinese). Beijing Press, Beijing (2002)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. In: NIPS (2012)
Google Scholar
Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., Fei-Fei, L.: Imagenet: a large-scale hierarchical image database. In: CVPR (2009)
Google Scholar
Hochreiter, S., Schmidhuber, J.: Long short-term memory. Neural Comput. 9, 1735–1780 (1997)
Article Google Scholar
Liu, C.W., Lowe, R., Serban, I.V., Noseworthy, M., Charlin, L., Pineau, J.: How not to evaluate your dialogue system: an empirical study of unsupervised evaluation metrics for dialogue response generation. arXiv preprint arXiv:1603.08023 (2016)
Wu, Q., Shen, C., Liu, L., Dick, A., van den Hengel, A.: What value do explicit high level concepts have in vision to language problems? In: CVPR (2016)
Google Scholar
Hong, R., Yang, Y., Wang, M., Hua, X.-S.: Learning visual semantic relationships for efficient visual retrieval. IEEE Trans. Big Data 1(4), 152–161 (2015)
Article Google Scholar
Zhang, H., Shang, X., Luan, H.-B., Wang, M., Chua, T.-S.: Learning from collective intelligence: feature learning using social images and tags. TOMCCAP 13(1), 1:1–1:23 (2016)
Article Google Scholar
Zhang, H., Kyaw, Z., Chang, S.-F., Chua, T.-S.: Visual translation embedding network for visual relation detection. In: CVPR (2017)
Google Scholar

Download references

Acknowledgments

This work was supported by the National Natural Science Foundation of China (NSFC) under grants 61472116 and 61502139.

Author information

Authors and Affiliations

Hefei University of Technology, Hefei, China
Shuo Xing, Xueliang Liu, Richang Hong & Ye Zhao

Authors

Shuo Xing
View author publications
You can also search for this author in PubMed Google Scholar
Xueliang Liu
View author publications
You can also search for this author in PubMed Google Scholar
Richang Hong
View author publications
You can also search for this author in PubMed Google Scholar
Ye Zhao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xueliang Liu .

Editor information

Editors and Affiliations

University of Electronic Science and Technology of China, Chengdu, China
Bing Zeng
University of Chinese Academy of Sciences, Beijing, China
Qingming Huang
University of Ottawa, Ottawa, Ontario, Canada
Abdulmotaleb El Saddik
University of Electronic Science and Technology of China, Chengdu, China
Hongliang Li
Chinese Academy of Sciences, Beijing, China
Shuqiang Jiang
Harbin Institute of Technology, Harbin, China
Xiaopeng Fan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xing, S., Liu, X., Hong, R., Zhao, Y. (2018). Generating Chinese Poems from Images Based on Neural Network. In: Zeng, B., Huang, Q., El Saddik, A., Li, H., Jiang, S., Fan, X. (eds) Advances in Multimedia Information Processing – PCM 2017. PCM 2017. Lecture Notes in Computer Science(), vol 10735. Springer, Cham. https://doi.org/10.1007/978-3-319-77380-3_52

Download citation

DOI: https://doi.org/10.1007/978-3-319-77380-3_52
Published: 10 May 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-77379-7
Online ISBN: 978-3-319-77380-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics