Automatic Text Generation in Macedonian Using Recurrent Neural Networks

  • Conference paper
  • First Online:
ICT Innovations 2019. Big Data Processing and Mining (ICT Innovations 2019)

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1110)

Abstract

Neural text generation is the process of training a neural network to generate human-understandable text (a poem, story, or article). Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) networks are powerful sequence models well suited to this kind of task. In this paper, we developed two types of language models for the Macedonian language: one generating news articles and the other generating poems. We developed and tested several model architectures, including a transfer-learning model, since text generation requires a lot of processing time. As the evaluation metric we used ROUGE-N (Recall-Oriented Understudy for Gisting Evaluation), comparing the generated text against a reference text written by an expert. The results showed that even though the generated text had flaws, it was understandable to humans and consistent across sentences. To the best of our knowledge, this is the first attempt at automatic text generation of poems and articles in the Macedonian language using deep learning.
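For illustration, the following is a minimal sketch of a word-level LSTM language model in Keras, in the spirit of the models the abstract describes. The corpus file name, context-window size, layer sizes, and training settings are illustrative assumptions, not the authors' actual configuration.

```python
import numpy as np
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, LSTM, Dropout, Dense
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences
from tensorflow.keras.utils import to_categorical

# Hypothetical training corpus (e.g. Macedonian news articles or poems).
corpus = open("macedonian_corpus.txt", encoding="utf-8").read()

tokenizer = Tokenizer()
tokenizer.fit_on_texts([corpus])
vocab_size = len(tokenizer.word_index) + 1
tokens = tokenizer.texts_to_sequences([corpus])[0]

# Build (context window -> next word) training pairs; window size is assumed.
seq_len = 20
X = np.array([tokens[i - seq_len:i] for i in range(seq_len, len(tokens))])
y = to_categorical([tokens[i] for i in range(seq_len, len(tokens))],
                   num_classes=vocab_size)

# Two stacked LSTM layers with dropout for regularization;
# sizes are illustrative, not the paper's exact architecture.
model = Sequential([
    Embedding(vocab_size, 100),
    LSTM(256, return_sequences=True),
    Dropout(0.2),
    LSTM(256),
    Dense(vocab_size, activation="softmax"),
])
model.compile(loss="categorical_crossentropy", optimizer="adam")
model.fit(X, y, batch_size=128, epochs=50)

def generate(seed_text, n_words):
    """Extend seed_text by sampling one word at a time from the model."""
    words = seed_text.split()
    for _ in range(n_words):
        encoded = tokenizer.texts_to_sequences([" ".join(words)])[0]
        encoded = pad_sequences([encoded[-seq_len:]], maxlen=seq_len)
        probs = model.predict(encoded, verbose=0)[0].astype("float64")
        probs /= probs.sum()  # renormalize float32 softmax output
        next_id = np.random.choice(len(probs), p=probs)
        word = tokenizer.index_word.get(next_id)
        if word:
            words.append(word)
    return " ".join(words)

print(generate("зошто", 30))  # seed word, then 30 sampled words
```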
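The ROUGE-N evaluation mentioned in the abstract can likewise be sketched in a few lines: ROUGE-N recall (Lin, 2004) is the fraction of reference n-grams that also appear in the generated text, with per-n-gram counts clipped. This simplified implementation illustrates the idea; it is not the paper's evaluation script.

```python
from collections import Counter

def ngrams(words, n):
    """Multiset of n-grams occurring in a list of words."""
    return Counter(tuple(words[i:i + n]) for i in range(len(words) - n + 1))

def rouge_n_recall(generated, reference, n=2):
    """Clipped fraction of reference n-grams found in the generated text."""
    gen = ngrams(generated.split(), n)
    ref = ngrams(reference.split(), n)
    overlap = sum(min(count, gen[gram]) for gram, count in ref.items())
    total = sum(ref.values())
    return overlap / total if total else 0.0

# Toy English example; the paper compares Macedonian generated text
# against an expert-written reference.
print(rouge_n_recall("the cat sat on the mat", "the cat lay on the mat"))  # 0.6
```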



Acknowledgment

We gratefully acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research.

Author information

Corresponding author

Correspondence to Hristijan Gjoreski.


Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Milanova, I., Sarvanoska, K., Srbinoski, V., Gjoreski, H. (2019). Automatic Text Generation in Macedonian Using Recurrent Neural Networks. In: Gievska, S., Madjarov, G. (eds) ICT Innovations 2019. Big Data Processing and Mining. ICT Innovations 2019. Communications in Computer and Information Science, vol 1110. Springer, Cham. https://doi.org/10.1007/978-3-030-33110-8_1

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-33110-8_1

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-33109-2

  • Online ISBN: 978-3-030-33110-8

  • eBook Packages: Computer Science, Computer Science (R0)
