Abstract
Generating text from a predefined guideline is an interesting but challenging problem, and a series of studies has addressed it in recent years. In dialogue systems, researchers have explored driving a dialogue based on a plan, while in story generation, a storyline has also proved useful. In this article, we address a new task: generating movie scripts based on a predefined narrative. As an early exploration, we study this problem in a “retrieval-based” setting. We propose a model (ScriptWriter-CPre) that selects, from a set of candidates, the best response (i.e., the next script line) that fits both the context (i.e., the previous script lines) and the given narrative. Our model keeps track of what in the narrative has been said and what remains to be said. It can also predict which part of the narrative deserves more attention when selecting the next line of script. In our study, we find that the narrative plays a different role than the context; therefore, different mechanisms are designed to deal with them. Because no data were available for this new application, we construct a new large-scale data collection, GraphMovie, from a movie website where end-users can freely upload their narratives while watching a movie. This new dataset is made publicly available to facilitate other studies on guideline-based text generation. Experimental results on the dataset show that our proposed narrative-based approach significantly outperforms baselines that simply use the narrative as a kind of context.