Abstract
Natural language text consists of the author’s or narrator’s text and fragments of direct speech. Because these have different speakers, they may use different vocabularies and syntactic structures. To analyze how vocabulary and sentence structure depend on the speaker, each text fragment must be attributed to its speaker. The results of such analysis can be used in natural language text generation tasks, making it possible to convey a different narrative voice depending on the purpose of the generated text. The authors developed a set of rules for attributing direct speech fragments to speaking characters, created a method of direct-speech scene analysis, and implemented it in a software tool. An experiment was carried out to evaluate the accuracy of attributing direct speech fragments to speakers. The results show the viability of the developed method and indicate how it can be improved for further use. The potential applications of the developed method and the software tool are discussed.
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Sychev, O., Kamennov, Y., Shurlaeva, E. (2020). Approach to Automatic Determining of Speakers of Direct Speech Fragments in Natural Language Texts. In: Samsonovich, A. (eds) Biologically Inspired Cognitive Architectures 2019. BICA 2019. Advances in Intelligent Systems and Computing, vol 948. Springer, Cham. https://doi.org/10.1007/978-3-030-25719-4_68
DOI: https://doi.org/10.1007/978-3-030-25719-4_68
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-25718-7
Online ISBN: 978-3-030-25719-4
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)