Approach to Automatic Determining of Speakers of Direct Speech Fragments in Natural Language Texts

Sychev, Oleg; Kamennov, Yaroslav; Shurlaeva, Ekaterina

doi:10.1007/978-3-030-25719-4_68

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 948)

Included in the following conference series:

Biologically Inspired Cognitive Architectures Meeting

Abstract

A natural language text consists of the author’s or narrator’s text and direct speech fragments. Because these parts have different speakers, they may use different vocabularies and syntactic structures. To analyze how vocabulary and sentence structure depend on the speaker, each text fragment must be attributed to its speaker. The results of such analysis can be used in natural language text generation tasks, making it possible to convey a different narrative voice depending on the purpose of the generated text. The authors developed a set of rules for attributing direct speech fragments to speaking characters, created a method of direct-speech scene analysis, and implemented it in a software tool. An experiment was carried out to evaluate the accuracy of attributing direct speech fragments to speakers. Its results show the viability of the developed method and indicate how it can be improved for further use. The potential applications of the developed method and the software tool are discussed.
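
The abstract does not list the attribution rules themselves, so the following is only an illustrative sketch of what rule-based speaker attribution can look like, not the authors' method. It assumes straight double quotes, single-word capitalized character names, and a small hypothetical list of speech verbs (`SPEECH_VERBS`). The function name `attribute_speakers` and both patterns are invented for this example. For each quoted fragment, it looks for a "verb + name" or "name + verb" tag in the narrator's clause right after the quote, then in the clause right before it.

```python
import re

# Hypothetical verb list; the chapter's actual rule set is not given in the abstract.
SPEECH_VERBS = ("said", "asked", "replied", "answered", "shouted", "whispered")
_VERBS = "|".join(SPEECH_VERBS)

# Direct speech fragment: text between a pair of straight double quotes (assumption).
QUOTE_RE = re.compile(r'"([^"]+)"')
# Speech tag with the name first ("Bob replied") or the verb first ("said Anna").
NAME_VERB_RE = re.compile(rf"\b([A-Z][a-z]+)\s+(?:{_VERBS})\b")
VERB_NAME_RE = re.compile(rf"\b(?:{_VERBS})\s+([A-Z][a-z]+)\b")


def _find_speaker(clause):
    """Return the first name matched by a speech-tag pattern, or None."""
    for pattern in (NAME_VERB_RE, VERB_NAME_RE):
        match = pattern.search(clause)
        if match:
            return match.group(1)
    return None


def attribute_speakers(text):
    """Return (fragment, speaker) pairs; speaker is None when no rule fires."""
    quotes = list(QUOTE_RE.finditer(text))
    results = []
    for i, quote in enumerate(quotes):
        # Narrator text between this quote and the next one (or the end of the text).
        next_start = quotes[i + 1].start() if i + 1 < len(quotes) else len(text)
        after = text[quote.end():next_start]
        # Narrator text between the previous quote (or the start) and this one.
        prev_end = quotes[i - 1].end() if i > 0 else 0
        before = text[prev_end:quote.start()]

        # Only the clause adjacent to the quote is considered for each side.
        clause_after = re.split(r"[.!?]", after, maxsplit=1)[0]
        clause_before = re.split(r"[.!?]", before)[-1]

        speaker = _find_speaker(clause_after) or _find_speaker(clause_before)
        results.append((quote.group(1), speaker))
    return results


sample = '"Hi," said Anna. Bob replied, "Hello." The room was quiet. "Fine."'
for fragment, speaker in attribute_speakers(sample):
    print(f"{speaker or 'unattributed'}: {fragment}")
```

Fragments without an adjacent speech tag stay unattributed in this sketch. Resolving them would need scene-level context, such as turn-taking between the characters present, which is the kind of problem the direct-speech scene analysis mentioned in the abstract is meant to address.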



Author information

Authors and Affiliations

Volgograd State Technical University, Volgograd, Russian Federation
Oleg Sychev, Yaroslav Kamennov & Ekaterina Shurlaeva

Corresponding author

Correspondence to Oleg Sychev.

Editor information

Editors and Affiliations

Moscow Engineering Physics Institute (MEPhI), Department of Cybernetics, National Research Nuclear University (NRNU), Moscow, Russia
Alexei V. Samsonovich


Cite this paper

Sychev, O., Kamennov, Y., Shurlaeva, E. (2020). Approach to Automatic Determining of Speakers of Direct Speech Fragments in Natural Language Texts. In: Samsonovich, A. (eds) Biologically Inspired Cognitive Architectures 2019. BICA 2019. Advances in Intelligent Systems and Computing, vol 948. Springer, Cham. https://doi.org/10.1007/978-3-030-25719-4_68


DOI: https://doi.org/10.1007/978-3-030-25719-4_68
Published: 17 July 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-25718-7
Online ISBN: 978-3-030-25719-4
eBook Packages: Intelligent Technologies and Robotics (R0)
