Skip to main content

Using Rhetorical Structure Theory and Entity Grids to Automatically Evaluate Local Coherence in Texts

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8775))

Abstract

This paper presents a joint model designed to measure local text coherence that uses Rhetorical Structure Theory (RST) and entity grids. The purpose is to learn patterns of entity distribution in texts by considering entity transition sequences and organizational/discourse information using RST relations in order to create a predictive model that is able to distinguish coherent from incoherent texts. In an evaluation with newspaper texts, the proposed model outperformed other methods in the area.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Althaus, E., Karamanis, N., Koller, A.: Computing locally coherent discourse. In: Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics, article 399, Stroudsburg, PA, USA (2004)

    Google Scholar 

  2. Barzilay, R., Lapata, M.: Modeling local coherence: An entity-based approach. Computational Linguistics 34, 1–34 (2008)

    Article  Google Scholar 

  3. Bick, E.: The Parsing System Palavras, Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Framework. Aarhus University Press (2000)

    Google Scholar 

  4. Bosma, W.: Query-Based Summarization using Rhetorical Structure Theory. In: Proceedings of the 15th Meetings of CLIN, LOT, Utrecht, pp. 29–44 (2004)

    Google Scholar 

  5. Burstein, J., Tetreault, J., Andreyev, S.: Using entity-based features to model coherence in student essays. In: Human Language Technologies: In Proceedings of the 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp. 681–684 (2010)

    Google Scholar 

  6. Cardoso, P., Maziero, E., Jorge, M., Seno, E., di Felippo, A., Rino, L., Nunes, M., Pardo, T.: Cstnews - a discourse-annotated corpus for single and multi-document summarizationof news texts in brazilian portuguese. In: Proceedings of the 3rd RST Brazilian Meeting, pp. 88–105 (2011)

    Google Scholar 

  7. Cunha, I., Torres-Moreno, J.-M., Sierra, G.: On the Development of the RST Spanish Treebank. In: Proceedings of the 5th Linguistic Annotation Workshop, Portland-Oregon, pp. 1–10 (2011)

    Google Scholar 

  8. Dijk, T.V., Kintsch, W.: Strategics in discourse comprehension. Academic Press, New York (1983)

    Google Scholar 

  9. Filippova, K., Strube, M.: Extending the entity-grid coherence model to semantically related entities. In: Proceedings of the Eleventh European Workshop on Natural Language Generations, pp. 139–142 (2007)

    Google Scholar 

  10. Foltz, P.W., Kintsch, W., Landauer, T.K.: The Measurement of textual coherence using latent semantic analysis. Discourse Processes 25(2-3), 285–307 (1998)

    Article  Google Scholar 

  11. Freitas, A.P., Feltrim, V.D.: Análise Automática de Coerência Usando o Modelo Grade de Entidades para o Português. In: Proceedings of the IX Brazilian Symposium in Information and Human Language Technology, Fortaleza, CE, Brazil, pp. 69–78 (2013)

    Google Scholar 

  12. Grosz, B., Aravind, K.J., Scott, W.: Centering: A framework for modeling the local coherence of discourse. Computational Linguistics 21, 203–225 (1995)

    Google Scholar 

  13. Iida, R., Tokunaga, T.: A metric for evaluating discourse coherence based on coreference resolution. In: Proceedings of the COLING 2012: Posters, Mumbai, India, pp. 483–494 (2012)

    Google Scholar 

  14. Joachims, T.: Optimizing search engines using clickthrough data. In: Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, pp. 133–142 (2002)

    Google Scholar 

  15. Karamanis, N., Poesio, M., Mellish, C., Oberlander, J.: Evaluating centering-based metrics of coherence for text structuring using a reliably annotated corpus. In: Proceedings of the 42nd Annual Meetings of the Association for Computational Linguistics, article 391 (2004)

    Google Scholar 

  16. Kibble, R., Power, R.: Optimising referential coherence in text generation. Computational Linguistic 30(4), 401–416 (2004)

    Article  Google Scholar 

  17. Koch, I.V., Travaglia, L.C.: A Coerência Textual, 14th edn. Contexto, São Paulo (2002)

    Google Scholar 

  18. Lapata, M.: Probabilistic texts structuring: Experiments with sentence ordering. In: Proceeding of the 2nd Human Language Technology Conference and Annual Meeting of the North American Chapter of the Association for Computational Linguistics, pp. 545–552 (2003)

    Google Scholar 

  19. Landauer, T.K., Dumais, S.T.: A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction and representation to coreference resolution. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics, Philadelphia, PA, pp. 104–111 (1997)

    Google Scholar 

  20. Lin, Z., Ng, H.T., Kan, M.Y.: Automatically evaluating text coherence using discourse relations. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Stroudsburg, PA, USA, vol. 1, pp. 997–1006 (2011)

    Google Scholar 

  21. Mann, W.C., Thompson, S.A.: Rhetorical Structure Theory: Toward a functional theory of text organization. Text 8(3), 243–281 (1988)

    Google Scholar 

  22. Mann, W.C., Thompson, S.A.: Rhetorical Structure Theory: A Theory of Text Organization. Technical Report from Information Sciences Institute (ISI), ISI/RS-87-190, pp. 1-91. University of Southern California, USA (1987)

    Google Scholar 

  23. Marcu, D.: The Rhetorical Parsing of Unrestricted Texts: A Surface-based Approach. Computational Linguistics 26, 396–448 (2000)

    Article  Google Scholar 

  24. Maziero, E., Pardo, T.A.S.: Automatização de um método de avaliação de estruturas retóricas. In Proceedings of the RST Brazilian Meeting (2009)

    Google Scholar 

  25. Mckoon, G., Ratcliff, R.: Inference during reading. Psychological Review, 440-446 (1992)

    Google Scholar 

  26. Radev, D.: A common theory of information fusion from multiple text sources, step one: Cross-document structure. In: Proceedings of the 1st ACL SIGDIAL Workshop on Discourse and Dialogue, Hong Kong, pp. 74–83 (2000)

    Google Scholar 

  27. Ribeiro, G.F., Rino, L.H.M.: A Sumarização Automática com Base em Estruturas RST. Technical Reports from Interinstitutional Center for Computational Linguistics, University of São Paulo, NILC-TR-02-05. São Carlos, Brazil (2002)

    Google Scholar 

  28. Salton, G.: Term-Weighting Approaches in Automatic Text Retrieval. Information Processing and Management, 513–523 (1988)

    Google Scholar 

  29. Seno, E.R.M.: Rhesumarst: Um sumarizador automático de estruturas rst. Master Thesis. University of São Carlos. São Carlos/SP (2005)

    Google Scholar 

  30. Webber, B.: D-ltag: Extending lexicalized tag to discourse. Cognitive Science 28(5), 751–779 (2004)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

de S. Dias, M., Feltrim, V.D., Pardo, T.A.S. (2014). Using Rhetorical Structure Theory and Entity Grids to Automatically Evaluate Local Coherence in Texts. In: Baptista, J., Mamede, N., Candeias, S., Paraboni, I., Pardo, T.A.S., Volpe Nunes, M.d.G. (eds) Computational Processing of the Portuguese Language. PROPOR 2014. Lecture Notes in Computer Science(), vol 8775. Springer, Cham. https://doi.org/10.1007/978-3-319-09761-9_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09761-9_26

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09760-2

  • Online ISBN: 978-3-319-09761-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics