DiZer: An Automatic Discourse Analyzer for Brazilian Portuguese

  • Conference paper
Advances in Artificial Intelligence – SBIA 2004 (SBIA 2004)

Abstract

This paper presents DiZer, an automatic DIscourse analyZER for Brazilian Portuguese. Given a source text, the system automatically produces the corresponding rhetorical analysis, following Rhetorical Structure Theory (RST) [1]. A rhetorical repository, DiZer's main component, makes the automatic analysis possible. This repository, built through corpus analysis, contains discourse analysis patterns that encode knowledge about discourse markers, indicative phrases, and word usage. When a pattern applies, the potential rhetorical relations are indicated. A preliminary evaluation of the system is also presented.
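To make the idea of marker-based patterns concrete, here is a minimal sketch of a lookup that maps surface discourse markers to candidate rhetorical relations. It is not DiZer's implementation: the `Pattern` class, the `REPOSITORY` entries, the marker-to-relation pairings, and the `candidate_relations` function are all hypothetical, chosen only for illustration.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Pattern:
    marker: str    # surface discourse marker (hypothetical entry)
    relation: str  # candidate rhetorical relation it may signal (assumed pairing)


# Toy repository. These three entries are illustrative assumptions,
# not patterns taken from the DiZer rhetorical repository.
REPOSITORY = [
    Pattern("embora", "concession"),
    Pattern("mas", "contrast"),
    Pattern("por exemplo", "elaboration"),
]


def candidate_relations(sentence: str) -> list[tuple[str, str]]:
    """Return (marker, relation) pairs for every marker found as a whole word."""
    lowered = sentence.lower()
    hits = []
    for pattern in REPOSITORY:
        # \b keeps "mas" from matching inside longer words such as "sistemas".
        if re.search(r"\b" + re.escape(pattern.marker) + r"\b", lowered):
            hits.append((pattern.marker, pattern.relation))
    return hits
```

A call such as `candidate_relations("O sistema funciona, mas é lento.")` returns the pairs whose markers occur in the sentence. A marker only suggests a potential relation; a real analyzer would still need to segment the text and choose among competing candidates.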

References

  1. Mann, W.C., Thompson, S.A.: Rhetorical Structure Theory: A Theory of Text Organization. Technical Report ISI/RS-87-190 (1987)

  2. Marcu, D.: The Theory and Practice of Discourse Parsing and Summarization. The MIT Press, Cambridge (2000)

  3. O’Donnell, M.: Variable-Length On-Line Document Generation. In: The Proceedings of the 6th European Workshop on Natural Language Generation, Duisburg, Germany (1997)

  4. Cristea, D., Ide, N., Romary, L.: Veins Theory. An Approach to Global Cohesion and Coherence. In: The Proceedings of Coling/ACL, Montreal (1998)

  5. Schauer, H.: Referential Structure and Coherence Structure. In: The Proceedings of TALN, Lausanne, Switzerland (2000)

  6. Marcu, D.: The Rhetorical Parsing, Summarization, and Generation of Natural Language Texts. PhD Thesis, Department of Computer Science, University of Toronto (1997)

  7. Marcu, D., Echihabi, A.: An Unsupervised Approach to Recognizing Discourse Relations. In: The Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002), Philadelphia, PA (2002)

  8. Soricut, R., Marcu, D.: Sentence Level Discourse Parsing using Syntactic and Lexical Information. In: The Proceedings of the Human Language Technology and North American Association for Computational Linguistics Conference (HLT/NAACL), Edmonton, Canada (2003)

  9. Corston-Oliver, S.: Computing Representations of the Structure of Written Discourse. PhD Thesis, University of California, Santa Barbara, CA, USA (1998)

  10. Feltrim, V.D., Aluísio, S.M., Nunes, M.G.V.: Analysis of the Rhetorical Structure of Computer Science Abstracts in Portuguese. In: The Proceedings of Corpus Linguistics (2003)

  11. Pardo, T.A.S., Rino, L.H.M.: DMSumm: Review and assessment. In: Ranchhod, E., Mamede, N.J. (eds.) PorTAL 2002. LNCS (LNAI), vol. 2389, pp. 263–273. Springer, Heidelberg (2002)

  12. Aluísio, S.M., Oliveira Jr., O.N.: A Case-Based Approach for Developing Writing Tools Aimed at Non-native English Users. In: Aamodt, A., Veloso, M.M. (eds.) ICCBR 1995. LNCS, vol. 1010, pp. 121–132. Springer, Heidelberg (1995)

  13. Aluísio, S.M., Barcelos, I., Sampaio, J., Oliveira Jr., O.N.: How to Learn the Many Unwritten Rules of the Game of the Academic Discourse: A Hybrid Approach Based on Critiques and Cases to Support Scientific Writing. In: The Proceedings of the IEEE International Conference on Advanced Learning Technologies, vol. 1, pp. 257–260. IEEE Computer Society, Madison (2001)

  14. Rino, L.H.M., Scott, D.: A Discourse Model for Gist Preservation. In: The Proceedings of the XIII Brazilian Symposium on Artificial Intelligence (SBIA 1996), Curitiba - PR, Brasil (1996)

  15. Carlson, L., Marcu, D.: Discourse Tagging Reference Manual. ISI Technical Report ISI-TR-545 (2001)

  16. O’Donnell, M.: RST-Tool: An RST Analysis Tool. In: The Proceedings of the 6th European Workshop on Natural Language Generation. Gerhard-Mercator University, Duisburg, Germany (1997)

  17. Pardo, T.A.S., Nunes, M.G.V.: A Construção de um Corpus de Textos Científicos em Português do Brasil e sua Marcação Retórica. Série de Relatórios Técnicos do Instituto de Ciências Matemáticas e de Computação - ICMC, Universidade de São Paulo, 212 (2003)

  18. Pardo, T.A.S., Nunes, M.G.V.: Relações Retóricas e seus Marcadores Superficiais: Análise de um Corpus de Textos Científicos em Português do Brasil. Relatório Técnico NILC-TR-04-03. Série de Relatórios do NILC, ICMC-USP (2004)

  19. Aires, R.V.X., Aluísio, S.M., Kuhn, D.C.S., Andreeta, M.L.B., Oliveira Jr., O.N.: Combining Multiple Classifiers to Improve Part of Speech Tagging: A Case Study for Brazilian Portuguese. In: The Proceedings of the Brazilian AI Symposium (SBIA 2000), pp. 20–22 (2000)

  20. Martins, R.T., Montilha, G., Rino, L.H.M., Nunes, M.G.V.: Dos Modelos de Resolução da Ambigüidade Categorial: O Problema do SE. In: The Proceedings do IV Encontro para o Processamento Computacional da Língua Portuguesa Escrita e Falada, PROPOR 1999, Évora, Portugal, pp. 115–128 (1999)

  21. Pereira, F.C.N., Warren, D.H.D.: Definite Clause Grammars for Language Analysis – A Survey of the Formalism and Comparison with Augmented Transition Networks. Artificial Intelligence 13, 231–278 (1980)

  22. Schilder, F.: Robust discourse parsing via discourse markers, topicality and position. In: Tait, J., Boguraev, B.K., Jacquemin, C. (eds.) Natural Language Engineering, Cambridge University Press, Cambridge (2002)

  23. Sumita, K., Ono, K., Chino, T., Ukita, T., Amano, S.: A discourse structure analyzer for Japanese text. In: The Proceedings of the International Conference on Fifth Generation Computer Systems, Tokyo, Japan, vol. 2, pp. 1133–1140 (1992)

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pardo, T.A.S., das Graças Volpe Nunes, M., Rino, L.H.M. (2004). DiZer: An Automatic Discourse Analyzer for Brazilian Portuguese. In: Bazzan, A.L.C., Labidi, S. (eds) Advances in Artificial Intelligence – SBIA 2004. SBIA 2004. Lecture Notes in Computer Science, vol 3171. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28645-5_23

  • DOI: https://doi.org/10.1007/978-3-540-28645-5_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23237-7

  • Online ISBN: 978-3-540-28645-5

  • eBook Packages: Springer Book Archive
