Abstract
This paper presents DiZer, an automatic DIscourse analyZER for Brazilian Portuguese. Given a source text, the system automatically produces its corresponding rhetorical analysis, following Rhetorical Structure Theory (RST) [1]. A rhetorical repository, which is DiZer's main component, makes the automatic analysis possible. This repository, produced by means of a corpus analysis, includes discourse analysis patterns that focus on knowledge about discourse markers, indicative phrases, and word usages. When applicable, potential rhetorical relations are indicated. A preliminary evaluation of the system is also presented.
References
Mann, W.C., Thompson, S.A.: Rhetorical Structure Theory: A Theory of Text Organization. Technical Report ISI/RS-87-190 (1987)
Marcu, D.: The Theory and Practice of Discourse Parsing and Summarization. The MIT Press, Cambridge (2000)
O’Donnell, M.: Variable-Length On-Line Document Generation. In: The Proceedings of the 6th European Workshop on Natural Language Generation, Duisburg, Germany (1997)
Cristea, D., Ide, N., Romary, L.: Veins Theory. An Approach to Global Cohesion and Coherence. In: The Proceedings of Coling/ACL, Montreal (1998)
Schauer, H.: Referential Structure and Coherence Structure. In: The Proceedings of TALN, Lausanne, Switzerland (2000)
Marcu, D.: The Rhetorical Parsing, Summarization, and Generation of Natural Language Texts. PhD Thesis, Department of Computer Science, University of Toronto (1997)
Marcu, D., Echihabi, A.: An Unsupervised Approach to Recognizing Discourse Relations. In: The Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002), Philadelphia, PA (2002)
Soricut, R., Marcu, D.: Sentence Level Discourse Parsing using Syntactic and Lexical Information. In: The Proceedings of the Human Language Technology and North American Association for Computational Linguistics Conference (HLT/NAACL), Edmonton, Canada (2003)
Corston-Oliver, S.: Computing Representations of the Structure of Written Discourse. PhD Thesis, University of California, Santa Barbara, CA, USA (1998)
Feltrim, V.D., Aluísio, S.M., Nunes, M.G.V.: Analysis of the Rhetorical Structure of Computer Science Abstracts in Portuguese. In: The Proceedings of Corpus Linguistics (2003)
Pardo, T.A.S., Rino, L.H.M.: DMSumm: Review and assessment. In: Ranchhod, E., Mamede, N.J. (eds.) PorTAL 2002. LNCS (LNAI), vol. 2389, pp. 263–273. Springer, Heidelberg (2002)
Aluísio, S.M., Oliveira Jr., O.N.: A Case-Based Approach for Developing Writing Tools Aimed at Non-native English Users. In: Aamodt, A., Veloso, M.M. (eds.) ICCBR 1995. LNCS, vol. 1010, pp. 121–132. Springer, Heidelberg (1995)
Aluísio, S.M., Barcelos, I., Sampaio, J., Oliveira Jr., O.N.: How to Learn the Many Unwritten Rules of the Game of the Academic Discourse: A Hybrid Approach Based on Critiques and Cases to Support Scientific Writing. In: The Proceedings of the IEEE International Conference on Advanced Learning Technologies, vol. 1, pp. 257–260. IEEE Computer Society, Madison (2001)
Rino, L.H.M., Scott, D.: A Discourse Model for Gist Preservation. In: The Proceedings of the XIII Brazilian Symposium on Artificial Intelligence (SBIA 1996), Curitiba - PR, Brasil (1996)
Carlson, L., Marcu, D.: Discourse Tagging Reference Manual. ISI Technical Report ISI-TR-545 (2001)
O’Donnell, M.: RST-Tool: An RST Analysis Tool. In: The Proceedings of the 6th European Workshop on Natural Language Generation. Gerhard-Mercator University, Duisburg, Germany (1997)
Pardo, T.A.S., Nunes, M.G.V.: A Construção de um Corpus de Textos Científicos em Português do Brasil e sua Marcação Retórica. Série de Relatórios Técnicos do Instituto de Ciências Matemáticas e de Computação - ICMC, Universidade de São Paulo, 212 (2003)
Pardo, T.A.S., Nunes, M.G.V.: Relações Retóricas e seus Marcadores Superficiais: Análise de um Corpus de Textos Científicos em Português do Brasil. Relatório Técnico NILC-TR-04-03. Série de Relatórios do NILC, ICMC-USP (2004)
Aires, R.V.X., Aluísio, S.M., Kuhn, D.C.S., Andreeta, M.L.B., Oliveira Jr., O.N.: Combining Multiple Classifiers to Improve Part of Speech Tagging: A Case Study for Brazilian Portuguese. In: The Proceedings of the Brazilian AI Symposium (SBIA 2000), pp. 20–22 (2000)
Martins, R.T., Montilha, G., Rino, L.H.M., Nunes, M.G.V.: Dos Modelos de Resolução da Ambigüidade Categorial: O Problema do SE. In: The Proceedings of the IV Encontro para o Processamento Computacional da Língua Portuguesa Escrita e Falada, PROPOR 1999, Évora, Portugal, pp. 115–128 (1999)
Pereira, F.C.N., Warren, D.H.D.: Definite Clause Grammars for Language Analysis – A Survey of the Formalism and Comparison with Augmented Transition Networks. Artificial Intelligence 13, 231–278 (1980)
Schilder, F.: Robust discourse parsing via discourse markers, topicality and position. In: Tait, J., Boguraev, B.K., Jacquemin, C. (eds.) Natural Language Engineering, Cambridge University Press, Cambridge (2002)
Sumita, K., Ono, K., Chino, T., Ukita, T., Amano, S.: A discourse structure analyzer for Japanese text. In: The Proceedings of the International Conference on Fifth Generation Computer Systems, Tokyo, Japan, vol. 2, pp. 1133–1140 (1992)
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pardo, T.A.S., das Graças Volpe Nunes, M., Rino, L.H.M. (2004). DiZer: An Automatic Discourse Analyzer for Brazilian Portuguese. In: Bazzan, A.L.C., Labidi, S. (eds) Advances in Artificial Intelligence – SBIA 2004. SBIA 2004. Lecture Notes in Computer Science, vol 3171. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28645-5_23
DOI: https://doi.org/10.1007/978-3-540-28645-5_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23237-7
Online ISBN: 978-3-540-28645-5