Abstract
The aim of the work reported here is to provide a tool to help secondary school (high school) age students to reflect on the structure of their essays. Numerous tools are available to help students check their spelling and grammar. Very little, however, has been done to help them with higher level problems in their texts. In order to do this, we need to be able to analyse the discourse relations within their texts. This is particularly problematic for texts of this kind, since they contain few instances of explicit discourse markers such as ‘however’, ‘moreover’, ‘therefore’. The situation is made worse by the fact that many texts produced by such students contain large numbers of spelling and grammatical errors, thus making linguistic analysis extremely challenging. The current paper reports on a number of experiments in classification of the discourse relations in such essays. The work explores the use of machine learning techniques to identify such relations in unseen essays, using a corpus of manually annotated essays as a training set.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Miller, A., Beckwith, G.R., et al.: Introduction to WordNet: An On-line Lexical Database, p. 85. University of Princeton (1993)
Burstein, J., Marcu, D., et al.: Finding the WRITE Stuff: Automatic Identification of Discourse Structure in Student Essays. IEEE Intelligent Systems 18, 32–39 (2003)
Corston-Oliver, S.H.: Computing Representations of the Structure of Written Discourse, p. 256. University of California, Santa Barbara (1998)
Grosz, B.J., Joshi, A.K., et al.: Centering: A Framework for Modelling the Local Coherence of Discourse. Computational Linguistics 21(2), 203–225 (1995)
Hutchinson, B.: Automatic classification of discourse markers by their co-occurrences. In: Proceedings of the ESSLLI 2003 workshop on The Meaning and Implementation of Discourse Particles, Vienna, Austria, pp. 65–72 (2003)
Knott, A.: A Data-Driven Methodology for Motivating a Set of Coherence Relations, Department of Artificial Intelligence. University of Edinburgh, Edinburgh (1996)
Mahmud, R.: Possible attributes for identifying discourse relations structure. In: CLUK 2004, Birmingham (2004)
Marcu, D.: The Theory and Practice of Discourse Parsing and Summarization. The MIT Press, London (2000)
Marcu, D., Echihabi, A.: An Unsupervised Approach to Recognizing Discourse Relations. In: Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL 2002), Philadelphia (2002)
Ramsay, A.: PrAgmatics = ReAsoning about the Speaker’s InTEnsions—The Parasite Manual. UMIST, Computation Department, Manchester, p. 48 (2001)
White, R., Arndt, V.: Process Writing. Addison Wesley Longman Limited, Essex (1991)
Witten, I.H., Frank, E.: Data Mining - Practical machine Learning Tools and Techniques with Java Implementation. Morgan Kaufmann, San Francisco (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Mahmud, R., Ramsay, A. (2005). Finding Discourse Relations in Student Essays. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2005. Lecture Notes in Computer Science, vol 3406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30586-6_11
Download citation
DOI: https://doi.org/10.1007/978-3-540-30586-6_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24523-0
Online ISBN: 978-3-540-30586-6
eBook Packages: Computer ScienceComputer Science (R0)