Toward Automatic Classification of Metadiscourse

Correia, Rui; Mamede, Nuno; Baptista, Jorge; Eskenazi, Maxine

doi:10.1007/978-3-319-10888-9_27

Toward Automatic Classification of Metadiscourse

Rui Correia^20,21,
Nuno Mamede²⁰,
Jorge Baptista²² &
…
Maxine Eskenazi²¹

Conference paper

2073 Accesses
1 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8686))

Abstract

This paper describes the supervised classification of four metadiscursive functions in English. Training data is collected using crowdsourcing to label a corpus of TED talks transcripts with occurrences of Introductions, Conclusions, Examples, and Emphasis. Using decision trees and lexical features, we report classification accuracy.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abbasi, A., Chen, H., Salem, A.: Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums. ACM Transactions on Information Systems (TOIS) 26(3), 12 (2008)
Article Google Scholar
Ädel, A.: Just to give you kind of a map of where we are going: A Taxonomy of Metadiscourse in Spoken and Written Academic English. Nordic Journal of English Studies 9(2), 69–97 (2010)
Google Scholar
Auria, C.P.L.: Signaling speaker’s intentions: towards a phraseology of textual metadiscourse in academic lecturing. English as a GloCalization Phenomenon. Observations from a Linguistic Microcosm 3, 59 (2006)
Google Scholar
Crismore, A., Markkanen, R., Steffensen, M.S.: Metadiscourse in persuasive writing. Written Communication 10(1), 39 (1993)
Article Google Scholar
Eskenazi, M., Levow, G.A., Meng, H., Parent, G.: Crowdsourcing for Speech Processing. John Wiley & Sons (2013)
Google Scholar
Klein, D., Manning, C.D.: Accurate unlexicalized parsing. In: Proceedings of the 41st Annual Meeting on Association for Computational Linguistics, vol. 1, pp. 423–430. Association for Computational Linguistics (2003)
Google Scholar
Lee, Y.K., Ng, H.T., Chia, T.K.: Supervised Word Sense Disambiguation with Support Vector Machines and Multiple Knowledge Sources. In: Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, pp. 137–140 (2004)
Google Scholar
Luukka, M.R.: Metadiscourse in academic texts. In: Conference on Discourse and the Professions, Uppsala, Sweden, vol. 28, pp. 77–88 (1992)
Google Scholar
Lyons, J.: Semantics, vol. 2. Cambridge University Press, Cambridge (1977)
Book Google Scholar
Maas, A.L., Daly, R.E., Pham, P.T., Huang, D., Ng, A.Y., Potts, C.: Learning Word Vectors for Sentiment Analysis. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 142–150. Association for Computational Linguistics (2011)
Google Scholar
Madnani, N., Heilman, M., Tetreault, J., Chodorow, M.: Identifying High-Level Organizational Elements in Argumentative Discourse. In: Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 20–28. Association for Computational Linguistics (2012)
Google Scholar
Mann, W.C., Thompson, S.A.: Rhetorical Structure Theory: Toward a functional theory of text organization. Text 8(3), 243–281 (1988)
Google Scholar
Marcu, D.: The Theory and Practice of Discourse Parsing and Summarization. The MIT Press (2000)
Google Scholar
Marcus, M.P., Marcinkiewicz, M.A., Santorini, B.: Building a Large Annotated Corpus of English: The Penn Treebank. Computational Linguistics 19(2), 313–330 (1993)
Google Scholar
Mauranen, A.: Reflexive academic talk: Observations from MICASE. In: Corpus linguistics in North America: Selections from the 1999 Symposium, pp. 165–178 (2001)
Google Scholar
Miltsakaki, E., Robaldo, L., Lee, A., Joshi, A.: Sense Annotation in the Penn Discourse Treebank. In: Gelbukh, A. (ed.) CICLing 2008. LNCS, vol. 4919, pp. 275–286. Springer, Heidelberg (2008)
Chapter Google Scholar
Purver, M.R.J.: The Theory and Use of Clarification Requests in Dialogue. Ph.D. thesis. Citeseer (2004)
Google Scholar
Rodríguez, K.J., Schlangen, D.: Form, Intonation and Function of Clarification Requests in German task-oriented spoken dialogues. In: Proceedings of Catalog (the 8th Workshop on the Semantics and Pragmatics of Dialogue; SemDial 2004) (2004)
Google Scholar
Römer, U., Swales, J.M.: The Michigan Corpus of Upper-level Student Papers (MICUSP). Journal of English for Academic Purposes (April 2009)
Google Scholar
Simpson, R.C., Briggs, S.L., Ovens, J., Swales, J.M.: The Michigan Corpus of Academic Spoken English (2002)
Google Scholar
Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment strength detection in short informal text. Journal of the American Society for Information Science and Technology 61(12), 2544–2558 (2010)
Article Google Scholar
Wilson, S.: Distinguishing Use and Mention in Natural Language. In: Proceedings of the NAACL HLT 2010 Student Research Workshop, pp. 29–33. Association for Computational Linguistics (2010)
Google Scholar
Wilson, S.: The Creation of a Corpus of English Metalanguage. In: Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers, vol. 1, pp. 638–646. Association for Computational Linguistics (2012)
Google Scholar
Xiong, W., Litman, D.: Identifying Problem Localization in Peer-Review Feedback. In: Aleven, V., Kay, J., Mostow, J. (eds.) ITS 2010, Part II. LNCS, vol. 6095, pp. 429–431. Springer, Heidelberg (2010)
Chapter Google Scholar
Zaidan, O.F., Callison-Burch, C.: Crowdsourcing Translation: Professional Quality from Non-Professionals. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol. 1, pp. 1220–1229 (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

L2F – INESC-ID / Instituto Superior Técnico, Universidade de Lisboa, Portugal
Rui Correia & Nuno Mamede
Language Technologies Institute, Carnegie Mellon University, USA
Rui Correia & Maxine Eskenazi
Universidade do Algarve, Portugal
Jorge Baptista

Authors

Rui Correia
View author publications
You can also search for this author in PubMed Google Scholar
Nuno Mamede
View author publications
You can also search for this author in PubMed Google Scholar
Jorge Baptista
View author publications
You can also search for this author in PubMed Google Scholar
Maxine Eskenazi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute of Computer Science, Polish Academy of Sciences, ul. Jana Kazimierza 5, 01-248, Warsaw, Poland
Adam Przepiórkowski & Maciej Ogrodniczuk &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Correia, R., Mamede, N., Baptista, J., Eskenazi, M. (2014). Toward Automatic Classification of Metadiscourse. In: Przepiórkowski, A., Ogrodniczuk, M. (eds) Advances in Natural Language Processing. NLP 2014. Lecture Notes in Computer Science(), vol 8686. Springer, Cham. https://doi.org/10.1007/978-3-319-10888-9_27

Download citation

DOI: https://doi.org/10.1007/978-3-319-10888-9_27
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10887-2
Online ISBN: 978-3-319-10888-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics