Abstract
Although superlatives are commonly used in natural language, so far there has been no large-scale computational investigation of the types of comparisons they express. This article describes a comprehensive annotation scheme for superlatives, which classifies superlatives according to their surface forms and motivates an initial focus on so-called “ISA superlatives”. This type of superlative comparison is especially suitable for a computational approach because both their targets and comparison sets are explicitly realised in the text, and the proposed annotation scheme offers guidelines for annotating the spans of such comparative elements. The annotations are tested and evaluated on 500 tokens of superlatives with good inter-annotator agreement. In addition to providing a platform for investigating superlatives on a larger scale, this research also introduces a new text-based Wikipedia corpus in which all superlative instances have been annotated according to the proposed annotation scheme, and which has been used to develop a tool that can reliably distinguish between different superlative types, and identify the comparative components of ISA superlatives.


Similar content being viewed by others
Notes
Superlative targets (T S ) and their comparison sets (CS) are defined as shown in Sect. 1.
“Qid” refers to the question ID in the TREC data set.
I would like to thank Nitin Jindal and Bing Liu for making their data available.
Altogether, 285 sentences were labelled as non-equal, 110 as equative, and 169 as superlative.
I would like to thank Malvina Nissim and Johan Bos for making their data available.
Sentence mark-up was added automatically using the TTT2 tools (Grover and Tobin 2006).
Email the author at Silke.Scheible@manchester.ac.uk.
For the ISA class, there is also a distinction to be made between Type 1 and Type 2.
Note that the target may be expressed in terms of a pronoun.
For a more detailed description of the ISA-2 class, see Scheible (2009).
Here, square brackets indicate a list of restrictors.
The corpus includes sentence mark-up added by Jijkoun and de Rijke (2006).
References
Bauer, L. (1994). Watching English change. London: Longman.
Biber, D., Johansson, S., Conrad, S., & Finnegan, E. (1999). Longman grammar of spoken and written English. Harlow, England: Pearson Education Ltd.
Bos, J. (2006). The “La Sapienza” question answering system at TREC-2006. In E. M. Voorhees & L. P. Buckland (Eds.), The 15th text retrieval conference (TREC 2006) (pp. 797–803). Maryland: Gaithersburg.
Bos, J., & Nissim, M. (2006). An empirical approach to the interpretation of superlatives. In Proceedings of the 2006 conference on empirical methods in natural language processing (EMNLP 2006) (pp. 9–17), Sydney, Australia. Association for Computational Linguistics.
Bruce, R. F., & Wiebe, J. M. (1999). Recognizing subjectivity: A case study of manual tagging. Natural Language Engineering, 5, 187–205.
Clark, S., & Curran, J. R. (2004). Parsing the WSJ using CCG and log-linear models. In Proceedings of the 42nd annual meeting of the association for computational linguistics (ACL 2004) (pp. 104–111), Morristown, NJ. Association for Computational Linguistics.
Corver, N., & Matushansky, O. (2006). At our best when at our boldest. Handout, TIN-dag.
Denoyer, L., & Gallinari, P. (2006). The Wikipedia XML corpus. SIGIR Forum, 40(1), 64–69.
Grover, C., & Tobin, R. (2006). Rule-based chunking and reusability. In Proceedings of the 2006 conference on language resources and evaluation (LREC 2006) (pp. 873–878), Genoa, Italy.
Hatzivassiloglou, V., & Wiebe, J. M. (2000). Effects of adjective orientation and gradability on sentence subjectivity. In Proceedings of the 18th conference on computational linguistics (pp. 299–305). Morristown, NJ: Association for Computational Linguistics.
Heim, I. (1985). Notes on comparatives and related matters. Manuscript, University of Texas, Austin
Heim, I. (1999). Notes on superlatives. Manuscript, MIT.
Huddleston, R., & Pullum, G. K. (2002). The Cambridge grammar of the English language. New York: Cambridge University Press.
Jijkoun, V., & de Rijke, M. (2006). Overview of the WiQA task at CLEF 2006. In CLEF (pp. 265–274).
Jindal, N., & Liu, B. (2006). Mining comparative sentences and relations. In Proceedings of the 2006 conference on artificial intelligence (AAAI 2006).
Kaißer, M., Scheible, S., & Webber, B. (2006). Experiments at the University of Edinburgh for the TREC 2006 QA track. In E. M. Voorhees & L. P. Buckland (Eds.), The 15th text retrieval conference (TREC 2006). Maryland: Gaithersburg.
Kennedy, C. (1997). Comparison and polar opposition. In Proceedings of the 7th annual conference on semantics and linguistic theory SALT VII (pp. 240–257). CLC Publications.
Klein, E. (1980). A semantics for positive and comparative adjectives. Linguistics and Philosophy, 4(1), 1–45.
Lehrer, A. (1974). Semantic fields and lexical structure. Amsterdam: North-Holland.
Lin, J., & Demner-Fushman, D. (2006). Will pyramids built of nuggets topple over? In Proceedings of the main conference on human language technology conference of the North American chapter of the association of computational linguistics (HLT-NAACL 2006) (pp. 383–390). Morristown, NJ: Association for Computational Linguistics.
Lyons, J. (1977). Semantics. Cambridge: Cambridge University Press.
Merlo, P., & Ferrer, E. E. (2006). The notion of argument in PP attachment. Computational Linguistics, 32(2), 341–378.
Merlo, P., & Leybold, M. (2001). Automatic distinction of arguments and modifiers: The case of prepositional phrases. In Proceedings of the 5th computational natural language learning workshop (CoNLL—2001) (pp. 121–128).
Milosavljevic, M. (1999). The automatic generation of comparisons in descriptions of entities. In Microsoft Research Institute, Macquarie University, Sydney, Australia.
Quirk, R., Greenbaum, S., Leech, G., & Svartvik, J. (1985). A comprehensive grammar of the English language. Longman.
Razmara, M., & Kosseim, L. (2007). A little known fact is… Answering Other questions using interest-markers. In Proceedings of the 8th international conference on intelligent text processing and computational linguistics (CICLING 2007). Lecture Notes in Computer Science (Vol. 4394, pp. 518–529), Mexico City, Mexico. Springer.
Ross, J. R. (1964). A partial grammar of English superlatives. Unpublished Masters Thesis, University of Pennsylvania.
Scheible, S. (2007). Towards a computational treatment of superlatives. In Proceedings of the 45th annual meeting of the association for computational linguistics (ACL 2007), Prague, Czech Republic.
Scheible, S. (2009). A computational treatment of superlatives. PhD Thesis, University of Edinburgh.
Scheible, S. (2010). The smallest, cheapest, and best: Superlatives in opinion mining. In Proceedings of the ECAI-workshop on computational approaches to subjectivity and sentiment analysis (WASSA 2010) (pp. 52–58), Lisbon, Portugal.
Szabolsci, A. (1986). Comparative superlatives. In N. Fukui, T. R. Rapoport, & E. Sagey (Eds.), Papers in theoretical linguistics (pp. 245–265). Cambridge, MA: MIT Working Papers.
Acknowledgments
This work was supported by an EPSRC Doctoral Training Award at the School of Informatics, University of Edinburgh. I would like to thank Professor Bonnie Webber for her helpful comments and suggestions on this article.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Scheible, S. TextWiki: a superlative resource. Lang Resources & Evaluation 46, 635–666 (2012). https://doi.org/10.1007/s10579-011-9171-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10579-011-9171-y