Abstract
This paper presents the Stars2 corpus of definite descriptions for referring expression generation (REG). The corpus was produced in collaborative communication involving speaker-hearer pairs, and includes situations of reference that are arguably under-represented in similar work. Stars2 is intended as an incremental contribution to research in REG and related fields. It may be used both as training/test data for REG algorithms and to gain further insights into reference phenomena in general, with a particular focus on the issue of attribute choice in referential overspecification.
Notes
Available from http://ivandreparaboni.wix.com/research#!visconde/cwwa.
GRE3D7 originally contained six instances of description involving two landmark objects but, as discussed in Viethen and Dale (2011), these exceptional cases were removed from the data.
For a more comprehensive comparison among REG algorithms see, for instance, the use of TUNA data in van Deemter et al. (2012).
References
Belke, E., & Meyer, A. (2002). Tracking the time course of multidimensional stimulus discrimination. European Journal of Cognitive Psychology, 14(2), 237–266.
Byron, D., Koller, A., Oberlander, J., Stoia, L., & Striegnitz, K. (2007). Generating instructions in virtual environments (GIVE): A challenge and evaluation testbed for NLG. In Workshop on shared tasks and comparative evaluation in natural language generation.
Clarke, A. D. F., Elsner, M., & Rohde, H. (2013). Where’s Wally: The influence of visual salience on referring expression generation. Frontiers in Psychology, 4, 329. doi:10.3389/fpsyg.2013.00329.
Dale, R. (2002). Cooking up referring expressions. In Proceedings of the 27th Annual Meeting of the Association for Computational Linguistics, (pp. 68–75).
Dale, R., & Viethen, J. (2009). Referring expression generation through attribute-based heuristics. In Proceedings of ENLG-2009, (pp. 58–65).
Dale, R., & Haddock, N. J. (1991). Content determination in the generation of referring expressions. Computational Intelligence, 7(4), 252–265.
Dale, R., & Reiter, E. (1995). Computational interpretations of the Gricean maxims in the generation of referring expressions. Cognitive Science, 19(2), 233–263.
de Lucena, D. J., Paraboni, I., & Pereira, D. B. (2010). From semantic properties to surface text: The generation of domain object descriptions. Inteligencia Artificial. Revista Iberoamericana de Inteligencia Artificial, 14(45), 48–58.
Dice, L. R. (1945). Measures of the amount of ecologic association between species. Ecology, 26(3), 297–302.
dos Santos Silva, D., & Paraboni, I. (2015). Generating spatial referring expressions in interactive 3D worlds. Spatial Cognition & Computation, 15(3), 186–225. doi:10.1080/13875868.2015.1039166.
Ferreira, T. C., & Paraboni, I. (2014a). Classification-based referring expression generation. Lecture Notes in Computer Science, 8403, 481–491.
Ferreira, T. C., & Paraboni, I. (2014b). Referring expression generation: Taking speakers’ preferences into account. Lecture Notes in Artificial Intelligence, 8655, 539–546.
FitzGerald, N., Artzi, Y., & Zettlemoyer, L. (2013). Learning distributions over logical forms for referring expression generation. In Proceedings of the 2013 conference on empirical methods in natural language processing, (pp. 1914–1925). Association for Computational Linguistics.
Gatt, A., Belz, A., & Kow, E. (2009). The TUNA challenge 2009: Overview and evaluation results. In Proceedings of the 12th European workshop on natural language generation, (pp. 174–182).
Gatt, A., Krahmer, E., van Gompel, R., & van Deemter, K. (2013). Production of referring expressions: Preference trumps discrimination. 35th meeting of the cognitive science society, (pp. 483–488).
Gatt, A., van der Sluis, I., & van Deemter, K. (2007). Evaluating algorithms for the generation of referring expressions using a balanced corpus. Proceedings of ENLG-07.
Gorniak, P., & Roy, D. (2004). Grounded semantic composition for visual scenes. Journal of Artificial Intelligence Research, 21, 429–470.
Grice, H. P. (1975). Logic and conversation. In P. Cole & J. L. Morgan (Eds.), Syntax and semantics (Vol. 3). New York: Academic Press.
Kazemzadeh, S., Ordonez, V., Matten, M., & Berg, T. (2014). ReferItGame: Referring to objects in photographs of natural scenes. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), (pp. 787–798). Association for Computational Linguistics.
Kelleher, J. D., & Costello, F. J. (2009). Applying computational models of spatial prepositions to visually situated dialog. Computational Linguistics, 35(2), 271–306. doi:10.1162/coli.06-78-prep14.
Krahmer, E., & van Deemter, K. (2012). Computational generation of referring expressions: A survey. Computational Linguistics, 38(1), 173–218.
Mitchell, M., van Deemter, K., & Reiter, E. (2010). Natural reference to objects in a visual domain. In Proceedings of INLG-2010. Association for Computational Linguistics.
Paraboni, I. (2000). An algorithm for generating document-deictic references. In Proceedings of workshop coherence in generated multimedia, associated with first int. conf. on natural language generation (INLG-2000), Mitzpe Ramon, (pp. 27–31).
Paraboni, I., & van Deemter, K. (2014). Reference and the facilitation of search in spatial domains. Language, Cognition and Neuroscience, 29(8), 1002–1017.
Passonneau, R. (2006). Measuring agreement on set-valued items (MASI) for semantic and pragmatic annotation. In Proceedings of the international conference on language resources and evaluation (LREC).
Pechmann, T. (1989). Incremental speech production and referential overspecification. Linguistics, 27(1), 98–110.
Reiter, E., & Dale, R. (2000). Building natural language generation systems. New York, NY, USA: Cambridge University Press.
Teixeira, C. V. M., Paraboni, I., da Silva, A. S. R., & Yamasaki, A. K. (2014). Generating relational descriptions involving mutual disambiguation. Lecture Notes in Computer Science, 8403, 492–502.
van Deemter, K., Gatt, A., van der Sluis, I., & Power, R. (2012). Generation of referring expressions: Assessing the incremental algorithm. Cognitive Science, 36(5), 799–836.
van Gompel, R., Gatt, A., Krahmer, E., & van Deemter, K. (2014). Testing computational models of reference generation as models of human language production: The case of size contrast. In Refnet workshop on psychological and computational models of reference comprehension and production. Edinburgh, Scotland.
Viethen, J., & Dale, R. (2011). GRE3D7: A corpus of distinguishing descriptions for objects in visual scenes. In Proceedings of UCNLG+Eval-2011, (pp. 12–22).
Acknowledgments
This work has been supported by FAPESP and the University of São Paulo. The authors are also grateful to all the participants in the data collection.
Cite this article
Paraboni, I., Galindo, M.R. & Iacovelli, D. Stars2: a corpus of object descriptions in a visual domain. Lang Resources & Evaluation 51, 439–462 (2017). https://doi.org/10.1007/s10579-016-9350-y
DOI: https://doi.org/10.1007/s10579-016-9350-y