Stars2: a corpus of object descriptions in a visual domain

Paraboni, Ivandré; Galindo, Michelle Reis; Iacovelli, Douglas

doi:10.1007/s10579-016-9350-y

Stars2: a corpus of object descriptions in a visual domain

Original Paper
Published: 17 March 2016

Volume 51, pages 439–462, (2017)
Cite this article

Language Resources and Evaluation Aims and scope Submit manuscript

Ivandré Paraboni¹,
Michelle Reis Galindo¹ &
Douglas Iacovelli¹

277 Accesses
6 Citations
1 Altmetric
Explore all metrics

Abstract

This paper presents the Stars2 corpus of definite descriptions for referring expression generation (REG). The corpus was produced in collaborative communication involving speaker-hearer pairs, and includes situations of reference that are arguably under-represented in similar work. Stars2 is intended as an incremental contribution to the research in REG and related fields, and it may be used both as training/test data for algorithms of this kind, and also to gain further insights into reference phenomena in general, with a particular focus on the issue of attribute choice in referential overspecification.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Natural Language Processing

Embodied human language models vs. Large Language Models, or why Artificial Intelligence cannot explain the modal be able to

Article 07 February 2024

Semantic memory: A review of methods, models, and current challenges

Article 03 September 2020

Notes

Available from http://ivandreparaboni.wix.com/research#!visconde/cwwa.
GRE3D7 originally contained six instances of description involving two landmark objects but, as discussed in Viethen and Dale (2011) these exceptional cases were removed from the data.
For a more comprehensive comparison among REG algorithms see, for instance, the use of TUNA data in van Deemter et al. (2012).

References

Belke, E., & Meyer, A. (2002). Tracking the time course of multidimensional stimulus discrimination. European Journal of Cognitive Psychology, 14(2), 237–266.
Article Google Scholar
Byron, D., Koller, A., Oberlander, J., Stoia, L., & Striegnitz, K. (2007). In Generating instructions in virtual environments (GIVE): A challenge and evaluation testbed for NLG. Workshop on shared tasks and comparative evaluation in natural language generation.
Clarke, A. D. F., Elsner, M., & Rohde, H. (2013). Where’s Wally: The influence of visual salience on referring expression generation. Frontiers in Psychology, 4, 329. doi:10.3389/fpsyg.2013.00329.
Google Scholar
Dale, R. (2002). Cooking up referring expressions. In Proceedings of the 27th Annual Meeting of the Association for Computational Linguistics, (pp. 68–75).
Dale, R., & Viethen, J. (2009). Referring expression gene ration through attribute-based heuristics. In Proceedings of ENLG-2009, (pp. 58–65).
Dale, R., & Haddock, N. J. (1991). Content determination in the generation of referring expressions. Computational Intelligence, 7(4), 252–265.
Article Google Scholar
Dale, R., & Reiter, E. (1995). Computational interpretations of the Gricean maxims in the generation of referring expressions. Cognitive Science, 19(2), 233–263.
Article Google Scholar
de Lucena, D. J., Paraboni, I., & Pereira, D. B. (2010). From semantic properties to surface text: The generation of domain object descriptions. Inteligencia Artificial. Revista Iberoamericana de. Inteligencia Artificial, 14(45), 48–58.
Google Scholar
Dice, L. R. (1945). Measures of the amount of ecologic association between species. Ecology, 26(3), 297–302.
Article Google Scholar
dos Santos Silva, D., & Paraboni, I. (2015). Generating spatial referring expressions in interactive 3D worlds. Spatial Cognition & Computation, 15(03), 186–225. doi:10.1080/13875868.2015.1039166.
Article Google Scholar
Ferreira, T. C., & Paraboni, I. (2014a). Classification-based referring expression generation. Lecture Notes in Computer Science, 8403, 481–491.
Article Google Scholar
Ferreira, T. C., & Paraboni, I. (2014b). Referring expression generation: Taking speakers’ preferences into account. Lecture Notes in Artificial Intelligence, 8655, 539–546.
Google Scholar
FitzGerald, N., Artzi, Y., & Zettlemoyer, L. (2013). Learning distributions over logical forms for referring expression generation. In Proceedings of the 2013 conference on empirical methods in natural language processing, (pp. 1914–1925). Association for Computational Linguistics.
Gatt, A., Belz, A., & Kow, E. (2009). The TUNA challenge 2009: Overview and evaluation results. In Proceedings of the 12nd European workshop on natural language generation, (pp. 174–182).
Gatt, A., Krahmer, E., van Gompel, R., & van Deemter, K. (2013). Production of referring expressions: Preference trumps discrimination. 35th meeting of the cognitive science society, (pp. 483–488).
Gatt, A., van der Sluis, I., & van Deemter, K. (2007). Evaluating algorithms for the generation of referring expressions using a balanced corpus. Proceedings of ENLG-07.
Gorniak, P., & Roy, D. (2004). Grounded semantic composition for visual scenes. Journal of Artificial Intelligence Research, 21, 429–470.
Google Scholar
Grice, H. P. (1975). Logic and conversation Logic and conversation. In P. Cole & J. L. Morgan (Eds.), Syntax and semantics Syntax and semantics (Vol. 3). New York: Academic Press.
Google Scholar
Kazemzadeh, S., Ordonez, V., Matten, M., & Berg, T. (2014). ReferItGame: Referring to objects in photographs of natural scenes. In Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP), (pp. 787–798). Association for Computational Linguistics.
Kelleher, J. D., & Costello, F. J. (2009). Applying computational models of spatial prepositions to visually situated dialog. Computational Linguistics, 35(2), 271–306. doi:10.1162/coli.06-78-prep14.
Article Google Scholar
Krahmer, E., & van Deemter, K. (2012). Computational generation of referring expressions: A survey. Computational Linguistics, 38(1), 173–218.
Article Google Scholar
Mitchell, M., van Deemter, K., & Reiter, E. (2010). Natural reference to objects in a visual domain. Proceedings of INLG-2010. The Association for Computer Linguistics.
Paraboni, I. (2000). An algorithm for generating document-deictic references. In Proceedings of workshop coherence in generated multimedia, associated with first int. conf. on natural language generation (INLG-2000), Mitzpe Ramon, (pp. 27–31).
Paraboni, I., & van Deemter, K. (2014). Reference and the facilitation of search in spatial domains. Language, Cognition and Neuroscience, 29(8), 1002–1017.
Article Google Scholar
Passonneau, R. (2006). Measuring agreement on set-valued items (MASI) for semantic and pragmatic annotation. In Proceedings of the international conference on language resources and evaluation (LREC).
Pechmann, T. (1989). Incremental speech production and referential overspecification. Linguistics, 27(1), 98–110.
Article Google Scholar
Reiter, E., & Dale, R. (2000). Building natural language generation systems. New York, NY, USA: Cambridge University Press.
Book Google Scholar
Teixeira, C. V. M., Paraboni, I., da Silva, A. S. R., & Yamasaki, A. K. (2014). Generating relational descriptions involving mutual disambiguation. Lecture Notes in Computer Science, 8403, 492–502.
Article Google Scholar
van Deemter, K., Gatt, A., van der Sluis, I., & Power, R. (2012). Generation of referring expressions: Assessing the incremental algorithm. Cognitive Science, 36(5), 799–836.
Article Google Scholar
van Gompel, R., Gatt, A., Krahmer, E., & Deemter, K. V. (2014). Testing computational models of reference generation as models of human language production: The case of size contrast. In Refnet workshop on psychological and computational models of reference comprehension and production. Edinburgh, Scotland.
Viethen, J., & Dale, R. (2011). GRE3D7: A corpus of distinguishing descriptions for objects in visual scenes. In Proceedings of UCNLG+Eval-2011, (pp. 12–22).

Download references

Acknowledgments

This work has been supported by FAPESP and the University of São Paulo. The authors are also grateful to all the participants in the data collection.

Author information

Authors and Affiliations

School of Arts, Sciences and Humanities, University of São Paulo (USP/EACH), Av. Arlindo Bettio, São Paulo, 1000, Brazil
Ivandré Paraboni, Michelle Reis Galindo & Douglas Iacovelli

Authors

Ivandré Paraboni
View author publications
You can also search for this author in PubMed Google Scholar
Michelle Reis Galindo
View author publications
You can also search for this author in PubMed Google Scholar
Douglas Iacovelli
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ivandré Paraboni.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Paraboni, I., Galindo, M.R. & Iacovelli, D. Stars2: a corpus of object descriptions in a visual domain. Lang Resources & Evaluation 51, 439–462 (2017). https://doi.org/10.1007/s10579-016-9350-y

Download citation

Published: 17 March 2016
Issue Date: June 2017
DOI: https://doi.org/10.1007/s10579-016-9350-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Stars2: a corpus of object descriptions in a visual domain

Abstract

Access this article

Similar content being viewed by others

Natural Language Processing

Embodied human language models vs. Large Language Models, or why Artificial Intelligence cannot explain the modal be able to

Semantic memory: A review of methods, models, and current challenges

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Stars2: a corpus of object descriptions in a visual domain

Abstract

Access this article

Similar content being viewed by others

Natural Language Processing

Embodied human language models vs. Large Language Models, or why Artificial Intelligence cannot explain the modal be able to

Semantic memory: A review of methods, models, and current challenges

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation