A Resource-Driven Approach for Anchoring Linguistic Resources to Conceptual Spaces

Lieto, Antonio; Mensa, Enrico; Radicioni, Daniele P.

doi:10.1007/978-3-319-49130-1_32

Antonio Lieto¹⁷,
Enrico Mensa¹⁷ &
Daniele P. Radicioni¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10037))

Included in the following conference series:

Conference of the Italian Association for Artificial Intelligence

1443 Accesses

Abstract

In this paper we introduce the ttcs system, so named after Terms To Conceptual Spaces, that exploits a resource-driven approach relying on BabelNet, NASARI and ConceptNet. ttcs takes in input a term and its context of usage and produces as output a specific type of vector-based semantic representation, where conceptual information is encoded through the Conceptual Spaces (a geometric framework for common-sense knowledge representation and reasoning). The system has been evaluated in a twofold experimentation. In the first case we assessed the quality of the extracted common-sense conceptual information with respect to human judgments with an online questionnaire. In the second one we compared the performances of a conceptual categorization system that was run twice, once fed with extracted annotations and once with hand-crafted annotations. In both cases the results are encouraging and provide precious insights to make substantial improvements.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

COVER: a linguistic resource combining common sense and lexicographic information

Article 21 June 2018

USI: a fast and accurate approach for conceptual document annotation

Article Open access 14 March 2015

Annotating Concept Abstractness by Common-Sense Knowledge

Notes

1.
Recently the convexity constraint of conceptual spaces has been argued as a plausible but not necessary condition for the characterisation of concepts within this framework in the case, for example, of the adoption of non-euclidean metrics (see [4]). In our case-study we considered such constraint as proposed in the original theory since we didn’t consider non-euclidean metrics.
2.
In fact, we remark that differently from CSs, formal ontologies are not suited for representing defeasible, prototypical knowledge and for dealing with the corresponding typicality-based conceptual reasoning (e.g., non-monotonic inference). For example, for the concept dog, OpenCyc does not represent that “typically” dogs bark and woof because common-sense traits are not necessary/sufficient for defining this category.
3.
Resources marked with emphasized fonts are harmonized in UBY in both the English and the German version.
4.
Typically, the context is composed by one or more sentences; without loss of generality, in the present setting the context has been retrieved by accessing the DBPedia page associated to t.
5.
NASARI unified vectors are composed by a head concept (represented by its ID in the first position) and a body, that is a list of synsets related to the head concept. Each synset ID is followed by a number that grasps its correlation with the head concept. It is worth noting that in order to reduce the number of required accesses to BabelNet we built an all-in-one resource that maps each ID referred in NASARI vectors onto its synset terms.
6.
$\alpha $ is presently set to 100.
7.
$\beta $ is presently set to 3.
8.
We note that the presence of $t'_{i}$ in the vector of c is guaranteed only if $t'_{i}$ was detected as relevant through the first relevance condition. So, if $t'_{i}$ does not appear in the vector of $c^t$, the identification process fails, and the term will not be added to the result set C.
9.
Correctly identified concepts are those for which the whole procedure produces an output.
10.
Questionnaires are available at: http://goo.gl/am0S2f.
11.
We acknowledge that compared to similar experiments (such as [26, 27]) such data is rather small, and defer to future work an extensive evaluation.
12.
As the ratio between the number of deletions expressed for a given statement and the number of assessments obtained by that statement: e.g., the statement ‘Soap has function of scent’ has been questioned by 2 participants out of 12. The agreement on such deletion was computed as $2/12=16.7\,\%$.
13.
In essence, the employed system executes a two-steps categorization process: it first computes a result based on Conceptual Spaces, and it then checks the validity of the obtained result against an ontological knowledge base.

References

Navigli, R., Ponzetto, S.P.: BabelNet: the automatic construction, evaluation and application of a wide-coverage multilingual semantic network. Artif. Intell. 193, 217–250 (2012)
Article MathSciNet MATH Google Scholar
Camacho-Collados, J., Pilehvar, M.T., Navigli, R.: NASARI: a novel approach to a semantically-aware representation of items. In: Proceedings of NAACL, pp. 567–577 (2015)
Google Scholar
Speer, R., Havasi, C.: Representing general relational knowledge in ConceptNet 5. In: LREC, pp. 3679–3686 (2012)
Google Scholar
Hernández-Conde, J.V.: A case against convexity in conceptual spaces. Synthese, 1–27 (2016)
Google Scholar
Gärdenfors, P.: The Geometry of Meaning: Semantics Based on Conceptual Spaces. MIT Press, Cambridge (2014)
MATH Google Scholar
Rosch, E.: Cognitive representations of semantic categories. J. Exp. Psychol. Gen. 104, 192–233 (1975)
Article Google Scholar
Lenat, D.B., Prakash, M., Shepherd, M.: CYC: using common sense knowledge to overcome brittleness and knowledge acquisition bottlenecks. AI Mag. 6, 65 (1985)
Google Scholar
Frixione, M., Lieto, A.: Representing concepts in formal ontologies: compositionality vs. typicality effects. Logic Logical Philos. 21, 391–414 (2012)
MathSciNet Google Scholar
Reeve, L., Han, H.: Survey of semantic annotation platforms. In: Proceedings of the 2005 ACM symposium on Applied computing, pp. 1634–1638. ACM (2005)
Google Scholar
Derrac, J., Schockaert, S.: Inducing semantic relations from conceptual spaces: a data-driven approach to plausible reasoning. Artif. Intell. 228, 66–94 (2015)
Article MathSciNet MATH Google Scholar
Pennington, J., Socher, R., Manning, C.D.: Glove: global vectors for word representation. EMNLP 14, 1532–1543 (2014)
Google Scholar
Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S., Dean, J.: Distributed representations of words and phrases and their compositionality. In: Advances in Neural Information Processing Systems, pp. 3111–3119 (2013)
Google Scholar
Turney, P.D., Pantel, P., et al.: From frequency to meaning: vector space models of semantics. J. Artif. Intell. Res. 37(1), 141–188 (2010)
MathSciNet MATH Google Scholar
Miller, G.A.: WordNet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)
Article Google Scholar
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley framenet project. In: Proceedings of the 17th International Conference on Computational Linguistics. Association for Computational Linguistics, vol. 1, pp. 86–90 (1998)
Google Scholar
Levin, B.: English Verb Classes and Alternations: A Preliminary Investigation. University of Chicago Press, Chicago (1993)
Google Scholar
Havasi, C., Speer, R., Alonso, J.: ConceptNet: a lexical resource for common sense knowledge. Recent Adv. Nat. Lang. Process. V Sel. Pap. RANLP 309, 269 (2007)
Article Google Scholar
Eckle-Kohler, J., Gurevych, I., Hartmann, S., Matuschek, M., Meyer, C.M.: UBY-LMF-a uniform model for standardizing heterogeneous lexical-semantic resources in ISO-LMF. In: LREC, pp. 275–282 (2012)
Google Scholar
Gurevych, I., Eckle-Kohler, J., Hartmann, S., Matuschek, M., Meyer, C.M., Wirth, C.: UBY: a large-scale unified lexical-semantic resource based on LMF. In: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, pp. 580–590 (2012)
Google Scholar
McCrae, J., Montiel-Ponsoda, E., Cimiano, P.: Integrating WordNet and Wiktionary with lemon. In: Chiarcos, C., Nordhoff, S., Hellmann, S. (eds.) Linked Data in Linguistics, pp. 25–34. Springer, Heidelberg (2012)
Chapter Google Scholar
Buitelaar, P., Cimiano, P., Haase, P., Sintek, M.: Towards linguistically grounded ontologies. In: Aroyo, L., et al. (eds.) ESWC 2009. LNCS, vol. 5554, pp. 111–125. Springer, Heidelberg (2009). doi:10.1007/978-3-642-02121-3_12
Chapter Google Scholar
Tonelli, S., Pianta, E.: A novel approach to mapping framenet lexical units to Wordnet synsets. In: Proceedings of the Eighth International Conference on Computational Semantics. Association for Computational Linguistics, pp. 342–345 (2009)
Google Scholar
Ferrández, O., Ellsworth, M., Munoz, R., Baker, C.F.: Aligning FrameNet and WordNet based on semantic neighborhoods. LREC 10, 310–314 (2010)
Google Scholar
Pilehvar, M.T., Jurgens, D., Navigli, R.: Align, disambiguate and walk: a unified approach for measuring semantic similarity. In: ACL, vol. 1, pp. 1341–1351 (2013)
Google Scholar
Bejan, A., Marden, J.H.: Constructing animal locomotion from new thermadynamics theory. Am. Sci. 94, 342–349 (2006)
Article Google Scholar
Tonelli, S., Pighin, D.: New features for framenet: Wordnet mapping. In: Proceedings of the Thirteenth Conference on Computational Natural Language Learning. Association for Computational Linguistics, pp. 219–227 (2009)
Google Scholar
De Cao, D., Croce, D., Basili, R.: Extensive evaluation of a framenet-wordnet mapping resource. In: LREC (2010)
Google Scholar
Lieto, A., Minieri, A., Piana, A., Radicioni, D.P.: A knowledge-based system for prototypical reasoning. Connection Sci. 27, 137–152 (2015)
Article Google Scholar
Lieto, A., Radicioni, D.P., Rho, V.: A common-sense conceptual categorization system integrating heterogeneous proxytypes and the dual process of reasoning. In: Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), pp. 875–881. AAAI Press, Buenos Aires (2015)
Google Scholar
Lieto, A., Radicioni, D.P., Rho, V.: Dual PECCS: a cognitive system for conceptual representation and categorization. J. Exp. Theor. Artif. Intell., 1–20 (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Informatica, Università degli Studi di Torino, Turin, Italy
Antonio Lieto, Enrico Mensa & Daniele P. Radicioni

Authors

Antonio Lieto
View author publications
You can also search for this author in PubMed Google Scholar
Enrico Mensa
View author publications
You can also search for this author in PubMed Google Scholar
Daniele P. Radicioni
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daniele P. Radicioni .

Editor information

Editors and Affiliations

University of Genoa , Genova, Italy
Giovanni Adorni
University of Parma , Parma, Italy
Stefano Cagnoni
University of Siena , Siena, Italy
Marco Gori
University of Genova , Genova, Italy
Marco Maratea

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lieto, A., Mensa, E., Radicioni, D.P. (2016). A Resource-Driven Approach for Anchoring Linguistic Resources to Conceptual Spaces. In: Adorni, G., Cagnoni, S., Gori, M., Maratea, M. (eds) AI*IA 2016 Advances in Artificial Intelligence. AI*IA 2016. Lecture Notes in Computer Science(), vol 10037. Springer, Cham. https://doi.org/10.1007/978-3-319-49130-1_32

Download citation

DOI: https://doi.org/10.1007/978-3-319-49130-1_32
Published: 05 November 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-49129-5
Online ISBN: 978-3-319-49130-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics