Abstract
The need of the current Natural Language Processing applications to identify text segments that express the same meaning in different ways, evolved into the identification of semantic variability expressions. Most of the developed approaches focus on the text structure, such as the word overlaps, the distance between phrases or syntactic trees, word to word similarity, logic representation among others. However, current research did not identify how the global conceptual representation of a sentences can contribute to the resolution of this problem. In this paper, we present an approach where the meaning of a sentence is represented with the associated relevant domains. In order to determine the semantic relatedness among text segments, Latent Semantic Analysis is used. We demonstrate, evaluate and analyze the contribution of our conceptual representation approach in an evaluation with the paraphrase task.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Barzilay, R., McKeown, K.: Extracting paraphrases from a parallel corpus. In: ACL 2001, pp. 50–57 (2001)
Barzilay, R., McKeown, K.: Learning to paraphrase: An unsupervised approach using multiple-sequence alignment. In: HTLT-NAACL 2003, pp. 16–23 (2003)
Corley, C., Mihalcea, R.: Measures of text semantic similarity. In: Proceedings of the ACL workshop on Empirical Modeling of Semantic Equivalence (2005)
Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by latent semantic indexing. Journal of the American Society for Information Science 41, 321–407 (1990)
Dolan, B., Quirk, C., Brockett, C.: Unsupervised construction of large paraphrase corpora: Exploiting massively parallel news sources. In: Proceedings of the 20th International Conference on Computational Linguistics, Geneva, Switzerland (2004)
FellBaum, C.: WordNet, an electronic lexical database. MIT Press, Cambridge (1998)
Gonzalo, J., Verdejo, F., Peters, C., Calzolari, N.: Applying eurowordnet to cross-language text retrieval. pp. 113–135 (1998)
Kouylekov, M., Magnini, B.: Tree edit distance for recognizing textual entailment: Estimating the cost of insertion. In: Proceedings of the PASCAL Challenges Workshop on Recognising Textual Entailment, pp. 17–20 (2006)
Kozareva, Z., Montoyo, A.: Paraphrase identification on the basis of supervised machine learning techniques. In: FinTAL, pp. 524–533 (2006)
Landauer, T., Dumais, S.: A solution to plato’s problem: The latent semantic analysis theory of acquisition. Psychological Review, 211–240 (1997)
Lin, D., Pantel, P.: Discovery of inference rules for question answering. Natural Language Engineering 4(7), 343–360
Magnini, B., Cavaglia, G.: Integrating Subject Field Codes into WordNet. In: Gavrilidou, M., Crayannis, G., Markantonatu, S., Piperidis, S., Stainhaouer, G. (eds.) Proceedings of LREC-2000, Second International Conference on Language Resources and Evaluation, Athens, Greece, pp. 1413–1418 (2000)
Magnini, B., Strapparava, C., Pezzulo, G., Gliozzo, A.: Using domain information for word sense disambiguation. In: SENSEVAL-2 (2001)
Muñoz, R., Montoyo, A.: Definite description resolution enrichment with wordnet domain labels. In: IBERAMIA, pp. 645–654 (2002)
Niles, I., Pease, A.: Linking lexicons and ontologies: Mapping wordnet to the suggested upper merged ontology. In: Proceedings of the 2003 International Conference on Information and Knowledge Engineering (IKE 03). Las Vegas, Nevada (2003)
Schmid, H.: Probabilistic part-of-speech tagging using decision trees. In: International Conference on New Methods in Language Processing, Manchester, UK (1994)
Stevenson, M., Greenwood, M.A.: Learning information extraction patterns using wordnet. In: Proceedings of the 3rd International Conference of the Global WordNet Association (GWA’06) (2006)
Szpektor, I., Tanev, H., Dagan, I., Coppola, B.: Scaling web-based acquisition of entailment relations. In: Proceedings of Empirical Methods in Natural Language Processing (2004)
Vázquez, S., Montoyo, A., Rigau, G.: Using relevant domains resource for word sense disambiguation. In: IC-AI, pp. 784–789 (2004)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kozareva, Z., Vázquez, S., Montoyo, A. (2007). The Usefulness of Conceptual Representation for the Identification of Semantic Variability Expressions. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2007. Lecture Notes in Computer Science, vol 4394. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-70939-8_29
Download citation
DOI: https://doi.org/10.1007/978-3-540-70939-8_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-70938-1
Online ISBN: 978-3-540-70939-8
eBook Packages: Computer ScienceComputer Science (R0)