Abstract
A query written in natural language (NL) may involve several linguistic problems that cause a query not being interpreted or translated correctly into SQL. One of these problems is implicit information or semantic ellipsis, which can be understood as the omission of important words in the wording of a query written in NL. An exhaustive survey on NLIDB works has revealed that most of these works has not systematically dealt with semantic ellipsis. In experiments conducted on commercial NLIDBs, very poor results have been obtained (7% to 16.9%) when dealing with query corpora that involve semantic ellipsis. In this paper we propose a dialogue manager (DM) for a NLIDB for solving semantic ellipsis problems. The operation of this DM is based on a typification of elliptical problems found in queries, which permits to systematically deal with this problem. Additionally, the typification has two important characteristics: domain independence, which permits the typification to be applied to queries of different databases, and generality, which means that it holds for different languages such as English, French, Italian, Spanish, etc. These characteristics are inherited to the dialogue processes implemented in the DM, since they are based on this typification. In experiments conducted with this DM and a NLIDB on a corpus of elliptical queries, an increase of correctly answered queries of 30-35% was attained.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
González, J.J.: Traductor de Lenguaje Natural Español a SQL para un Sistema de Consultas a Bases de Datos. PhD dissertation. Computer Sci. Dept., Centro Nacional de Investigación y Desarrollo Tecnológico, Cuernavaca, Mexico (2005)
Pazos, R.A., Pérez, J., González, J.J., Gelbukh, A., Sidorov, G., Rodríguez, M.J.: A Domain Independent Natural Language Interface to Databases Capable of Processing Complex Queries. In: Gelbukh, A., de Albornoz, Á., Terashima-Marín, H. (eds.) MICAI 2005. LNCS (LNAI), vol. 3789, pp. 833–842. Springer, Heidelberg (2005)
González, J.J., Pazos, R.A., Cruz, C., Fraire, H.J., Aguilar, S., Pérez, J.: Issues in Translating from Natural Language to SQL in a Domain-Independent Natural Language Interface to Databases. In: Gelbukh, A., Reyes-Garcia, C.A. (eds.) MICAI 2006. LNCS (LNAI), vol. 4293, pp. 922–931. Springer, Heidelberg (2006)
Rocher, G.: Traducción de Queries en Prolog a SQL. BS thesis, Universidad de las Américas, Puebla, Mexico (1999)
CORDIS: Telematics for Libraries - Projects, VILIB (1999), http://cordis.europa.eu/libraries/en/projects/vilib.html
Chae, J., Lee, S.: Frame Based Decomposition Method for Korean Language Query Processing. Computer Processing of Oriental Languages 11(4), 353–379 (1998)
GPLSI: Procesamiento del Lenguaje Natural (1998), http://gplsi.dlsi.ua.es/gplsi/areas.htm
Microsoft TechNet, Chapter 32- English Query Best Practices (2009), http://technet.microsoft.com/es-mx/library/cc917659en-us.aspx
Popescu, A., Etzioni, O., Kautz, H.: Towards a Theory of Natural Language Interfaces to Databases. In: Proc. International Conference on Intelligent User Interfaces, Miami, USA, pp. 149–157 (2003)
ELF Software, ELF Software Documentation Series (2002), http://www.elfsoft.com/help/accelf/overview.htm
Reis, P., Matias, J., Mamede, N.: Edite - A Natural Language Interface to Databases: a New Dimension for an Approach. In: Proc. 4th International Conference on Information and Communication Technology in Tourism, Edinburgh, Scottland (1997)
Cercone, N., Mcfetridge, P., Popowish, F., Fass, D., Groeneboer, C., Hall, G.: The System X Natural Language Interface: Design, Implementation and Evaluation. Technical report. Centre for System Science, Simon Fraser University, British Columbia, Canada (1993)
Androutsopoulus, I., Ritchie, G., Thanish, P.: MASQUE/SQL, an Efficient and Portable Natural Language Query Interface for Relational Databases. In: Proc. 6th International Conference on Industrial & Engineering Applications of Artificial Intelligence and Expert Systems, Edinburgh, UK, pp. 327–330 (1993)
Minock, M.: A STEP Towards Realizing Codd’s Vision of Rendezvous with the Casual User. In: Proc. 33rd International Conference on Very Large Databases, Vienna, Austria, pp. 1358–1361 (2007)
Minock, M.: Natural Language Access to Relational Databases through STEP. Technical report. Dept. Computer Science, University of Umea, Umea, Sweden (2004)
Bagnasco, C., Bresciani, P., Magnini, B., Strapparava, C.: Natural Language Interpretation for Public Administration Database Querying in the TAMIC Demonstrator. In: Proc. 2nd International Workshop on Applications of Natural Language to Information Systems, Amsterdam, The Netherlands (1996)
Chu, W., Yang, H., Chiang, K., Minock, M., Chow, G., Larson, C.: Cobase: A Scalable and Extensible Cooperative Information System. Journal of Intelligent Information Systems 6, 223–259 (1996)
Ott, N.: Aspects of the Automatic Generation of SQL Statements in a Natural Language Query Interface. Information Systems 17(2), 147–159 (1992)
Boldasov, M.V., Sokolova, G.E.: QGen – Generation Module for the Register Restricted InBASE System. In: Gelbukh, A. (ed.) CICLing 2003. LNCS, vol. 2588, pp. 465–476. Springer, Heidelberg (2003)
DARPA Air Travel Information System, ATIS0 (1990), http://www.ldc.upenn.edu/Catalog/readme_files/atis/sdtd/trn_prmp.html
Rojas, J.C.: Administrador de Diálogo para una Interfaz de Lenguaje Natural a Bases de Datos. PhD dissertation. Centro Nacional de Investigación y Desarrollo Tecnológico, Cuernavaca, Mexico (2009)
Pazos, R., Santaolaya, R., Rojas, J.C., Pérez, J.: Shedding Light on a Troublesome Issue in NLIBDs: Word Economy in Query Formulation. LNCS, pp. 641–648. Springer, Heidelberg (2008)
Pazos, R.A., Santaolaya, R., Rojas, J.C., Martínez, J.A., Pérez, J., Cruz, L.: Domain Independent Dialog Processes for Solving the Word-Economy Problem in a NLIDB. Polish Journal of Environmental Studies 17(4C), 457–462 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pazos R., R.A., Rojas P., J.C., Santaolaya S., R., Martínez F., J.A., Gonzalez B., J.J. (2010). Dialogue Manager for a NLIDB for Solving the Semantic Ellipsis Problem in Query Formulation. In: Setchi, R., Jordanov, I., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based and Intelligent Information and Engineering Systems. KES 2010. Lecture Notes in Computer Science(), vol 6277. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15390-7_21
Download citation
DOI: https://doi.org/10.1007/978-3-642-15390-7_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15389-1
Online ISBN: 978-3-642-15390-7
eBook Packages: Computer ScienceComputer Science (R0)