Abstract
In this paper, a new molecular alignment based recognition method for question answering from from the Web is proposed. This identifies locations using an molecular alignment sequence algorithm according to their similarity with a user natural-language question. Different experiments and results concerning questions on locations are discussed. The high accuracy of the proposed alignment strategy shows the promise of approach to effectively deal with questions extracted from natural-language corpus which contain many complex patterns.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Notes
In the scope of this baseline, the underlying language model is a unigram model.
References
Ahn, K., Bos, J., Curran, J. R., Kor, D., Nissim, M., & Webber, B. (2005). Question answering with QED at TREC-2005. In Proceedings of the fourteenth text retrieval conference (TREC 2005).
Benner, S., Cohen, M., & Gonnet, G. (1993). Empirical and structural models for insertions and deletions in the divergent evolution of proteins. Journal of Molecular Biology, 229, 1065–1082.
Chen, J., Ge, H., Wu, Y., & Jiang, S. (2004). Question answering combining multiple evidences. In Proceedings of TREC.
Curran, J., & Clark, S. (2003). Language independent NER using a maximum entropy tagger. In Proceedings of CoNLL-2003 (pp. 164–167). Edmonton, Canada.
Dayhoff, M., Schwartz, R., & Orcutt, B. (1978). A model of evolutionary change in proteins. In Atlas of protein sequence and structure (Vol. 5, Suppl. 3, pp. 345–352). Washington D.C.: National Biomedical Research Foundation.
De Chalendar, G., Dalmas, T., Elkateb-Gara, F., Ferret, O., Grau, B., Hurault-Planet, M., et al. (2003). The question answering system QALC at LIMSI: Experiments in using Web and WordNet. NIST Special Publication SP.
Dumais, S., Banko, M., Brill, E., Lin, J., & Ng, A. (2001). Data-intensive question answering. In Proceedings of the tenth text retrieval conference (TREC 2001). Gaithersburg, Maryland, November 2001.
Dumais, S., Banko, M., Brill, E., Lin, J., & Ng, A. (2002). Web question answering: Is more always better? In Proceedings of SIGIR-2002.
Echihabi, A., Hermjakob, U., Hovy, E., Marcu, D., Melz, E., & Ravichadran, D. (2004). How to select an answer String? In Advances in textual question answering. Deventer: Kluwer.
Echihabi, A., & Marcu, D. (2003). A noisy-channel approach to question answering. In Proceedings of the 41st annual meeting of the association for computational linguistics (pp. 16–23). July 2003.
Eddy, S. (2004). Where did the BLOSUM62 alignment score matrix come from? Nature Biotechnology, 22(8), 1035–1036.
Figueroa, A., & Neumann, G. (2006). Language independent answer prediction from the web. In Proceedings of the FinTAL 5th international conference on natural language processing. Turku, Finland, 23–25 August.
Gusfield, D. (1997). Algorithms on strings, trees, and sequences. Cambridge: Cambridge University Press.
Henikoff, S., & Henikoff, J. (1992). Amino acid substitution matrices from protein blocks. Proceedings of the National Academy of Sciences of the United States of America, 89(2), 10915–10919.
Lita, L., & Carbonell, J. (2004a). Unsupervised question answering data acquisition from local corpora. In Proceedings of the thirteenth conference on information and knowledge management (CIKM 2004). Washington, DC, USA, 8–13 November.
Lita, L., & Carbonell, J. (2004b). Instance-based question answering: A data driven approach. In Proceedings of EMNLP.
Luhn, H. (1958). The automatic creation of literature abstracts. IBM Journal of Research and Development, 2, 159–165.
Moldovan, D., Harabagui, S., Clark, C., Bowden, M., & Lehmann, J. (2004). Experiments and analysis of LCC’s two QA systems over TREC 2004. TREC 2004.
Monz, C. (2003). From document retrieval to question answering. IILC Dissertation Series DS-2003-4. Institute for Logic, Language and Computation, University of Amsterdam.
Needleman, N., & Wunsch, J. (1970). A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology, 48, 443–453.
Radev, D., Qi, H., Zheng, Z., Blair-Goldensohn, S., Zhang, Z., & Fan, W. (2001). Mining the web for answers to natural language questions. In Proceedings of the tenth international conference on information and knowledge management table of contents (pp. 143–150). Atlanta, Georgia, USA.
Radev, D., Libner, K., & Fan, W. (2002). Getting answers to natural language questions on the web. Journal of the American Society for Information Science and Technology, 53(5), 359–364.
Rijke, M., & Monz, C. (2002). Tequesta: The University of Amsterdam’s Textual Question Answering System. NIST Special Publication SP.
Robertson, S. (2004). Understanding inverse document frequency: On theoretical arguments for IDF. Journal of Documentation, 60(5), 503–520.
Savary, A., & Jacquemin, C. (2000). Reducing information variation in text. In ELSNET Summer School (pp. 145–181).
Smith, T., & Waterman, M. (1981). Identification of common molecular subsequences. Journal of Molecular Biology, 147, 195–197.
van Rijsbergen, C. (1979). Information Retreival. Toronto: Butterworths.
Voorhees, E. (2000). Overview of trec-9 question-answering track. In The ninth text retrieval conference (TREC-9). Gaitherburg, USA.
Waterman, M. (1994). Estimating statistical of sequence alignments. In Philosophy transactions of the royal society of London, (No. 344, pp. 383–390).
Wei, K. (2005). Improving answer precision and recall of list questions. MSc thesis, School of Informatics, University of Edinburgh, UK.
Zipf, H. (1949). Human behaviour and the principle of the least effort. Cambridge: Addison-Wesley.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was partially supported by the National Council for Scientific and Technological Research (FONDECYT, Chile) under grant number 1070714: “An Interactive Natural-Language Dialogue Model for Intelligent Filtering based on Patterns Discovered from Text Documents”
Rights and permissions
About this article
Cite this article
Figueroa, A., Atkinson, J. Intelligent answering location questions from the web using molecular alignment. J Intell Inf Syst 35, 75–90 (2010). https://doi.org/10.1007/s10844-009-0089-4
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10844-009-0089-4