Skip to main content
Log in

Intelligent answering location questions from the web using molecular alignment

  • Published:
Journal of Intelligent Information Systems Aims and scope Submit manuscript

Abstract

In this paper, a new molecular alignment based recognition method for question answering from from the Web is proposed. This identifies locations using an molecular alignment sequence algorithm according to their similarity with a user natural-language question. Different experiments and results concerning questions on locations are discussed. The high accuracy of the proposed alignment strategy shows the promise of approach to effectively deal with questions extracted from natural-language corpus which contain many complex patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

Notes

  1. In the scope of this baseline, the underlying language model is a unigram model.

References

  • Ahn, K., Bos, J., Curran, J. R., Kor, D., Nissim, M., & Webber, B. (2005). Question answering with QED at TREC-2005. In Proceedings of the fourteenth text retrieval conference (TREC 2005).

  • Benner, S., Cohen, M., & Gonnet, G. (1993). Empirical and structural models for insertions and deletions in the divergent evolution of proteins. Journal of Molecular Biology, 229, 1065–1082.

    Article  Google Scholar 

  • Chen, J., Ge, H., Wu, Y., & Jiang, S. (2004). Question answering combining multiple evidences. In Proceedings of TREC.

  • Curran, J., & Clark, S. (2003). Language independent NER using a maximum entropy tagger. In Proceedings of CoNLL-2003 (pp. 164–167). Edmonton, Canada.

  • Dayhoff, M., Schwartz, R., & Orcutt, B. (1978). A model of evolutionary change in proteins. In Atlas of protein sequence and structure (Vol. 5, Suppl. 3, pp. 345–352). Washington D.C.: National Biomedical Research Foundation.

    Google Scholar 

  • De Chalendar, G., Dalmas, T., Elkateb-Gara, F., Ferret, O., Grau, B., Hurault-Planet, M., et al. (2003). The question answering system QALC at LIMSI: Experiments in using Web and WordNet. NIST Special Publication SP.

  • Dumais, S., Banko, M., Brill, E., Lin, J., & Ng, A. (2001). Data-intensive question answering. In Proceedings of the tenth text retrieval conference (TREC 2001). Gaithersburg, Maryland, November 2001.

  • Dumais, S., Banko, M., Brill, E., Lin, J., & Ng, A. (2002). Web question answering: Is more always better? In Proceedings of SIGIR-2002.

  • Echihabi, A., Hermjakob, U., Hovy, E., Marcu, D., Melz, E., & Ravichadran, D. (2004). How to select an answer String? In Advances in textual question answering. Deventer: Kluwer.

    Google Scholar 

  • Echihabi, A., & Marcu, D. (2003). A noisy-channel approach to question answering. In Proceedings of the 41st annual meeting of the association for computational linguistics (pp. 16–23). July 2003.

  • Eddy, S. (2004). Where did the BLOSUM62 alignment score matrix come from? Nature Biotechnology, 22(8), 1035–1036.

    Article  Google Scholar 

  • Figueroa, A., & Neumann, G. (2006). Language independent answer prediction from the web. In Proceedings of the FinTAL 5th international conference on natural language processing. Turku, Finland, 23–25 August.

  • Gusfield, D. (1997). Algorithms on strings, trees, and sequences. Cambridge: Cambridge University Press.

    MATH  Google Scholar 

  • Henikoff, S., & Henikoff, J. (1992). Amino acid substitution matrices from protein blocks. Proceedings of the National Academy of Sciences of the United States of America, 89(2), 10915–10919.

    Article  Google Scholar 

  • Lita, L., & Carbonell, J. (2004a). Unsupervised question answering data acquisition from local corpora. In Proceedings of the thirteenth conference on information and knowledge management (CIKM 2004). Washington, DC, USA, 8–13 November.

  • Lita, L., & Carbonell, J. (2004b). Instance-based question answering: A data driven approach. In Proceedings of EMNLP.

  • Luhn, H. (1958). The automatic creation of literature abstracts. IBM Journal of Research and Development, 2, 159–165.

    Article  MathSciNet  Google Scholar 

  • Moldovan, D., Harabagui, S., Clark, C., Bowden, M., & Lehmann, J. (2004). Experiments and analysis of LCC’s two QA systems over TREC 2004. TREC 2004.

  • Monz, C. (2003). From document retrieval to question answering. IILC Dissertation Series DS-2003-4. Institute for Logic, Language and Computation, University of Amsterdam.

  • Needleman, N., & Wunsch, J. (1970). A general method applicable to the search for similarities in the amino acid sequence of two proteins. Journal of Molecular Biology, 48, 443–453.

    Article  Google Scholar 

  • Radev, D., Qi, H., Zheng, Z., Blair-Goldensohn, S., Zhang, Z., & Fan, W. (2001). Mining the web for answers to natural language questions. In Proceedings of the tenth international conference on information and knowledge management table of contents (pp. 143–150). Atlanta, Georgia, USA.

  • Radev, D., Libner, K., & Fan, W. (2002). Getting answers to natural language questions on the web. Journal of the American Society for Information Science and Technology, 53(5), 359–364.

    Article  Google Scholar 

  • Rijke, M., & Monz, C. (2002). Tequesta: The University of Amsterdam’s Textual Question Answering System. NIST Special Publication SP.

  • Robertson, S. (2004). Understanding inverse document frequency: On theoretical arguments for IDF. Journal of Documentation, 60(5), 503–520.

    Article  Google Scholar 

  • Savary, A., & Jacquemin, C. (2000). Reducing information variation in text. In ELSNET Summer School (pp. 145–181).

  • Smith, T., & Waterman, M. (1981). Identification of common molecular subsequences. Journal of Molecular Biology, 147, 195–197.

    Article  Google Scholar 

  • van Rijsbergen, C. (1979). Information Retreival. Toronto: Butterworths.

    Google Scholar 

  • Voorhees, E. (2000). Overview of trec-9 question-answering track. In The ninth text retrieval conference (TREC-9). Gaitherburg, USA.

  • Waterman, M. (1994). Estimating statistical of sequence alignments. In Philosophy transactions of the royal society of London, (No. 344, pp. 383–390).

  • Wei, K. (2005). Improving answer precision and recall of list questions. MSc thesis, School of Informatics, University of Edinburgh, UK.

  • Zipf, H. (1949). Human behaviour and the principle of the least effort. Cambridge: Addison-Wesley.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to John Atkinson.

Additional information

This research was partially supported by the National Council for Scientific and Technological Research (FONDECYT, Chile) under grant number 1070714: “An Interactive Natural-Language Dialogue Model for Intelligent Filtering based on Patterns Discovered from Text Documents”

Rights and permissions

Reprints and permissions

About this article

Cite this article

Figueroa, A., Atkinson, J. Intelligent answering location questions from the web using molecular alignment. J Intell Inf Syst 35, 75–90 (2010). https://doi.org/10.1007/s10844-009-0089-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10844-009-0089-4

Keywords

Navigation