Abstract
In this paper we compare four selection strategies in evolutionary optimization of information retrieval (IR) in a question answering setting. The IR index has been augmented by linguistic features to improve the retrieval performance of potential answer passages using queries generated from natural language questions. We use a genetic algorithm to optimize the selection of features and their weights when querying the IR database. With our experiments, we can show that the genetic algorithm applied is robust to strategy changes used for selecting individuals. All experiments yield query settings with improved retrieval performance when applied to unseen data. However, we can observe significant runtime differences when applying the various selection approaches which should be considered when choosing one of these approaches.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Tellex, S., Katz, B., Lin, J., Fernandes, A., Marton, G.: Quantitative evaluation of passage retrieval algorithms for question answering. In: Proceedings of the SIGIR conference on Research and development in information retrieval, pp. 41–47. ACM Press, New York (2003)
Roberts, I., Gaizauskas, R.: Evaluating passage retrieval approaches for question answering. In: Proceedings of 26th European Conference on Information Retrieval (2004)
Katz, B., Lin, J.: Selectively using relations to improve precision in question answering. In: Proceedings of the EACL 2003 Workshop on Natural Language Processing for Question Answering (2003)
Bouma, G., Mur, J., van Noord, G., van der Plas, L., Tiedemann, J.: Linguistic knowledge and question answering (Special Issue on Question Answering Systems). Traitement Automatique des Langues 46(3), 15–39 (2005)
Bouma, G., Van Noord, G., Malouf, R.: Alpino: Wide coverage computational analysis of Dutch. In: Computational Linguistics in the Netherlands CLIN, 2000, Rodopi (2001)
Apache: Lucene - a high-performance, full-featured text search engine library (2004), http://lucene.apache.org/java/docs/index.html
Chen, H.: Machine learning for information retrieval: Neural networks, symbolic learning, and genetic algorithms. Journal of the American Society for Information Science 46(3), 194–216 (1995)
Fan, W., Gordon, M.D., Pathak, P.: A generic ranking function discovery framework by genetic programming for information retrieval. Inf. Process. Manage. 40(4), 587–602 (2004)
Horng, J.-T., Yeh, C.-C.: Applying genetic algorithms to query optimization in document retrieval. Information Processing and Management 36(5), 737–759 (2000)
Boughanem, M., Chrisment, C., Tamine, L.: On using genetic algorithms for multimodal relevance optimization in information retrieval. Journal of the American Society for Information Science and Technology 53(11), 934–942 (2002)
Billhardt, H., Borrajo, D., Maojo, V.: Learning retrieval expert combinations with genetic algorithms. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 11(1), 87–113 (2003)
Trotman, A.: Choosing document structure weights. Information Processing and Management 41(2), 243–264 (2005)
López-Pujalte, C., Bote, V.P.G., de Moya Anegón, F.: A test of genetic algorithms in relevance feedback. Inf. Process. Manage. 38(6), 793–805 (2002)
Cordón, O., Moya, F., Zarco, C.: A new evolutionary algorithm combining simulated annealing and genetic programming for relevante in fuzzy information retrieval systems. Soft Computing 6, 308–319 (2002)
López-Pujalte, C., Guerrero-Bote, V.P., de Moya-Anegón, F.: Genetic algorithms in relevance feedback: a second test and new contributions. Inf. Process. Manage. 39(5), 669–687 (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tiedemann, J. (2007). A Comparison of Genetic Algorithms for Optimizing Linguistically Informed IR in Question Answering. In: Basili, R., Pazienza, M.T. (eds) AI*IA 2007: Artificial Intelligence and Human-Oriented Computing. AI*IA 2007. Lecture Notes in Computer Science(), vol 4733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74782-6_35
Download citation
DOI: https://doi.org/10.1007/978-3-540-74782-6_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74781-9
Online ISBN: 978-3-540-74782-6
eBook Packages: Computer ScienceComputer Science (R0)