Abstract
This paper describes query generation using semantic features to represent users' information needs for question answering and information retrieval. A fundamental reason why information retrieval systems return unwanted results is that queries do not exactly represent the user's information need. To address this problem, our query generation extracts semantic features that commonly appear in natural language questions of a similar type and uses them for question answering and information retrieval. We extract semantic features from natural language questions using a grammar, and generate queries that adequately represent the user's information need from these semantic features and syntactic structures. To improve the performance of question answering and information retrieval, we introduce a query-document similarity measure that ranks documents containing the generated queries in high positions. We evaluated our mechanism using 100 queries about people on the web. Precision at N documents improved notably when our approach was applied. In particular, we found that efficient document retrieval is possible through semantic-feature-based analysis of natural language questions, which are comparatively short but fully express the user's information need.
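For readers unfamiliar with the evaluation measure named above, the following is a minimal sketch of precision at N under its standard definition: the fraction of the top N ranked documents that are relevant. It is not the authors' evaluation code; the function name `precision_at_n`, the document identifiers, and the relevance judgments are hypothetical and chosen only for illustration.

```python
from typing import Sequence, Set


def precision_at_n(ranked_doc_ids: Sequence[str], relevant_ids: Set[str], n: int) -> float:
    """Return the fraction of the top-n ranked documents that are relevant.

    Assumption: the denominator is always n, even when fewer than n
    documents were retrieved (a common convention for this measure).
    """
    if n <= 0:
        raise ValueError("n must be positive")
    top_n = ranked_doc_ids[:n]
    hits = sum(1 for doc_id in top_n if doc_id in relevant_ids)
    return hits / n


# Hypothetical ranking returned for one question, best match first.
ranked = ["d3", "d7", "d1", "d9", "d4"]
# Hypothetical set of documents judged relevant for that question.
relevant = {"d3", "d1", "d8"}

for n in (1, 2, 5):
    print(f"P@{n} = {precision_at_n(ranked, relevant, n):.2f}")
```

Averaging this value over all evaluation questions gives a single figure per cutoff N, which is how such a measure is typically reported.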
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shin, SE., Seo, YH. (2006). Query Generation Using Semantic Features. In: Sugimoto, S., Hunter, J., Rauber, A., Morishima, A. (eds) Digital Libraries: Achievements, Challenges and Opportunities. ICADL 2006. Lecture Notes in Computer Science, vol 4312. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11931584_26
DOI: https://doi.org/10.1007/11931584_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49375-4
Online ISBN: 978-3-540-49377-8
eBook Packages: Computer Science, Computer Science (R0)